Google Research has introduced Gemini-SQL2, a text-to-SQL system built on the Gemini 3.1 Pro model. This system translates natural language into executable SQL queries, with notable performance on the BIRD benchmark. According to Google, Gemini-SQL2 achieved an execution accuracy of 80.04%, placing it at the top of the leaderboard. This outperforms OpenAI's GPT-5.5-xhigh, which scored 72.8%, and Anthropic's Claude Opus 4.6, which scored 70.9%. Models from Databricks, AWS, Tencent, and Alibaba all scored significantly lower.

The performance highlights the model's ability to generate accurate and functional SQL queries, a task that is particularly challenging due to the complexity of data layers and business logic. The research team has not announced a public release of the model, and there is no paper available yet. Source: thedecoder