Google DeepMind's latest model, Gemini 2.5 Pro, has reached the #1 position on the Arena leaderboard. The model opened a notable 40-point lead over its closest competitors, Grok-3 and GPT-4.5, marking the largest jump ever seen on this leaderboard.

Strong performance under the codename "Nebula"
Tested under the codename "Nebula", Gemini 2.5 Pro ranked first in every category evaluated on the Arena leaderboard. It stood out particularly in math, creative writing, instruction following, longer queries and multi-turn conversations, holding the #1 spot alone in each of these areas. This shows the model's ability to handle a wide range of tasks, from solving complex mathematical problems to keeping conversations consistent across multiple turns.
The Arena leaderboard, run by LMArena.ai (previously lmsys.org), measures how well AI models perform according to human preferences, which makes Gemini 2.5 Pro's top ranking a clear sign of its quality and versatility. The 40-point lead over competitors such as xAI's Grok-3 and OpenAI's GPT-4.5 highlights its strong performance.
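To give a rough sense of what a 40-point lead means in head-to-head votes, here is a minimal sketch. It assumes the Arena score sits on an Elo-style scale (a logistic curve with base 10 and a 400-point scale) and ignores ties; the function name `expected_win_rate` is purely illustrative and not part of any LMArena tooling.

```python
def expected_win_rate(score_gap: float, scale: float = 400.0) -> float:
    """Expected share of pairwise votes won by the higher-rated model.

    Assumption: scores follow the standard Elo logistic curve
    (base 10, 400-point scale). Ties are not modeled.
    """
    return 1.0 / (1.0 + 10.0 ** (-score_gap / scale))


if __name__ == "__main__":
    # Compare an even match-up, the 40-point lead from the article,
    # and a larger hypothetical gap for reference.
    for gap in (0, 40, 100):
        rate = expected_win_rate(gap)
        print(f"{gap:>3}-point gap -> {rate:.1%} expected win rate")
```

Under these assumptions, a 40-point gap corresponds to the leading model being preferred in roughly 56% of direct comparisons, so the lead is meaningful but far from a sweep.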
A victory for Google DeepMind
Google DeepMind described Gemini 2.5 Pro as its smartest model, and it performs well on math, science and coding tasks. For example, it scored 18.8% on Humanity's Last Exam, a hard test of knowledge and reasoning, and showed improvements in coding, such as building web apps and games.
What is Gemini 2.5 Pro?
Gemini 2.5 Pro, the newest Google DeepMind model, improves on previous models in performance, efficiency and capabilities. As part of the Gemini 2.5 series, this Pro version offers a cost-effective balance of power for developers and businesses.
- Multimodal support: It handles text, images, video, audio and code, which makes it versatile across domains.
- Advanced reasoning: It analyzes information methodically to give more accurate, context-aware responses.
- Larger context window: It supports 1 million tokens, with plans to expand to 2 million.
- Better coding: It offers improved code generation and assistance for developers.
- Updated knowledge: Trained on data up to January 2025.
- Availability: Coming soon to Vertex AI.
For more details about the model, see our in-depth guide on Gemini 2.5 Pro.
Looking to the future
The success of Gemini 2.5 Pro on the Arena leaderboard highlights its strengths in reasoning, coding and handling complex tasks. It also raises questions about how other AI companies, such as OpenAI and xAI, might respond. For now, Gemini 2.5 Pro's performance sets a new standard, and it will be interesting to see how it shapes the future of AI development.
For more information, see the complete thread on X in LMArena.ai's post: https://x.com/lmarena_ai/status/1904581128746656099