Google's Gemini 3.1 Pro and Deep Think variants maintain a clear lead in mathematical reasoning benchmarks, including strong MATH, AIME, GPQA Diamond, and IMO-level results plus autonomous novel proof generation through the Aletheia system, driving the 65.5% market-implied odds. Anthropic's Claude Opus releases deliver competitive multi-step and agentic performance that supports its 27.5% share, while OpenAI's GPT-5 series excels in select abstract tasks yet trails on consistent math-specific evaluations. These probabilities aggregate trader assessments of verified benchmark leadership and model releases over recent months, with resolution at end of June turning on any final capability demonstrations before the cutoff.
Experimental AI-generated summary using Polymarket data. This is not trading advice and does not influence how this market resolves. · Updated
Which company has the best Math AI model end of June?
Google 66%
Anthropic 28%
OpenAI 9%
Z.ai 1.8%
$51,134 Vol.

Google
66%

Anthropic
28%

OpenAI
9%

Z.ai
2%

Baidu
<1%

ByteDance
<1%

Mistral
<1%

xAI
<1%

Alibaba
<1%

Amazon
<1%

Meta
<1%

Moonshot
<1%

DeepSeek
<1%

Microsoft
<1%

Meituan
<1%
Results from the "Rank" column under the "Text Arena | Math" Leaderboard tab at https://arena.ai/leaderboard/text/math-no-style-control with style control off will be used to resolve this market.
Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of company names as listed in this market group will be used as a final tiebreaker (e.g., if the two models are tied by exact arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
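The ordering rule described above (rank first, then unrounded Arena score, then alphabetical company name) can be sketched as a single sort key. This is a minimal illustration, not the market's actual resolution code: the `Entry` type, the model labels, and the sample scores are hypothetical, and treating the alphabetical comparison as case-insensitive is an assumption the rules do not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    company: str   # company name as listed in the market group
    model: str     # hypothetical model label, for illustration only
    rank: int      # leaderboard "Rank" column (1 is best)
    score: float   # unrounded Arena score (higher is better)


def resolution_order(entries):
    """Order entries by rank, then higher score, then company name A-Z."""
    # Assumption: alphabetical comparison ignores case ("Google" before "xAI").
    return sorted(entries, key=lambda e: (e.rank, -e.score, e.company.casefold()))


def winning_company(entries):
    """Return the company in first place under the ordering above."""
    return resolution_order(entries)[0].company


# Hypothetical data: three models share rank 1, two share the exact score.
sample = [
    Entry("xAI", "model-a", 1, 1500.25),
    Entry("Google", "model-b", 1, 1500.25),
    Entry("Anthropic", "model-c", 1, 1499.90),
]

print(winning_company(sample))
```

With this sample, the two entries tied on both rank and score fall through to the alphabetical tiebreaker, which mirrors the "Google" versus "xAI" example in the rules.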
Market opened: May 26, 2026, 6:36 PM ET
Resolver
0x69c47De9D...
Be careful with external links.
Frequently asked questions