OpenAI's GPT-5.4 Pro claimed the FrontierMath record in March 2026, scoring 50% on Tiers 1-3 (undergraduate-to-postdoc math problems) and 38% on research-level Tier 4. GPT-5.5 Pro pushed Tier 4 to 39.6%, well ahead of rivals such as Anthropic's Claude Opus 4.7 at 23%. These gains underscore OpenAI's edge in large language model reasoning via scaled compute and chain-of-thought techniques, though Epoch AI notes that progress is slowing as scores near 40% on elite benchmarks. Trader consensus reflects optimism that a new model iteration will arrive before June 30 amid rapid 2026 releases, but uncertainty lingers over technical hurdles such as novel proof generation. Key catalysts include OpenAI previews and Epoch updates.
Experimental AI-generated summary based on Polymarket data. This is not trading advice and does not influence how this market resolves.
$31,421 Vol.
60%+: 44%
70%+: 7%
This market will resolve according to Epoch AI's FrontierMath benchmarking leaderboard (https://epoch.ai/frontiermath) for Tiers 1-3. Studies that are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from Epoch AI; however, a consensus of credible reporting may also be used.
Market opened: Jan 29, 2026, 12:47 PM ET
Resolver
0x65070BE91...
Frequently asked questions