Anthropic’s Claude Fable 5 and Opus 4.7/4.8 variants currently anchor the top of live Code Arena and WebDev leaderboards with Elo scores in the 1560–1665 range after May releases emphasizing agentic workflows and tool use. These models widened the gap over GPT-5.5, Gemini 3.1, and Qwen 3.7 through superior SWE-Bench Pro and multi-step coding performance, reflecting trader consensus that incremental gains alone are unlikely to push any model past higher resolution thresholds before June 30. No frontier lab has announced imminent releases or benchmark jumps in the past two weeks, leaving only 16 days for surprise updates or rapid fine-tunes to alter the outcome.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado$11,100 Vol.
1550
13%
1560
9%
1570
7%
$11,100 Vol.
1550
13%
1560
9%
1570
7%
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Mercado abierto: Apr 2, 2026, 6:09 PM ET
Resolver
0x65070BE91...Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Resolver
0x65070BE91...Anthropic’s Claude Fable 5 and Opus 4.7/4.8 variants currently anchor the top of live Code Arena and WebDev leaderboards with Elo scores in the 1560–1665 range after May releases emphasizing agentic workflows and tool use. These models widened the gap over GPT-5.5, Gemini 3.1, and Qwen 3.7 through superior SWE-Bench Pro and multi-step coding performance, reflecting trader consensus that incremental gains alone are unlikely to push any model past higher resolution thresholds before June 30. No frontier lab has announced imminent releases or benchmark jumps in the past two weeks, leaving only 16 days for surprise updates or rapid fine-tunes to alter the outcome.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado
Cuidado con los enlaces externos.
Cuidado con los enlaces externos.
Preguntas frecuentes