OpenAI's freshly released GPT-5.5 model, launched April 24, 2026, has achieved approximately 43% accuracy on Humanity's Last Exam—a rigorous benchmark comprising 2,500 expert-level, multi-modal questions spanning over 100 subjects in math, science, and humanities, designed to test frontier artificial intelligence capabilities. This marks substantial progress from GPT-4o's sub-10% score in 2024, driven by scaling laws and enhanced reasoning chains, yet trails Google DeepMind's leading Gemini 3.1 Pro Preview at 45.8%, intensifying competitive dynamics among top AI labs. With the June 30 deadline two months away, traders eye potential GPT-5.6 previews or tool-augmented variants that could push scores above 50%, amid uncertain release timelines and regulatory scrutiny on AI safety benchmarks.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado$21,129 Vol.
50%+
39%
$21,129 Vol.
50%+
39%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Mercado abierto: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...OpenAI's freshly released GPT-5.5 model, launched April 24, 2026, has achieved approximately 43% accuracy on Humanity's Last Exam—a rigorous benchmark comprising 2,500 expert-level, multi-modal questions spanning over 100 subjects in math, science, and humanities, designed to test frontier artificial intelligence capabilities. This marks substantial progress from GPT-4o's sub-10% score in 2024, driven by scaling laws and enhanced reasoning chains, yet trails Google DeepMind's leading Gemini 3.1 Pro Preview at 45.8%, intensifying competitive dynamics among top AI labs. With the June 30 deadline two months away, traders eye potential GPT-5.6 previews or tool-augmented variants that could push scores above 50%, amid uncertain release timelines and regulatory scrutiny on AI safety benchmarks.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado
Cuidado con los enlaces externos.
Cuidado con los enlaces externos.
Preguntas frecuentes