Trader consensus on Polymarket reflects tempered optimism that OpenAI's GPT models will achieve a breakthrough score on Humanity's Last Exam, a rigorous 2,500-question benchmark spanning over 100 expert-level subjects, by June 30, 2026, amid rapid but uneven progress. As of late April, OpenAI's GPT-5.4 trails Google's Gemini 3.1 Pro Preview (41.6% vs. 44.7%), with no-tools scores hovering below 45% even though tools boost performance to 58% in some evaluations; early 2025 models like GPT-4o scored just 2.7%. OpenAI's March GPT-5.4 release drove a 6-8% gain in two months, fueled by enhanced reasoning chains, but competitive pressure from Anthropic's Claude and Google's iterations tempers expectations. Key catalysts include a potential GPT-5.5 rollout or o1 successor previews at upcoming developer events, though benchmark saturation risks and scaling hurdles could delay superhuman thresholds.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves.
$21,129 Vol.
50%+
41%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market Opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...


Beware of external links.
Frequently Asked Questions