Gemini 3.1 Pro, Google's latest flagship model released in February 2026, scored comparably to Gemini 3 Pro on the FrontierMath benchmark—around 38% on Tiers 1-3 and 17% on Tier 4—lagging leaders like OpenAI's GPT-5.5 (35% Tier 4) and Anthropic's Claude Opus 4.7 amid intensifying AI math reasoning competition. These incremental advances stem from Deep Think upgrades enhancing scientific problem-solving, but no major FrontierMath leaps have occurred in the past month. Traders focus on Google I/O in May 2026 for potential Gemini 4 previews or evals, as model releases and third-party verifications could drive scores toward higher thresholds by June 30, though timelines often slip.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves. · Updated$130,707 Vol.
40%+
48%
45%+
50%
50%+
46%
60%+
13%
$130,707 Vol.
40%+
48%
45%+
50%
50%+
46%
60%+
13%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Market Opened: Feb 6, 2026, 6:03 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...Gemini 3.1 Pro, Google's latest flagship model released in February 2026, scored comparably to Gemini 3 Pro on the FrontierMath benchmark—around 38% on Tiers 1-3 and 17% on Tier 4—lagging leaders like OpenAI's GPT-5.5 (35% Tier 4) and Anthropic's Claude Opus 4.7 amid intensifying AI math reasoning competition. These incremental advances stem from Deep Think upgrades enhancing scientific problem-solving, but no major FrontierMath leaps have occurred in the past month. Traders focus on Google I/O in May 2026 for potential Gemini 4 previews or evals, as model releases and third-party verifications could drive scores toward higher thresholds by June 30, though timelines often slip.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves. · Updated


Beware of external links.
Beware of external links.
Frequently Asked Questions