Gemini 3 Pro METR 50% time horizon
19
1káč€14k
resolved Feb 3
100%96%
3.5h - 4h
0.1%
<2h
0.1%
2h - 2.5h
0.1%
2.5h - 3h
0.1%
3h - 3.5h
3%
4h - 4.5h
0.1%
4.5h - 5h
0.1%
5h - 5.5h
0.1%
5.5h - 6h
0.1%Other

This market will resolve to the first 50% time horizon, as reported by METR, of Gemini 3 Pro. If Gemini 3 Pro Preview is evaluated first, the market resolves to Gemini 3 Pro Preview's 50% time horizon. If Gemini 3 Pro GA gets evaluated first, this evaluation determines the market's resolution.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Left bounds inclusive, right bounds exclusive.

See also:

/jim/gpt-52-metr

/Bayesian/gpt-52-pro-metr-time-horizon

/Bayesian/gemini-3s-50-time-horizon-per-metr (Pro Preview version of this market)

/Bayesian/gemini-3-pro-metr-50-time-horizon (this market)

/Bayesian/claude-sonnet-46s-metr-50-time-hori

/Bayesian/claude-sonnet-5-metr-50-time-horizo

/Bayesian/claude-opus-5-metr-50-time-horizon

/Bayesian/grok-420s-metr-50-time-horizon

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr

/Bayesian/kimi-k3-thinkings-metr-50-time-hori

Get
áč€1,000
to start trading!

🏅 Top traders

#NameTotal profit
1áč€3,382
2áč€1,589
3áč€126
4áč€71
5áč€47
Ordenar por:

Hi, everyone. I am a bit confused by the resolution of the question. I am not able to locate Gemini 3 in METR. Am I not looking in place?

@MaxLennartson

https://metr.org/assets/benchmark_results_1_1.yaml

gemini_3_pro: benchmark_name: METR-Horizon-v1.1 metrics: average_score: estimate: 0.709822 is_sota: true p50_horizon_length: ci_high: 444.530184 ci_low: 134.565523 estimate: 236.654674 p80_horizon_length: ci_high: 78.496574 ci_low: 21.399446 estimate: 43.435618 usage: usd: 0.0 working_time: 52126.13216666666 release_date: 2025-11-18

@bens Thanks for the information. I don’t see Gemini 3 on either the old or new METR graph.

@MaxLennartson it’s so close to gpt5.1’s time horizon that it doesnt show up in the graph bc it’s covered completely

@Bayesian Okay. Thanks for the clarification.

comprou áč€50 YES

Not sure why 4-4.5 was bought up to 99%. I think there's a decent chance the exact value lies just under 4 hours if you look at the plot closely. METR's tweet said "around 4 hours" lol

vendeu áč€1,491 YES

@bens lmao

comprou áč€3,000 YES

the exact time is available

it's 236.654674 minutes or 3.94something hours

i'm sorry braydon their announcement was an approximate for some reason

in their defense they said "Gemini 3 Pro has a 50%-time-horizon of around 4 hrs"

@Bayesian oh wow hahaha I was looking for it but hadn't been able to find the raw data link, I was too slow

@bens METR reported “about 4 hours” which I interpreted as the same as reporting “4 hours”.

No other market in this category have you had to dig into the raw data of a yaml file.

@BraydonDymm I mean, you're welcome to wait until they update the little dot on their website, but it's a couple minutes short of 4 hours

yeah no other announcement has been this close to a full number for them to choose to round up in the announcement like this

@bens right, I understand that’s what the raw data shows. I just had a different interpretation of what it means for METR to report the time.

comprouáč€25 YES

@prismatic wanna bet more? limit order up

@Bayesian absolutely not, im a horrible judge of METR times

aberto a áč€50,000 NO at 30% order

@prismatic 😭

© Predita Markets, Inc.‱Termos de Uso‱Privacidade