Will a multi-agent system have its time horizon evaluated by METR before August 2026?
10
1ká¹€1212Jul 31
37%
chance
1H
6H
1D
1W
1M
ALL
METR's time horizon evaluation: https://metr.org/time-horizons/
Some existing multi-agent systems: GPT-5.2 Pro, Grok 4 Heavy, Gemini 3 Deep Think.
This market doesn't count "regular" models being able to spawn subagents. For example, if the reported evaluated model is Claude Opus 4.6, but the evaluation was made within Claude Code where Claude Opus 4.6 could spawn some Claude Sonnet 4.6 subagents, this does not count for the purpose of this market.
Esta pergunta é gerenciada e resolvida pela Predita.
Get
1,000 to start trading!
Ordenar por:
GPT-5.2-Pro is actually available via API right now which I think should simplify the evaluation process quite a lot
Pessoas também estão operando
Perguntas relacionadas
Best AI time horizon by February 2026, per METR?
Will the METR AI coding uplift market be resolved based on the Feb 2026 updated estimate?
19% chance
Best AI time horizon by August 2026, per METR?
Will the METR 50% Time Horizon be "ambiguous" at the end of 2026?
62% chance
Best METR 50% Time Horizon in 2026
Will METR retire the 50% Time Horizon by EOY 2026
40% chance
R2 / V4-Thinking METR 50% time horizon
Polymarket has some METR time horizon market before August 2026?
57% chance
What will be the METR time horizon doubling time in 2026?
What will the frontier METR time horizon be on January 1, 2027?