Will Anthropic’s next Sonnet model exceed 65% on terminal bench? | Axel

Em alta Urgente Novo

Política Esportes Cripto Finanças Geopolítica Resultados Tecnologia Cultura Mundo Economia Eleições

Will Anthropic’s next Sonnet model exceed 65% on terminal bench?

8

100Ṁ4873

Dec 31

10%

chance

1H

6H

1D

1W

1M

ALL

Will be looking toward https://www.tbench.ai/ for evals, using the terminus 2 scaffolding.

Only counts if the number in the model’s name increments, so a new Claude Sonnet 4.5 checkpoint does not count.

If a new Sonnet model is not released by 2027 this will resolve NA

Get

1,000

to start trading!

Ordenar por:

@JaundicedBaboon I don't think anybody is going to test Sonnet 4.6 on Termial Bench, Anthropic had Sonnet 4.6's Terminal-Bench 2.0 score at 59.1%, but nobody has submitted its results to the leaderboard yet. I don't know if you think Anthropic's results are good enough of if you want to continue waiting for somebody to submit results to the leaderboard.

Pessoas também estão operando

Which month will Anthropic release Sonnet 5?

Will Claude Sonnet 5 exceed 85% on SWE-bench verified?

Will Anthropic release an open-weights model in 2026?

Will Anthropic release a first-party image generation model in 2026?

Will Anthropic ever drop out of the capabilities race despite ability to continue?

Will a frontier model score above 90% on the APEX-SWE benchmark before 2028?

Perguntas relacionadas

Which month will Anthropic release Sonnet 5?

Will Claude Sonnet 5 exceed 85% on SWE-bench verified?

Will Anthropic release an open-weights model in 2026?

Will Anthropic release a first-party image generation model in 2026?

Will Anthropic ever drop out of the capabilities race despite ability to continue?

Will a frontier model score above 90% on the APEX-SWE benchmark before 2028?

© Predita Markets, Inc.•Termos de Uso•Privacidade