BIG-bench accuracy 75% #4: Will SOTA for a single model on BIG-bench pass 75% by the start of 2027? | Axel

Em alta Urgente Novo

Política Esportes Cripto Finanças Geopolítica Resultados Tecnologia Cultura Mundo Economia Eleições

BIG-bench accuracy 75% #4: Will SOTA for a single model on BIG-bench pass 75% by the start of 2027?

4

90Ṁ132

2027

86%

chance

1H

6H

1D

1W

1M

ALL

Benchmark
Only the sub benchmarks that are scored as an accuracy (i.e. from 0-100%) will be included (I think that's all of them but I'm not sure)
It must be a single model. If Model A achieves 75% on half and Model B achieves 75% on the other half that does not resolve the question YES
Ensemble models are fine but something like "run Model A on this benchmark and model B on this other benchmark" is not. If there is model selection is must be learned and it cannot include the current benchmark as an input.

Technical AI Timelines

Get

1,000

to start trading!

Pessoas também estão operando

BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?

MMLU 99% #4: Will SOTA for MMLU (average) pass 99% by the start of 2027?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2027?

MMLU 99% #5: Will SOTA for MMLU (average) pass 99% by the start of 2028?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2028?

SOTA AI at EOY 2026 a reasoning model?

Perguntas relacionadas

BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?

MMLU 99% #4: Will SOTA for MMLU (average) pass 99% by the start of 2027?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2027?

MMLU 99% #5: Will SOTA for MMLU (average) pass 99% by the start of 2028?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2028?

SOTA AI at EOY 2026 a reasoning model?

© Predita Markets, Inc.•Termos de Uso•Privacidade