Who will ever rank #1 in LMSYS Chatbot Arena Leaderboard in 2025?
87
1.4kṀ190k
resolved Jan 3
Resolvido
YES
OpenAI
Resolvido
YES
xAI
Resolvido
YES
Anthropic
Resolvido
YES
Alphabet / Google
Resolvido
YES
OpenAI (after January)
Resolvido
YES
OpenAI (after February)
Resolvido
YES
Google (after February)
Resolvido
YES
xAI (After initial Grok 3 entry)
Resolvido
YES
xAI (April or later)
Resolvido
YES
Google (after 2.5 release)
Resolvido
YES
Anthropic (after Opus 4.1)
Resolvido
NO
Meta
Resolvido
NO
Safe Superintelligence (SSI)
Resolvido
NO
Apple (?)
Resolvido
NO
Microsoft
Resolvido
NO
Mistral
Resolvido
NO
Reka AI
Resolvido
NO
DeepSeek
Resolvido
NO
OpenAI (May or later)
Resolvido
NO
Moonshot

Chatbot Arena Leaderboard or iterations/new versions of it as creator sees fit. If the project is altogether abandoned with no equivalent, the answers that have not yet resolved YES will resolve NO at EOY unless some replacement appears.

If multiple shared #1 ranks, as has been the case in the past, any of those positions count.


If there are no longer ranked numbers, only top spot/ELO would count. If same ELO, but second spot for some trivial/irrelevant reason, still resolves YES.

Answer resolves YES immediately when an update has been confirmed, no matter what duration it stays in top.

Any AI that hasn't been #1 before during 2025, resolves to NO at the end of the year.

I will not trade in this market, resolution might get messy through mergers/acquisitions, changes to the leaderboard, etc.

Let me know if you want to see improvements to criteria.

See same market but for top 10 below:

/HenriThunberg/who-will-ever-rank-top-10-in-lmsys

  • Update 2025-02-06 (PST) (AI summary of creator comment): Clarification Added:

    • The resolution criteria now explicitly include scenarios where OpenAI retakes the lead after January 2025.

    • Such an event will count toward a confirmed update and will trigger an immediate resolution under the market rules.

  • Update 2025-07-10 (PST) (AI summary of creator comment): - The "Default" view of the Chatbot Arena Leaderboard will be used for resolution.

    • The "Remove style control" view will not count.

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ36,547
2Ṁ2,274
3Ṁ1,170
4Ṁ553
5Ṁ157
Ordenar por:

@creator please resolve this

preenchido aṀ500 YES at 30% order

@figo lol oops
@HenriThunberg xai resolves yes bc of grok 4.1

Ooops, "After 4.1", Unresolved. My bad, sorry for pings!

comprou Ṁ10 YES

Does this count?

no

@Bayesian If multiple shared #1 ranks, as has been the case in the past, any of those positions count.

@HenriThunberg Oops sorry!

comprou Ṁ10 YES

oh no that means i got owned by @PaulHabermas on deepseek. rip

@PaulHabermas ooops apologies, I forgot myself that I wasn't supposed to trade in this market. Got tempted by your juicy limit order and acted too quickly haha.

Do you feel comfortable with me having a large position against you (hoping that ruling will be easy, and barring that counting on me to be fair) or would you want to set up a limit order for me to sell it back?

Haha should be fine. Could you clarify if style control #1 counts/ no style control #1 counts

@PaulHabermas I think "Default" should in this case be the well... default. I.e. "Remove style control" option is not what counts.

Thanks for your understanding, good luck!

Anthropic currently at 38%, that neither Sonnet 3.7 nor 4.0 will eventually take #1.

aberto a Ṁ800 YES at 99.0% order

@HenriThunberg resolves true already lol, with ChatGPT-4o latest now rank 1

aberto a Ṁ700 YES at 99.0% order

@HenriThunberg this was a yes on Feb 14 https://x.com/lmarena_ai/status/1890477460380348916

I saw some people are betting it down so might be worth resolving it?

@TotalVerb thanks, I forgot my own rules that shared #1 (despite not highest ELO) also qualifies.

preenchido a Ṁ5 NO at 35% order

Is it possible to to add something like "OpenAI retakes the lead after Jan 2025"?

@Ernie I think that's simply another question :)

@Ernie Actually, changed my mind on this. Added!

OpenAI and Google can resolve @HenriThunberg

reposted

See same market but for top 10 below:

/HenriThunberg/who-will-ever-rank-top-10-in-lmsys

@HenriThunberg Would have to be listed as Microsoft under the "Organization" column of the leaderboard. Same for Apple.

© Predita Markets, Inc.Termos de UsoPrivacidade