Will general-purpose AI models beat the average score of human players in Diplomacy by 2028?
56% chance
General-purpose language models (those not trained for a specific task) have demonstrated chess-playing ability. They are also capable of deception and lie detection. Will language models or visual-language models* beat the average score of human players over a series of 40 games on webDiplomacy.net by 2028? (This question is modeled after Meta's Cicero result.)
[EDIT: Please note that while "CICERO achieved more than 2x the average score of its opponents", this question requires only an above-average score.]
*Models or agents trained on other modalities (e.g. models capable of controlling a robotic arm, like PaLM-E) would also qualify, as long as they weren't trained specifically to play Diplomacy.
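As a rough illustration of the threshold described above, here is a minimal sketch of the comparison. It assumes each game yields one numeric score per player and that "average score" means a plain arithmetic mean; the question does not spell out how webDiplomacy.net scores would be aggregated, so the function name, its parameters, and the sample numbers are all hypothetical.

```python
from statistics import mean


def beats_human_average(model_scores, human_scores, required_games=40):
    """Return True if the model's mean score is strictly above the humans' mean.

    Assumption (not from the question text): scores are plain numbers,
    one per game for the model and one per human player per game.
    """
    if len(model_scores) < required_games:
        raise ValueError(
            f"need at least {required_games} games, got {len(model_scores)}"
        )
    if not human_scores:
        raise ValueError("no human scores to compare against")
    # The question only asks for an above-average score,
    # not the 2x margin quoted for CICERO.
    return mean(model_scores) > mean(human_scores)


if __name__ == "__main__":
    # Made-up numbers purely for illustration.
    model = [3.0] * 40
    humans = [2.5] * 240
    print(beats_human_average(model, humans))
```

Under this reading, a model that averages only slightly above the human mean would already satisfy the condition, which is a much lower bar than the 2x figure quoted for CICERO.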
This question is managed and resolved by Predita.
People are also trading
Related questions
Will any AI model score above 90% on the ARC-AGI-2 benchmark before April 2026?
59% chance
In 2028, will an AI be able to play randomly selected computer games at human level without getting to practice?
46% chance
Will AI beat top human players at Civ6 (without cheating) by EOY 2026?
21% chance
Chatbot Arena: How high will AI score in 2026?
Will AIs beat human experts in question-answering on the GPQA benchmark before January 1st, 2027?
95% chance
Will OpenAI model win first "inter-AI-model diplomacy" game where the game is EU4/5, Civ6/7, or AoE2 Regicide Rumble?
50% chance
Will an AI by OpenAI beat a super grandmaster playing chess by 2028?
57% chance
Will there be an expert-player-beating AI in civ 6 by 2026?
42% chance
Will an AI be capable of achieving a perfect score on the Putnam exam before 2028?
81% chance
Will an AI achieve a perfect score on the Miklós Schweitzer Competition before 2035?
81% chance