Rank | Score | Rating | Date | Event | |
---|---|---|---|---|---|
|
227
|
13.07 | · | March 4, 2025 |
LLMs - You Can't Please Them All [statistical analysis, text generation, custom metric] |
|
241
|
1145 | · | Feb. 11, 2025 |
FIDE & Google Efficient Chess AI Challenge [board games, reinforcement learning, simulations, custom metric] |
|
649
|
0.61 | · | Feb. 3, 2025 |
WSDM Cup - Multilingual Chatbot Arena [languages, text conversation, accuracy score] |
|
348
|
0.43 | · | Dec. 12, 2024 |
Eedi - Mining Misconceptions in Mathematics [education, nlp, primary and secondary schools, map@{k}] |
|
121
|
0.43 | · | Dec. 2, 2024 |
UM - Game-Playing Strength of MCTS Variants [games, board games, artificial intelligence, regression, mean squared error] |