China's Kimi K2 AI model was disqualified for repeatedly making illegal moves - Photo: chess.com
Gemini 2.5 Pro, o4-mini, Grok 4 and o3 each advanced to the semi-finals of the AI chess tournament with a resounding 4-0 victory, defeating Claude 4 Opus, DeepSeek R1, Gemini 2.5 Flash and Kimi K2 respectively.
In the most notable quarter-final, Moonshot AI's Kimi K2 model (China) suffered a disastrous 0-4 defeat against o3, an LLM from OpenAI, the developer of ChatGPT.
All four games ended quickly, in fewer than eight moves each, as Kimi K2 repeatedly made illegal moves.
For example, in the third game, despite correctly reading the position when o3 gave check, Kimi K2 could not find a single legal move in any of its four attempts and was forced to concede defeat. The share of o3's moves that matched Stockfish's choices reached 100%, showing that the gap in skill was too great.
The other Chinese representative, DeepSeek R1, fared no better, losing 0-4 to OpenAI’s o4-mini. Although it did better than its compatriot by holding on in the first game, DeepSeek R1 still made mistakes and was checkmated in the following games.
However, the most impressive performer in the quarter-finals was Grok 4, a model from billionaire Elon Musk's xAI. Grok 4 easily defeated Google's Gemini 2.5 Flash 4-0. Punishing every mistake its opponent made, Grok 4 posted the highest move accuracy of the round, approximately 97.5%.
Commenting on the match, world No. 2 player Hikaru Nakamura was surprised: “Grok 4 is definitely the strongest LLM in this tournament. The level gap between it and the other models is not small.”
The remark gained further attention when Musk quickly reshared a screenshot of Nakamura's comment on the social network X, adding confidently: "This is just a side effect. xAI spends almost no time on chess."
Chess player Nakamura said Grok 4 completely outclassed the other models at the AI tournament - Photo: screenshot
On Google's side, although Gemini 2.5 Flash was eliminated, its remaining representative, Gemini 2.5 Pro, scored a convincing 4-0 victory over Anthropic's Claude 4 Opus, affirming its position in the tournament.
The semi-finals will take place at 0:30 on August 7 (Vietnam time). The first semi-final is a high-stakes match between Grok 4 and Gemini 2.5 Pro. The other match is a dramatic "OpenAI derby" between o3 and o4-mini.
TUAN LONG
Source: https://tuoitre.vn/my-thang-tuyet-doi-tai-giai-co-vua-danh-cho-ai-20250806111234074.htm