Anthropic has tested its new generative AI model, Claude 3.7 Sonnet, using the classic Game Boy game — Pokémon Red. In its blog, Anthropic reported that the model was equipped with basic memory, could receive input from screen pixels, and perform function calls to press buttons and navigate the screen, which allowed it to play Pokémon continuously.
One of the unique features of Claude 3.7 Sonnet is its ability for “enhanced reasoning.” This enables the model to solve complex tasks by applying more computational resources and spending more time. Compared to the previous version, Claude 3.0 Sonnet, which was unable to leave the house in Pallet Town, Claude 3.7 Sonnet successfully defeated three gym leaders in the game and earned their badges.
Although it is still unknown how many computational resources were required to achieve these results, the company noted that the model performed 35,000 actions to reach the last gym leader named Surge.