Last month,Uschi Karnat the $61.5 billion-valuated AI startup Anthropic set up a gaming livestream on Twitch. Gaming livestreams are nothing new on Twitch, but this one is a little different: Claude, Anthropic's AI model, is attempting to beat Pokémon Red.
We are now one month in,and the livestream is still going. However, Claude has not progressedall that much. And, at this rate, Anthropic's AI agent may possibly never be the very best, like no one ever was.
According to Anthropic, when it first launched the "Claude Plays Pokémon" project, previous versions of its AI agent Claude failed at some very basic tasks. For example, according to Anthropic, Claude 3.5 would try to run away from almost every battle in June 2024.
A few months and a few versions of Claude later, Anthropic said there was a stark change. In February 2025, Anthropic gave Claude 3.7 Sonnet a whirl at playing Pokémon.
"Within hours, Claude defeated Brock. Days later, it trounced Misty," Anthropic said. "Progress that older models had little hope of achieving."
Anthropic said that Claude 3.7 Sonnet could plan ahead, remember objectives, and learn from its mistakes, unlike previous versions of the AI agent. It also built a knowledge base, saw the screen, and simulated button presses.
However, the progress Claude 3.7 Sonnet originally made in the game seems to have stalled.
For example, livestream viewers watchedas Clause 3.7 took 78 hoursto get through Mt. Moon in the game. On Reddit, gamers estimatedthat it would typically take a child just a few hours to advance through the same stage.
SEE ALSO: Hands-on with the Claude AI app: It's pleasant to use, but jankyClaude can be seen going in circles, stumbling around the same paths, and often knocking into walls as it tries to get around the game.
The livestream is engaging, especially as a text box lays out Claude's "thinking" as the AI agent tries to figure out what moves to make next.
According to Anthropic engineers in an interview with Ars Technica, Claude has an easier time with aspects of the game which involve text, such as Pokémon battles. However, it struggles with the more visual aspects of the game, such as moving around from town to town on the map.
Claude 3.7 Sonnet has gone much further in the game than previous Claude models, so there's been progress. However, for those warning that AI will soon be able to take over the world, we're nowhere close to that being a reality yet. Claude still has 151 Pokémon to catch.
Topics Artificial Intelligence Gaming Pokemon Twitch Streaming
Creative foodie mum makes kid's lunch into edible famous charactersThe iPhone 7 and 7 Plus are already sold out online — here’s how to buy them IRLVery professional business dog even has his own ID cardFacebook co9 things Apple didn't tell you about iPhone 7, Apple Watch 2 and AirPodsNest lets users swipe through days of footage with new Sightline appThis striking photo series celebrates the beauty of body diversityThe iPhone 7 and 7 Plus are already sold out online — here’s how to buy them IRLFirst a Siri joke, now a 9/11 conspiracy? Facebook Trending is having a really bad week.Forget cities. Volvo is testing autonomous trucks ... in a mine?Kendrick Lamar really doesn't want Lil Wayne to retire eitherTwitter updates direct messages with read receipts, dreaded three dots and link previewsHigh School pays beautiful tribute to cheerleader diagnosed with leukemiaPlease enjoy this footage of Kylie and Kendall Jenner stuck in an elevatorThe EpiPen company's latest critic is the guy who makes BotoxNASA is run by a bunch of 'Star Trek' nerds, and this photo proves itWells Fargo fined $185 million over phony accountsKendrick Lamar really doesn't want Lil Wayne to retire either13 awesome record holders to celebrate 'Guinness World Records 2017'FBI arrests two members of hacker group Crackas With Attitude Leavening Agent—Or Ticking Time Staff Picks: Helen Garner, Tim Parks, Friedel Dzubas On Elvis and Teddy Bears Nineteenth Listen: An Archival Interview with Tony Kushner Amazon Echo Show 8 2023: 3 cool new features Manet to Monet: Don’t Let Renoir Paint Trollope’s “Doctor Thorne” Adapted By “Downton” Creator Too Many Books! We‘re in an Era of Overproduction 'Twilight' fans love the Edward Wordle today: Here's the answer and hints for September 20 iPhone 15's battery health feature will keep it alive longer Blue Apron helps you overcome kitchen fatigue while saving you time and effort Best GoPro deal: Save $50 on the HERO11 at Amazon How Do You Define “Poetry”? The catchiest earworms of 2021 that you just can't get out of your head iPhone 15 FineWoven cases on sale: Save 5% at Amazon The cringiest things tech executives said in 2021, from Mark Zuckerberg to Elon Musk Listen: An Archival Interview with Reynolds Price My Exes’ Exes: A Note of Regret
2.0904s , 8229.4453125 kb
Copyright © 2025 Powered by 【Uschi Karnat】,Exquisite Information Network