Google DeepMind trains a video game-playing AI to be your co-op companion

Devin Coldewey

Updated 13 March 2024 at 1:40 pm·4-min read

AI models that play games go back decades, but they generally specialize in one game and always play to win. Google DeepMind researchers have a different goal with their latest creation: a model that learned to play multiple 3D games like a human, but also does its best to understand and act on your verbal instructions.

There are of course "AI" or computer characters that can do this kind of thing, but they're more like features of a game: NPCs that you can use formal in-game commands to indirectly control.

DeepMind's SIMA (scalable instructable multiworld agent) doesn't have any kind of access to the game's internal code or rules; instead, it was trained on many, many hours of video showing gameplay by humans. From this data — and the annotations provided by data labelers — the model learns to associate certain visual representations of actions, objects and interactions. They also recorded videos of players instructing one another to do things in game.

For example, it might learn from how the pixels move in a certain pattern on screen that this is an action called "moving forward," or when the character approaches a door-like object and uses the doorknob-looking object, that's "opening" a "door." Simple things like that, tasks or events that take a few seconds but are more than just pressing a key or identifying something.

The training videos were taken in multiple games, from Valheim to Goat Simulator 3, the developers of which were involved with and consenting to this use of their software. One of the main goals, the researchers said in a call with press, was to see whether training an AI to play one set of games makes it capable of playing others it hasn't seen, a process called generalization.

The answer is yes, with caveats. AI agents trained on multiple games performed better on games they hadn't been exposed to. But of course many games involve specific and unique mechanics or terms that will stymie the best-prepared AI. But there's nothing stopping the model from learning those except a lack of training data.

This is partly because, although there is lots of in-game lingo, there really are only so many "verbs" players have that really affect the game world. Whether you're assembling a lean-to, pitching a tent or summoning a magical shelter, you're really "building a house," right? So this map of several dozen primitives the agent currently recognizes is really interesting to peruse:

A map of several dozen actions SIMA recognizes and can perform or combine. Image Credits: Google DeepMind

The researchers' ambition, on top of advancing the ball in agent-based AI fundamentally, is to create a more natural game-playing companion than the stiff, hard-coded ones we have today.

"Rather than having a superhuman agent you play against, you can have SIMA players beside you that are cooperative, that you can give instructions to," said Tim Harley, one of the project's leads.

Since when they're playing, all they see is the pixels of the game screen, they have to learn how to do stuff in much the same way we do — but it also means they can adapt and produce emergent behaviors as well.

You may be curious how this stacks up against a common method of making agent-type AIs, the simulator approach, in which a mostly unsupervised model experiments wildly in a 3D simulated world running far faster than real time, allowing it to learn the rules intuitively and design behaviors around them without nearly as much annotation work.

"Traditional simulator-based agent training uses reinforcement learning for training, which requires the game or environment to provide a 'reward' signal for the agent to learn from -- for example win/loss in the case of Go or Starcraft, or 'score' for Atari," Harley told TechCrunch, and noted that this approach was used for those games and produced phenomenal results.

DeepMind’s Agent57 AI agent can best human players across a suite of 57 Atari games

"In the games that we use, such as the commercial games from our partners," he continued, "We do not have access to such a reward signal. Moreover, we are interested in agents that can do a wide variety of tasks described in open-ended text - it’s not feasible for each game to evaluate a 'reward' signal for each possible goal. Instead, we train agents using imitation learning from human behavior, given goals in text."

In other words, having a strict reward structure can limit the agent in what it pursues, since if it is guided by score it will never attempt anything that does not maximize that value. But if it values something more abstract, like how close its action is to one it has observed working before, it can be trained to "want" to do almost anything as long as the training data represents it somehow.

Other companies are looking into this kind of open-ended collaboration and creation as well; conversations with NPCs are being looked at pretty hard as opportunities to put an LLM-type chatbot to work, for instance. And simple improvised actions or interactions are also being simulated and tracked by AI in some really interesting research into agents.

Researchers populated a tiny virtual town with AI (and it was very wholesome)

Of course there are also the experiments into infinite games like MarioGPT, but that's another matter entirely.

PA Media: Movies
Blake Lively explains Lady Deadpool connection amid cameo speculation
The Gossip Girl actress outlined a string of coincidences.
Yahoo Movies UK
Is Joker 2 actually a musical?
Joker: Folie à Deux unites Joaquin Phoenix and Lady Gaga, but there are split reports on whether the movie is an all-out musical or not.
Yahoo Movies UK
The highest-grossing animated movies of all time
Inside Out 2 now stands alone at the top of the animation world's highest-grossing movies list. Here are more of the big-hitters.
Yahoo Movies UK
Everything we know about the Borderlands movie
Eli Roth gathers an ensemble cast to bring a video game classic to screens. Here’s everything we know about the Borderlands movie.
Yahoo Movies UK
What is Rob McElhenney’s Deadpool and Wolverine cameo?
Ryan Reynolds found room for his Welcome to Wrexham co-star Rob McElhenney in the new Marvel movie Deadpool and Wolverine.
PA Media: Movies
Mick Jagger and Charlize Theron go chic at Paris Olympics fashion event
The event was co-hosted by Theron along with Lupin star Omar Sy, US tennis star Serena Williams and Spanish singer Rosalia.
PA Media: Movies
Taylor Swift calls Ryan Reynolds’ Deadpool 3 ‘best work of his life’
She shared a photo of herself with Reynolds and his wife Blake Lively.
PA Media: Movies
James Bond star George Lazenby retires from acting after ‘a fun ride’
Lazenby had been a model in his early life, before 007 producer Albert Broccoli met him in a barber’s shop and later offered him an audition.
Yahoo Movies UK
What you need to remember from Marvel and Fox to understand Deadpool and Wolverine
The new Marvel film is a love letter to superhero movies of the past, so if you haven't seen them all or Disney+'s TV shows then you might struggle.
Yahoo Movies UK
Deadpool and Wolverine post-credit scenes explained
Marvel fans are no doubt wondering if the threequel continues the tradition of having a post-credit scene, or multiple, after the main event.
Yahoo Movies UK
Deadpool and Wolverine Easter eggs and cameos you may have missed
As Deadpool joins the MCU, Ryan Reynolds' fourth wall-breaking superhero has a much larger sandbox to play in.
Washington Post
How Skibidi Toilet became one of the most valuable franchises in Hollywood
LOS ANGELES - While big budget movies vie for the top spot at the box office this summer, billions of people are clamoring to watch a YouTube show about toilets with human heads that is fast becoming one of the most valuable franchises in Hollywood. Alexey Gerasimov, the creator behind “Skibidi Toilet,” is working with leading independent Hollywood entertainment studio, Invisible Narratives, to expand the YouTube Shorts series into myriad product lines and a potential television and movie franch
BuzzFeed
I Genuinely Cannot Watch "Longlegs" The Same Way After Learning These 15 Fascinating Facts
Maika Monroe didn't even meet Nicolas Cage until they filmed the scene where her character interrogates Longlegs. So, the first time she met Nicolas Cage, she met him as Longlegs.
Yahoo Movies UK
Matthew Macfadyen wasn’t miscast as Mr Darcy
The Pride and Prejudice actor feels he was miscast in Joe Wright's 2005 adaptation of Jane Austen's book, but the film works so well because of his performance.
Yahoo Movies UK
As Star Wars and Gladiator 2 are review bombed, why is it a thing?
The Acolyte, Gladiator II and House of the Dragon are just some of the recent examples of shows and films being review bombed, but why does it happen?
Yahoo Movies UK
What critics are saying about Marvel's Deadpool and Wolverine
The movie sees Ryan Reynolds and Hugh Jackman team up for Marvel for the first time, but the dream team hasn't convinced every critic of the film's value.
PA Media: Movies
Colin Farrell to run marathon to support friend with rare skin condition
Emma Fogarty is Ireland’s longest-surviving person battling the most severe type of the agonising skin condition epidermolysis bullosa.
PA Media: Movies
Joaquin Phoenix and Lady Gaga dance through chaos in Joker: Folie A Deux trailer
The film will see Arthur Fleck awaiting trial for his crimes.
Yahoo Movies UK
What you need to know about Deadpool & Wolverine
Hugh Jackman is back, and he’s ready to carve himself a new legacy as Wolverine in new MCU blockbuster Deadpool & Wolverine.
Yahoo Movies UK
How is Wolverine alive in Deadpool and Wolverine?
After 2017's Logan many viewers thought the X-Men icon was dead and buried, but not anymore as Hugh Jackman is reprising the role in Deadpool and Wolverine.

Latest stories