    A Look at Google Deep Mind’s Genie & Its Use of Large Language Models

    By Nerd Voices | May 23, 2024 | 4 Mins Read

    Rather than adding inductive biases, we focus on scale. We use a dataset of >200k hours of videos from 2D platformers and train an 11B world model. In an unsupervised way, Genie learns diverse latent actions that control characters in a consistent manner. pic.twitter.com/71a3iuAGZA

    — Tim Rocktäschel (@_rockt) February 26, 2024

    If you think text-to-video is the furthest we could get with today’s artificial intelligence (AI) technology, think again. Google DeepMind recently unveiled Genie, a generative interactive environment trained on Internet videos. In short, it’s an early prototype for a full-blown text-to-video-games model.

    Genie takes a text, image, photograph, or sketch prompt and generates a controllable virtual world out of it. But don’t expect the output to have triple-A graphics just yet. The model is only capable of creating 2D platformers for now, much like the classic Super Mario Bros. games we used to play.

    How is this all possible? DeepMind shares that the model has 11B parameters and was trained on over 200,000 hours of videos from 2D platformer games. Three components work behind the scenes. First, a tokenizer compresses each video frame into discrete tokens, or units of data that serve as a basis for encoding and decoding. From there, a latent action model encodes the transition between two frames as one of eight latent actions. Finally, a dynamics model predicts future frames from the frame tokens and the chosen latent action.
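    The three components described above can be pictured with a toy sketch. This is not DeepMind's implementation: the real components are learned neural networks, while the version below uses hand-written rules purely to show how data flows from frames to tokens, to a latent action, to a predicted next frame. The function names, the codebook size `VOCAB_SIZE`, and the quantization and shifting rules are all assumptions made for illustration. Only the count of eight latent actions comes from the article.

```python
from typing import List

NUM_LATENT_ACTIONS = 8  # from the article: each transition maps to one of eight latent actions
VOCAB_SIZE = 16         # assumption: toy codebook size, not Genie's real value

Frame = List[float]     # toy "frame": a flat list of pixel intensities in [0, 1]


def tokenize(frame: Frame) -> List[int]:
    """Toy tokenizer: quantize each pixel into one of VOCAB_SIZE discrete tokens."""
    return [min(int(p * VOCAB_SIZE), VOCAB_SIZE - 1) for p in frame]


def latent_action(prev_tokens: List[int], next_tokens: List[int]) -> int:
    """Toy latent action model: bucket the change between two frames into one of eight codes.

    A real model learns this mapping from video; here it is a fixed hand-written rule.
    """
    delta = sum(n - p for p, n in zip(prev_tokens, next_tokens))
    return delta % NUM_LATENT_ACTIONS


def predict_next(tokens: List[int], action: int) -> List[int]:
    """Toy dynamics model: shift every token by the action id, wrapping at VOCAB_SIZE."""
    return [(t + action) % VOCAB_SIZE for t in tokens]


# Two consecutive toy frames from a "video".
frame_a = [0.0, 0.25, 0.5, 0.75]
frame_b = [0.1, 0.35, 0.6, 0.85]

tokens_a = tokenize(frame_a)
tokens_b = tokenize(frame_b)
action = latent_action(tokens_a, tokens_b)   # one of 0..7
predicted = predict_next(tokens_a, action)   # predicted tokens for the next frame
print(tokens_a, tokens_b, action, predicted)
```

    At play time, the idea is that the player picks one of the eight latent actions directly, and the dynamics model produces the next frame in response.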

    This training allowed Genie to “learn diverse latent actions that control characters in a consistent manner.” Currently, the games generated by Genie run at only 1 FPS. DeepMind’s Tim Rocktäschel also clarifies that the model is not confined to 2D platform games: the team trained another Genie on robotics data, and it was able to create controllable simulator games.

    Genie is the latest example of a “world model” in AI, where predictions guide the model’s actions. Developed following the concepts of unsupervised learning, this generative tool teaches itself from raw video, to the point where it can create a coherent virtual environment in which to operate.


    It’s safe to say this marks a significant milestone in AI gaming. The transition from raw data to a playable game isn’t instantaneous, but the prospect of generating full-fledged custom games from plain text is nothing short of groundbreaking.

    It paves the way for an era where we can automate the design of custom games, transforming game narratives, characters, and environments in mere seconds. It’s not hard to envision a future filled with AI-driven dynamic gaming where characters and scenarios evolve in real time based on player choices and actions.

    In the realm of large language models (LLMs), Genie is a trailblazer as well. Taken as a whole, the model’s LLM-style approach demonstrates the potential of these techniques to unravel intricate patterns in large datasets, combine them, and create something new. As MongoDB’s post on large language models explains, LLMs work by predicting the next word in a sentence based on the context provided. Genie takes this a notch higher and applies it to gaming: rather than words, it predicts the actions and transitions in its game environment.
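    To make the comparison concrete, here is a minimal sketch of prediction from context, assuming a simple counting approach in place of a neural network. The `NextItemPredictor` class and the sample data are hypothetical and exist only for illustration. The same predictor is used twice: once where the context is the previous word, and once where the context is a game state paired with an action.

```python
from collections import Counter, defaultdict
from typing import Dict, Hashable, Iterable, Optional, Tuple


class NextItemPredictor:
    """Count-based predictor: returns the item seen most often after a given context."""

    def __init__(self) -> None:
        self.counts: Dict[Hashable, Counter] = defaultdict(Counter)

    def fit(self, pairs: Iterable[Tuple[Hashable, Hashable]]) -> None:
        # Each pair is (context, what came next).
        for context, nxt in pairs:
            self.counts[context][nxt] += 1

    def predict(self, context: Hashable) -> Optional[Hashable]:
        seen = self.counts.get(context)  # .get avoids creating an entry for unseen contexts
        return seen.most_common(1)[0][0] if seen else None


# Text case: the context is the previous word.
words = "the cat sat on the mat and the cat slept".split()
text_model = NextItemPredictor()
text_model.fit(zip(words, words[1:]))
next_word = text_model.predict("the")

# Game-like case: the context is (position, action), the target is the next position.
transitions = [
    ((0, "right"), 1),
    ((1, "right"), 2),
    ((2, "left"), 1),
    ((1, "right"), 2),
]
game_model = NextItemPredictor()
game_model.fit(transitions)
next_position = game_model.predict((1, "right"))

print(next_word, next_position)
```

    A real model replaces the lookup table with a network that generalizes to contexts it has never seen, but the input and output roles stay the same.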

    Genie’s release comes on the heels of OpenAI’s Sora, a text-to-video model that translates text into “realistic and imaginative scenes.” Sora is a diffusion model built on a transformer architecture: it starts from video that looks like static noise and removes the noise step by step until a clean clip emerges. While the scenes look uncannily real, the DeepMind team pointed out that such outputs need actions, hence the birth of Genie.

    The implications of a text-to-video-games model are huge. Whether this technology, or AI progress in general, is related to the massive layoffs in the game development industry in recent years remains a speculative question. It is undoubtedly a contentious issue, and one we cannot ignore as we edge closer to an era where a substantial part of game development may happen by machine inference.

    As gamers, we can’t help but be intrigued by the possibility of truly intelligent, dynamic games crafted by the likes of Genie. Yet it’s crucial that we view these advances with a healthy dose of caution, thinking not just about the implications for game development but, more broadly, about the impacts of AI on our society. Since there’s no public release date for Genie yet, we’ll just have to wait and see how this will all progress.
