This is an excerpt of Sources by Alex Heath, a newsletter about AI and the tech industry, syndicated just for The Verge subscribers once a week.
Around the middle of last year, Pim de Witte started reaching out to a handful of prominent AI labs to see if they’d be interested in using data from Medal, his popular video game clipping platform, to train their agents.
Within weeks, it became clear that Medal’s data was more valuable to the labs than he expected. “We received multiple acquisition offers very quickly,” he told me. (He declined to name names, but it has been reported that OpenAI offered $500 million.) “Initially, we were quite interested in them,” he said of the offers, but that “was mostly a result of us not understanding what we were sitting on.”
He had read the Google DeepMind research paper showing that gaming data can be used to teach AI how to navigate a 3D environment. However, the interest from AI labs made him realize that his data from Medal, which receives roughly 2 billion video uploads per year from tens of thousands of video games, could be used to develop a unique foundational model for extending AI to the real world.
“It’s a pretty big bet.”
Today, Pim de Witte announced that Medal is spinning out a new AI lab called General Intuition that has raised a $133.7 million seed round. The money for the round is primarily from Vinod Khosla, founder of Khosla Ventures and one of the first investors in OpenAI. Other investors include General Catalyst and the Raine Group. Moritz Baier-Lentz, who oversees Lightspeed’s gaming investments, is also joining the startup part-time as a founding team member.
Khosla believes that General Intuition could be as impactful in the field of AI agents as OpenAI was on how people use large language models. It’s his firm’s largest seed check since it backed OpenAI in 2018. “It’s a pretty big bet,” he told me. “They have a unique dataset and a unique team.”
Unless you’re steeped in the AI world, you probably haven’t heard much about world models yet. It’s a branch of research that trains AI to have spatial understanding like a human. The idea is that a robot could, for example, predict when a glass of water will spill when knocked off a table and grab it before it falls. More practically, AI researchers are increasingly looking to world models as a way to train agents that can reliably generate and interact with a 3D space.
Among the prominent AI leaders, Google DeepMind CEO Demis Hassabis has been the most vocal advocate for world models and their importance in achieving AGI. Google recently demoed Genie 3, a model that generates a video game-like environment from scratch as you navigate through it. There are also a handful of startups working on similar models, including Fei-Fei Li’s World Labs, which this week released its own demo of a model that generates interactive video in real-time.
For General Intuition, the goal is to control any kind of device that can be mapped to a keyboard and mouse or has a game controller-like input scheme, according to de Witte. He expects the startup’s first model to be used by search and rescue drones but sees the potential for applications in other areas, including humanoid robots and self-driving cars.
Just as LLMs were initially trained on internet text data, de Witte believes that gaming environments will unlock AI’s ability to reliably predict the proper action to take in the physical world. “Games are basically the only verifiable domain for spatial-temporal reasoning,” he explained. “You can separate a good action from a bad action, which is why it’s so valuable.”
Still, it’s a risky bet. The correct technical path for developing world models is hotly debated in the AI industry, and as even Khosla noted to me, it’s unclear what data will ultimately prove the most valuable. Members of de Witte’s early research team have published notable research in the field, but the startup is still competing with better-funded giants like Google. “Somebody will win big in this market,” said Khosla, who told me thinks it’s an area where “multiple hundred-billion-dollar and potentially even trillion-dollar companies will be built.”
De Witte predicts that gaming companies will become prime takeover targets for AI labs as interest in world models heats up. His decision to start General Intuition was driven by the realization that, thanks to Medal’s data, he’s in the unique position to be more than a data supplier. However, he warned me that others might find it challenging to resist licensing checks and acquisition offers from the big AI labs.
“You are at an information disadvantage,” he said when I asked if he had advice for the gaming industry. “The better these models get, the less data they’re likely going to need.”