World Models: The $100T AI Bet Founders Must Make Now

World models are quietly transforming AI from text predictors into systems that understand and simulate the real world. Unlike large language models (LLMs) that predict the next word, world models build internal representations of how environments evolve over time and how actions change states. This leap from language to spatial intelligence promises to unlock AI capable of perceiving, reasoning, and interacting with complex 3D spaces.

Fei-Fei Li calls world models "the next frontier of AI," emphasizing spatial intelligence as essential for machines to see and act in the world. Yann LeCun echoes this urgency, arguing that learning accurate world models is key to human-level AI. His approach highlights the need for self-supervised learning architectures that predict world states in compressed representations rather than raw pixels, optimizing efficiency and generalization.

Leading efforts diverge into three camps. OpenAI’s Sora uses video generation transformers to simulate physical environments, showing emergent long-range coherence and object permanence, crucial for world simulation. Meta’s Joint Embedding Predictive Architecture (V-JEPA) models latent representations of videos and robotic interactions to reduce computational waste and improve reasoning. Fei-Fei Li’s World Labs blends multimodal inputs into spatially consistent, editable 3D worlds via Marble, targeting interactive virtual environment generation.

The commercial potential is looking to be enormous. Over $2 billion was invested across 15+ world model startups in 2024, with estimates valuing the full market north of $100 trillion if AI masters physical intelligence. Robotics leads near-term value: enabling robots to safely navigate unstructured environments requires world models to predict object interactions and plan multi-step tasks. NVIDIA’s Cosmos infrastructure accelerates physical AI training with synthetic photorealistic data, while companies like Skild AI have raised billions by building massive robotic interaction datasets.​

Autonomous vehicles also tap world models to simulate traffic and rare scenarios at scale, cutting down expensive on-road tests and improving safety. Companies like Wayve and Waabi leverage virtual worlds for pre-labeling and scenario generation, critical in achieving full autonomy. Meanwhile, the gaming and entertainment sector is the most mature commercial playground, with startups using world models to generate dynamic game worlds and personalized content that attract millions of users almost overnight

Specialized industrial applications—engineering simulations, healthcare, city planning—show clear revenue pathways with fewer competitors. PhysicsX’s quantum leap in simulation speed exemplifies how tailored world models can revolutionize verticals where traditional methods falter. Healthcare and urban planning stand to gain precision interventions and predictive modeling unparalleled by current AI.

The funding landscape reveals the importance of founder pedigree and scale. Fei-Fei Li’s World Labs hit unicorn status swiftly with $230 million raised, Luma AI secured $900 million Series C for supercluster-scale training, and Skild AI amassed over $1.5 billion focused on robotics. NVIDIA, while a supplier, remains a kingmaker, providing hardware, software, and foundational models as a platform layer—both opportunity and competition for startups.

Crucially, despite staggering investment, gaps abound—technical, commercial, and strategic. Training world models requires vast, complex multimodal datasets rarely available openly, creating defensive moats for data-rich startups. Models still struggle with physics accuracy, generalization to novel scenarios, and real-time performance needed for robotics or autonomous vehicles. Startups innovating around efficiency, transfer learning, sim-to-real gaps, and safety validation have outsized opportunities.

On the market front, vertical-specific solutions in healthcare, logistics, and defense are underserved, offering fertile ground for founders with domain expertise. Productizing world models requires bridging the gap from lab prototypes to robust, scalable deployments, including integration tooling and certification for safety-critical applications. Startups enabling high-fidelity synthetic data generation are becoming ecosystem enablers.​

Strategically, founders must navigate open research—like Meta’s V-JEPA—and proprietary plays exemplified by World Labs. Standardization and interoperability remain open questions critical for ecosystem growth. Handling rare edge cases and ensuring reliable sim-to-real transfer are gating factors for robotic and autonomous systems.

For investors, the thesis is clear but nuanced. Robotics world models, vertical AI for high-value industries, infrastructure and tooling layers, and gaming are high-conviction bets offering a blend of risk and clear pathways to market. Foundational model companies with massive compute and data moats present risky but lucrative opportunities, demanding large capital and specialized talent. Efficiency, differentiated data, and agile product-market fit matter more than raw scale alone.

The next 24 months will crystallize market winners as world models shift from research curiosity to mission-critical AI infrastructure. Founders displaying relentless adaptability, technical depth, and deep domain insight will lead the charge. Investors who balance bets across foundation layers and vertical applications, while embracing geographic and stage diversity, stand to capture disproportionate value.

While the industry watches language models, the less flashy but more profound revolution is unfolding quietly in world models—systems that don’t just process language but build a mental map of reality itself. These systems will define the next era of AI, shaping how machines perceive, interact, and augment the physical world for decades.

That’s the state of play. The winners will be those who combine technical innovation with pragmatic business sense, and above all, a ruthlessly adaptive mindset to pivot rapidly as the frontier evolves.