Nvidia is betting on three types of computers for its autonomous mobility vision, and it has created a platform to make it a reality with the Cosmos World Foundation Models.
Jensen Huang, CEO of Nvidia, laid out this approach in an opening keynote speech at CES 2025, the big tech trade show in Las Vegas this week.
Transportation leads the way
So far, transportation industry leaders are among the first to adopt the Cosmos platform. You may have heard about the Three Body Problem. Nvidia's answer is a three-computer solution.
“Instead of a Three Body Problem, we have a three computer solution,” Huang said.
Autonomous vehicle (AV) development is made possible by three distinct computers: Nvidia DGX systems for training the AI-based stack in the data center, Nvidia Omniverse running on Nvidia OVX systems for simulation and synthetic data generation, and the Nvidia AGX in-vehicle computer to process real-time sensor data for safety.
Together, these purpose-built, full-stack systems enable continuous development cycles, speeding improvements in performance and safety.
A good example is the digital twin concept made possible by Omniverse. Engineers use this metaverse-like tech to create hyper-realistic simulations of factories. They perfect the design in the virtual space of the Omniverse. When it is close to perfect, they build the factory in real life, outfitted with sensors. Those sensors collect real-world data that is fed back into the virtual model, improving it with actual measurements. The digital twin design is then refined, and the feedback cycle continues. Nvidia’s Rev Lebaredian has explained this to me numerous times.
At the CES trade show, Nvidia today announced a new part of the equation: Nvidia Cosmos, a platform comprising state-of-the-art generative world foundation models (WFMs), advanced tokenizers, guardrails and an accelerated video processing pipeline built to advance the development of physical AI systems such as AVs and robots.
“The AV data factory flywheel consists of fleet data collection, accurate 4D reconstruction and AI to generate scenes and traffic variations for training and closed-loop evaluation,” said Sanja Fidler, vice president of AI research at Nvidia, in a statement. “Using the Nvidia Omniverse platform, as well as Cosmos and supporting AI models, developers can generate synthetic driving scenarios to amplify training data by orders of magnitude.”
With Cosmos added to the three-computer solution, developers gain a data flywheel that can turn thousands of human-driven miles into billions of virtually driven miles, amplifying training data quality.
“Developing physical AI models has traditionally been resource-intensive and costly for developers, requiring acquisition of real-world datasets and filtering, curating and preparing data for training,” said Norm Marks, vice president of automotive at Nvidia, in a statement. “Cosmos accelerates this process with generative AI, enabling smarter, faster and more precise AI model development for autonomous vehicles and robotics.”
Transportation leaders are using Cosmos to build physical AI for AVs, including:
● Waabi, a company pioneering generative AI for the physical world, will use Cosmos for the search and curation of video data for AV software development and simulation.
● Wayve, which is developing AI foundation models for autonomous driving, is evaluating Cosmos as a tool to search for edge and corner case driving scenarios used for safety and validation.
● AV toolchain provider Foretellix will use Cosmos, alongside Nvidia Omniverse Sensor RTX APIs, to evaluate and generate high-fidelity testing scenarios and training data at scale.
In addition, ridesharing giant Uber is partnering with Nvidia to accelerate autonomous mobility. Rich driving datasets from Uber, combined with the features of the Cosmos platform and Nvidia DGX Cloud, will help AV partners build stronger AI models even more efficiently.
Availability
Cosmos WFMs are now available under an open model license on Hugging Face and the Nvidia NGC catalog. Cosmos models will soon be available as fully optimized Nvidia NIM microservices.