NVIDIA Announces New Open AI Model 'NVIDIA DRIVE Alpamayo-R1' and Tools for Autonomous Driving Research



NVIDIA announced NVIDIA DRIVE Alpamayo-R1, an open AI model and tool suite for autonomous driving and robotics, at the NeurIPS AI conference held in San Diego, USA, on December 1, 2025. The announcement focused on strengthening the foundational technology of 'physical AI,' which perceives and interacts with the real, physical world rather than only the digital one.

NVIDIA Advances Open Model Development for Digital and Physical AI | NVIDIA Blog
https://blogs.nvidia.com/blog/neurips-open-source-digital-physical-ai/

Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail | Research
https://research.nvidia.com/publication/2025-10_alpamayo-r1

Nvidia announces new open AI models and tools for autonomous driving research | TechCrunch
https://techcrunch.com/2025/12/01/nvidia-announces-new-open-ai-models-and-tools-for-autonomous-driving-research/

NVIDIA DRIVE Alpamayo-R1, released for autonomous driving research, is a new AI model built on NVIDIA Cosmos-Reason1, a reasoning vision language model for physical AI and robotics. It is positioned as the world's first reasoning vision language action (VLA) model.

Conventional autonomous driving AI can handle individual tasks such as lane changes and obstacle avoidance, but it struggles with complex situational judgment. NVIDIA DRIVE Alpamayo-R1, by contrast, processes camera images together with linguistic information, allowing it to 'think' (reason) like a human before deciding what to do.

For example, when a car sees a rolling ball, it can not only avoid the ball as an obstacle but also understand the context and decide, 'Let's slow down because a child might be chasing it.' This is expected to bring 'Level 4' autonomous driving, in which the system handles all driving tasks under specific conditions and in specific locations, closer to reality.


At the time of writing, the NVIDIA DRIVE Alpamayo-R1 dataset is available on Hugging Face.

Additionally, to make it easier for developers to use these advanced AI models in their own projects, NVIDIA has also provided a guide and toolset called the 'Cosmos Cookbook.' Related tools include 'LidarGen,' which generates the LiDAR data required for autonomous driving simulations, 'Cosmos Policy,' which creates robot behavior policies from video footage, and 'ProtoMotions3,' which enables physically simulated movement of humans and robots in digital space. Together they provide broad support for the development of robots and autonomous vehicles.



In the field of digital AI, NVIDIA Nemotron, the family of foundation models for building agentic AI, has gained new models specialized for speech processing and safety.

For example, 'MultiTalker Parakeet' is an automatic speech recognition model that can accurately understand overlapping voices and fast-paced conversations, even when multiple people are speaking at the same time. The following video demonstrates 'NVIDIA MultiTalker ASR' using MultiTalker Parakeet: it recognizes what each person is saying and transcribes it in real time.

NVIDIA MultiTalker ASR Demo: Real-Time, Multi-Speaker Transcription Made Easy - YouTube


Sortformer also performs a process called diarization, which distinguishes who spoke and when from audio data in real time. Other open-source technologies include Nemotron Content Safety Reasoning, which keeps AI from producing inappropriate output, and NeMo Gym, which supports building environments for reinforcement learning (a method in which AI learns through trial and error). These models have received very high scores on an independent index that evaluates the openness and transparency of licenses, and are recognized for their contributions to the research community.

NVIDIA co-founder and CEO Jensen Huang has emphasized that the next big wave of AI is 'physical AI.' NVIDIA's Chief Scientist, Bill Dally, also believes that robots will play a major role in society in the future, and has indicated his intention to develop the technology that will act as their 'brains.'

in AI, Video, Software, Vehicle. Posted by log1i_yk