Everything about Physical AI

0. What Physical AI actually is

Physical AI is NVIDIA's umbrella term for AI that moves from the digital world into the physical world. Instead of chatbots generating text or code assistants writing software, Physical AI powers robots that walk, vehicles that drive themselves, drones that navigate contested airspace, surgical systems that operate with sub-millimeter precision, and factory floors that run autonomously.

Jensen Huang at GTC 2026 (March 16, 2026): "Physical AI has arrived. Every industrial company will become a robotics company."

The shift is fundamental. Software AI (LLMs, code assistants, image generators) operates in the digital domain: text in, text out. Physical AI operates in the real world: sensor data in (cameras, LIDAR, microphones, force sensors), motor commands out (wheels, joints, grippers, rotors). The AI must perceive, reason, plan, and act in real time, in environments it has never seen before, with consequences for failure that are physical: a robot drops a patient, a drone hits a building, an autonomous truck jackknifes.

This is why simulation, edge computing, and foundation models all converge in Physical AI. You cannot train a humanoid robot to walk by having it fall down 10,000 times in a real lab. You train it in simulation (Isaac Sim), using a physics engine (Newton), with synthetic data (Cosmos), then transfer the learned skills to real hardware (Jetson). The sim-to-real pipeline is the central engineering challenge.

NVIDIA's three-computer architecture: Cloud for training, Simulation for digital twin practice, Jetson for on-robot deployment. The same CUDA code runs end-to-end.

Source: NVIDIA Robotics platform.

1. Key terms (quick reference)

These terms appear throughout this document. Understanding them upfront makes everything that follows click. A full glossary is at the end.

Term	Plain English
Physical AI	AI that operates in the real world: robots, vehicles, drones, surgical systems. Not chatbots or code assistants.
CUDA	NVIDIA's programming platform. Nearly all AI code runs on it. The reason NVIDIA dominates AI hardware.
TOPS	Tera Operations Per Second. How fast an AI chip can think. More is faster. Jetson Orin Nano = 67 TOPS.
Edge AI	Running AI on local hardware (a robot, a camera) instead of cloud servers. Low latency, private, no API costs.
ROS2	Robot Operating System. The standard software framework every serious robot runs. Not an OS, a set of libraries.
VLA	Vision-Language-Action model. Sees a scene, understands instructions, moves a robot. The architecture behind GR00T.
Imitation Learning	Teaching a robot by showing it what to do. Move a leader arm, the follower watches and learns.
Reinforcement Learning	Teaching a robot by trial and error. It tries things, gets rewards or penalties, improves over thousands of attempts.
Sim-to-Real	Skills learned in simulation that transfer to a physical robot. The central challenge of Physical AI.
Isaac	NVIDIA's robotics platform family: Sim (virtual world), Lab (training gym), ROS (robot runtime), GR00T (robot brain).
LeRobot	Hugging Face's open-source robotics framework. The accessible entry point for learning robot AI.
Jetson	NVIDIA's edge AI computer family. The chip that runs inside robots, drones, and cameras.
GR00T	NVIDIA's foundation model for humanoid robots. The "GPT for robots." Open-source.
JEPA	Joint Embedding Predictive Architecture. LeCun's alternative to LLMs. Learns physics from video, not token prediction.
AMI Labs	LeCun's $1B startup building world models based on JEPA. The main competing thesis to NVIDIA's approach.
DOF	Degrees of Freedom. How many independent axes a robot can move. Human arm is about 7 DOF.
LIDAR	Laser-based sensor that creates 3D maps of surroundings. Used in self-driving cars and robot dogs.

2. The NVIDIA ecosystem

NVIDIA has built the dominant end-to-end platform for Physical AI. Understanding this stack is essential because nearly every serious robotics company builds on some part of it. GTC Taipei (June 1, 2026) significantly expanded the ecosystem with Cosmos 3 (open omnimodel), Isaac GR00T as a humanoid reference platform, Alpamayo 2 for Level 4 robotaxis, and RTX Spark consumer PCs that can run Physical AI models locally.

The ecosystem is organized into four layers: solutions (what problem are you solving), software (how do you build it), hardware (what runs it), and developer resources (where do you learn).

Software layer (where the depth is)

The Isaac platform is a family of tools, each handling a different stage of the robot development lifecycle.

Software	What it does	Stage
Isaac Sim	Photorealistic 3D robotics simulator. Digital twins. Synthetic data generation. Built on NVIDIA Omniverse.	Simulation
Isaac Lab	GPU-accelerated reinforcement learning inside Isaac Sim. Train thousands of robots simultaneously.	Training
Isaac ROS	CUDA-accelerated ROS2 stack. GPU-powered perception, navigation, and motor control on the robot.	Deployment
Isaac GR00T	Foundation models and reference platform for humanoid robots. Takes visual + language input, outputs motor commands. GTC Taipei 2026 launched it as an open reference platform for academic research.	Models
Cosmos	World foundation models. Cosmos 3 (GTC Taipei, June 2026) is the first open "omnimodel" unifying vision reasoning, world generation, and action simulation. Super (32B) and Nano (8B) sizes. Enables robots to predict physical outcomes before acting.	Models
Newton	Open-source GPU physics engine. Built with Google DeepMind and Disney Research. GA April 2026.	Physics
Metropolis	Vision AI platform. DeepStream SDK for multi-camera pipelines.	Vision
NemoClaw + OpenShell	Security and governance layer for AI agents. NemoClaw monitors reasoning and ensures actions align with safety guardrails. OpenShell (GTC Taipei, June 2026) provides a secure runtime for self-evolving autonomous agents with governance constraints built in.	Governance
OSMO	Cloud-native orchestration for large-scale robotics workflows. Fleet management.	Operations
JetPack SDK	Base OS + runtime for all Jetson hardware. Ubuntu + CUDA + cuDNN + TensorRT.	Runtime

Developer layer (where to learn)

Jetson AI Lab: Hands-on tutorials for GenAI on Jetson. LLMs, VLMs, VLAs, speech, applications. The #1 starting point.
Jetson Developer Portal: Central hub for documentation, downloads, forums, ecosystem partners.
NVIDIA Robotics Blog: Technical blog posts on robotics tools, SDKs, workflows.
GR00T GitHub: Open-source foundation model code (N1.7 as of May 2026).

3. The hardware ladder: Jetson family

The Jetson lineup is a performance ladder. Each tier is a different price/performance point. JetPack SDK is unified across all of them: code written on one runs on any other.

The Jetson ladder. Orin Nano Super is the recommended starting point; Thor and T4000 (Blackwell) target humanoid-grade foundation-model deployment.

Currently relevant (for new projects in 2026)

Board	AI Performance	Memory	Power	Price	Use case
Jetson Orin Nano Super	67 TOPS	8GB LPDDR5	7-15W	$249	Learning, prototyping, single-robot projects. Runs LLMs (8B at ~15 tok/s), multi-camera vision, robotics stacks. Recommended starting point.
Jetson Orin NX	Up to 100 TOPS	8-16GB	10-25W	$400-600	Mid-range. More headroom for complex multi-model inference.
Jetson AGX Orin	Up to 275 TOPS	32-64GB	15-60W	$1,000-2,000	High-end. Multiple large models simultaneously. Professional robotics.
Jetson Thor (AGX Thor)	2,070 FP4 TFLOPS	128GB	40-130W	$1,999+	Blackwell GPU. Humanoid robots running foundation models (GR00T). What Boston Dynamics, Agility, Figure use.
Jetson T4000	1,200 FP4 TFLOPS	64GB	70W	$1,999 (volume)	Blackwell architecture in a smaller module. Announced CES 2026. Upgrade path from Orin.

Industrial variants

IGX Platform: industrial-grade edge AI. Blackwell-based Thor systems with safety certifications for factories, medical devices, autonomous vehicles. Enterprise deployments.

Why CUDA matters here

NVIDIA's dominance stems from CUDA. Nearly every AI framework (PyTorch, TensorFlow, YOLO, Whisper) is written to run on NVIDIA GPUs via CUDA. A Jetson board runs the same CUDA code as a $30,000 data center GPU. Skills, models, and code transfer up and down the ladder without rewriting. No other edge AI platform has this ecosystem advantage.

Buy: Jetson Store | Orin Nano Super Dev Kit.

4. The software stack: Isaac platform

Isaac Sim (simulation)

The digital twin environment where robots learn before touching reality. Key capabilities:

Photorealistic rendering: synthetic camera images indistinguishable from real photos. Robots trained on synthetic data transfer to real environments.
Physics simulation: powered by PhysX and now Newton. Realistic gravity, friction, collisions, deformation.
Sensor simulation: simulate cameras, LIDAR, IMU, force sensors. The simulated robot "sees" what a real robot would see.
Domain randomization: automatically vary lighting, textures, object positions, and physics parameters so the robot generalizes to unseen environments.
Synthetic data generation: generate millions of labeled training images without manual annotation. The GR00T-Dreams blueprint generated 780,000 synthetic trajectories (9 months of human demonstration data) in just 11 hours.

Isaac Lab (training)

GPU-accelerated reinforcement learning at massive scale. Key capabilities:

Parallel training: run thousands of robot instances simultaneously on GPU clusters. What takes weeks in the real world takes minutes in simulation.
Multi-physics: rigid bodies, soft bodies, fluids, cloth. Complex manipulation tasks like folding laundry or handling deformable objects.
Isaac Lab 3.0: released in early access at GTC 2026. Runs on DGX-class infrastructure. Enhanced support for dexterous manipulation.
Isaac Lab-Arena: simplifies environment composition and accelerates complex task creation. Evaluates tasks in parallel.

Isaac ROS (robot runtime)

The software that runs ON the robot during operation. CUDA-accelerated ROS2.

Built on ROS2: the industry standard robotics framework. Millions of ROS developers can use NVIDIA-accelerated libraries directly.
GPU-accelerated perception: object detection, segmentation, depth estimation, all running on the Jetson GPU. 10-100x faster than CPU-only ROS2.
Navigation stack: autonomous navigation using camera and LIDAR data. Path planning, obstacle avoidance, SLAM.
Manipulation libraries: grasp planning, motion planning for robotic arms.

Newton (physics engine)

Open-source GPU physics engine. Built in collaboration with Google DeepMind and Disney Research.

Purpose: make simulation physics match real-world physics closely enough that sim-trained skills transfer to real robots.
Capabilities: accurate collision detection, realistic object contact, stable simulation of complex systems with both rigid and flexible parts.
GA: Newton 1.0 reached general availability in April 2026.
Disney connection: Disney Research uses Newton for its robotic character platform (the Olaf BDX droids use NVIDIA Jetson + Omniverse + Newton).

5. Foundation models: GR00T and Cosmos

Isaac GR00T

The most ambitious piece of the NVIDIA stack. Foundation models for humanoid robots.

The concept: just as GPT is a general-purpose language model that can be fine-tuned for specific text tasks, GR00T is a general-purpose robot model that can be fine-tuned for specific physical tasks. It takes visual input (what the robot sees) and language input (what it is told to do) and outputs motor commands.

Evolution timeline:

GR00T N1 (GTC 2025): first open foundation model for humanoid reasoning and skills. Cross-embodiment (works across different robot bodies).
GR00T N1.5: generated using synthetic data from GR00T-Dreams blueprint in 36 hours. 40% performance improvement over N1 when combining synthetic and real data.
GR00T N1.6 (CES 2026, January 2026): open reasoning VLA model. Unlocks full body control. Uses Cosmos Reason for better contextual understanding.
GR00T N1.7 (May 2026): supports whole-body humanoid control via the Unitree G1 (UNITREE_G1_SONIC embodiment + GEAR-SONIC controller). The VLA predicts latent action tokens that a whole-body controller decodes into full-body joint commands including legs, arms, and hands. Single policy produces language-conditioned, coordinated manipulation and locomotion end-to-end.
GR00T N (GTC 2026): extended to surgical robotics applications.

Key concept: VLA (Vision-Language-Action). GR00T N1.6+ is a VLA model. It sees, understands instructions, and acts. This is the architecture that makes "pick up the red cup" work without explicit programming for cups.

Code: GitHub | Docs: NVIDIA Developer

Cosmos world models

Cosmos is NVIDIA's family of "world foundation models." Think of them as the robot's imagination.

What they do: Cosmos models predict what will happen next in the physical world. If a robot pushes a cup toward the edge of a table, the world model predicts the cup will fall. This predictive ability is essential for planning actions and evaluating whether a planned action will succeed before executing it.

Key models (as of GTC 2026):

Cosmos Transfer 2.5: converts real video to synthetic training data. Enables massive data augmentation from small real-world datasets.
Cosmos Predict 2.5: predicts future states of physical environments. Enables robot policy evaluation in simulation.
Cosmos Reason 2: open reasoning VLM. Enables machines to see, understand, and act in the physical world. The perception layer for GR00T N1.6+.
Cosmos 3 (GTC Taipei, June 1, 2026): the first fully open "omnimodel" for Physical AI. Unifies vision reasoning, world generation, and action generation in a single model. Available in two sizes: Super (32B) and Nano (8B, can run on local hardware including DGX Spark and RTX Spark). Called an "omnimodel" because previous Cosmos versions separated perception, prediction, and generation into distinct models. Cosmos 3 does all three natively.

All Cosmos models are open and available on Hugging Face. The Nano (8B) variant is particularly significant for the developer ecosystem because it can run locally on NVIDIA's new RTX Spark consumer PCs (ARM-based, up to 128GB unified memory, 1 petaFLOP AI performance), meaning Physical AI development no longer requires cloud compute or expensive workstations.

6. The competing thesis: LeCun, AMI Labs, and JEPA

Not everyone agrees that the NVIDIA approach (massive simulation pipelines, synthetic data, scaled compute) is the right path. The most credible dissent comes from Yann LeCun, who helped create the deep learning revolution and is now betting $1 billion that the entire industry has taken a wrong turn.

What happened

LeCun left Meta in late 2025 after 12 years leading FAIR (Fundamental AI Research). He founded AMI Labs (Advanced Machine Intelligence) in Paris with co-founder Alexandre LeBrun. In March 2026, AMI raised $1.03 billion in seed funding at a $3.5 billion pre-money valuation, the largest seed round in European history, backed by Bezos Expeditions, NVIDIA, Cathay Innovation, Greycroft, Hiro Capital, HV Capital, and others including Tim Berners-Lee, Jim Breyer, Mark Cuban, Xavier Niel, and Eric Schmidt.

His departure was driven by a philosophical split. Meta pivoted resources to catch up in the LLM race (Llama 4). LeCun believed true machine intelligence requires understanding physical reality, not predicting the next token.

The core argument

LeCun's position: LLMs are sophisticated pattern matchers that produce plausible text without ever understanding the world. They predict the next word. Image generators predict pixels. Both operate in "surface space" and fail when applied to physical-world reasoning because the space of possible outputs is astronomically large (every possible future frame of video, every possible physical outcome).

His alternative: world models that learn abstract representations of reality and make predictions in "latent space." Instead of predicting every pixel of what happens next, the model predicts the abstract state of the world: "the cup will fall off the table" rather than generating a video of the cup falling. This is how LeCun believes babies learn physics: by observing patterns, not by generating predictions of every sensory detail.

Key quote: "I do not understand how you can even think of building an agentic system without that system having the ability of predicting the consequences of its actions."

JEPA (Joint Embedding Predictive Architecture)

JEPA is the technical architecture behind LeCun's thesis. It trains AI systems to predict abstract representations of future states in a learned embedding space, rather than predicting raw data.

The progression:

I-JEPA: image-based JEPA. Learns to predict abstract image representations.
V-JEPA 2 (June 2025): video-based JEPA. 1.2B parameters. Trained on 1M+ hours of unlabeled internet video. State-of-the-art on action anticipation (predicting what will happen 1 second in the future from video).
V-JEPA 2-AC (Action-Conditioned): after pre-training on video, a second phase adds just 62 hours of robot control data. The model can then predict physical outcomes of robot actions, enabling planning and control.
V-JEPA 2.1 (March 16, 2026): improved training recipe for high-quality, temporally consistent dense features. All tokens (visible and masked) now contribute to the self-supervised training loss. State-of-the-art robot navigation. 10x faster planning than previous approaches. Closes the gap with DINOv2 on dense, spatially precise features.

Real results, not just theory

V-JEPA 2 has been tested on physical robots at Meta's labs:

Zero-shot robot control: 65-80% success rate on pick-and-place tasks with objects and environments the model had never seen. No retraining for new scenes.
30x faster than NVIDIA's Cosmos on major video benchmarks (Something-Something v2, Epic-Kitchens-100).
50% fewer parameters than comparable models with equivalent benchmark performance.
State-of-the-art action anticipation: predicts what a person will do 1 second in the future from egocentric video, outperforming task-specific models.

The 62-hour training number is striking. GR00T required 780,000 synthetic trajectories (equivalent to 9 months of human demonstration data). V-JEPA 2 achieves competitive robot control from 62 hours of real robot data on top of video pre-training. This is the efficiency argument in action.

NVIDIA GR00T vs LeCun JEPA: head to head

Two approaches to robot intelligence. NVIDIA bets on scaled simulation; LeCun bets on observation-based world models. Both need governance; neither provides it.

The honest assessment

LeCun's credibility is beyond question: he invented convolutional neural networks, won the Turing Award, and built the research lab that produced many of the tools the industry uses today. The V-JEPA results are real and published. The $1B funding gives AMI Labs 4-5 years of runway.

But AMI Labs is months old and has shipped nothing commercially. NVIDIA has 110+ partners with deployed systems. The gap between "impressive research" and "robots doing useful work at scale" is wide. If JEPA's efficiency claims hold at scale, it could reduce the need for massive simulation infrastructure. If they do not hold, the approach remains a research contribution rather than an industry shift.

Both outcomes matter for Physical AI work. Either NVIDIA's full-stack approach wins and the skills around it remain central, or JEPA-style world models win and the skills shift toward efficient representation learning and minimal-data training. Perception, edge deployment, and governance skills are relevant either way.

Where governance sits in both

The System M paper (Dupoux, LeCun, Malik; arXiv 2603.15381, March 2026) proposed an A-B-M architecture for autonomous cognition. The "M" (Meta-learning system) orchestrates how the robot learns and adapts. What is missing from both NVIDIA's and LeCun's approaches: governance. Who decides what the robot should learn? What boundaries constrain its autonomous behavior? How do you audit a world model's internal representations? These are the same questions software AI governance is asking, with physical consequences.

7. The open-source ecosystem: Hugging Face and beyond

Why Hugging Face keeps showing up

Hugging Face is to AI what GitHub is to software. With over 1 million repositories and 5 million+ researchers and developers, it is the infrastructure layer the AI ecosystem runs on. When a new model is released (by NVIDIA, Google, Meta, Alibaba, or anyone else), it shows up on Hugging Face first.

Their robotics push is recent and aggressive. In early 2025, Hugging Face acquired Pollen Robotics (makers of Reachy Mini), creating a vertically integrated open-source robotics stack: software (LeRobot) + hardware (Reachy Mini) + model hosting (Hugging Face Hub) + community. Robotics datasets on the platform grew from 1,145 in 2024 to 26,991 in 2025, making robotics the single largest dataset category on the entire platform.

The Reachy Mini app store (May 6, 2026)

Hugging Face launched an open-source app store for Reachy Mini on May 6, 2026, opening with 200+ apps from 150 different creators. Around 10,000 Reachy Mini devices were already shipped or in transit at launch, with 3,000 shipped in the week prior. The hardware ranges from $299 (Lite, tethered) to $449 (Wireless, RPi 5 with battery).

Apps require no coding knowledge to build or use. Examples at launch include an office receptionist app built in under two hours, baby-monitor-style apps, cooking assistants, and a distraction tracker. Hugging Face's "ML Intern" agent can generate and refine apps. Every app is an open-source repository that anyone can fork, audit, or extend. This mirrors the iOS App Store model but for robot intelligence.

The NVIDIA partnership makes this ecosystem more powerful. As of CES 2026, NVIDIA Isaac models and libraries integrate into LeRobot. A hobby arm running LeRobot connects to the same model ecosystem that Figure's humanoid fleet uses. The workflow: collect demonstration data with LeRobot, train with Isaac Lab, simulate in Isaac Sim, deploy on Jetson.

Hugging Face LeRobot

Framework: PyTorch-based. Covers data collection, training, simulation, and deployment.
Hardware compatibility: SO-ARM100/101 robotic arms, Reachy Mini, any LeRobot-compatible robot.
NVIDIA integration: Isaac open models and libraries integrated into LeRobot as of CES 2026.
Approach: imitation learning (demonstration-based) and reinforcement learning.

Site: Hugging Face LeRobot | Code: GitHub

Other platforms and tools

Platform	Role
Roboflow	Computer vision model training and deployment. Annotate, train YOLO/custom models, deploy to edge. Go-to tool for vision pipelines.
AgenticROS	Bridge between AI agents (Claude Code, Claude Desktop, Gemini, OpenClaw) and ROS2 robots. Natural-language control of physical robots through messaging apps and MCP. Open-source.
Weights & Biases	ML experiment tracking, model versioning, collaboration. Used by most serious ML teams.
Ultralytics	Makers of YOLO (the most popular object detection family). YOLOv8/v9/v11 for real-time detection.
Open Robotics	Maintainers of ROS2 and Gazebo.
MoveIt	Motion planning framework for robotic arms. Built on ROS2. Standard for manipulation.

Key open hardware projects

Project	What it is	Ecosystem
SO-ARM100/101	Open-source 6-axis robotic arm for imitation learning	LeRobot / Hugging Face
Reachy Mini	Expressive desktop robot with camera, mic, speaker. App store as of May 2026.	LeRobot / Pollen / Hugging Face
JetBot / JetRacer	Autonomous vehicle kits	NVIDIA Isaac / Jetson
Duckiebot	MIT-origin self-driving car platform	ROS2 / Jetson

8. Autonomous vehicles: Tesla, Waymo, and the road

This is the most visible and commercially advanced application of Physical AI. Self-driving vehicles represent billions of dollars in real-world deployment, millions of miles of real-world data, and some of the most complex AI challenges in existence.

Tesla: the Physical AI company

Tesla no longer describes itself as a car company. In its 2025 annual report, Tesla described its transition "from a hardware-centric business to a physical AI company." Three Physical AI products are now active or launching:

Full Self-Driving (FSD): Tesla's autonomous driving software, now at version 14.3 (April 2026). Uses end-to-end neural networks trained on data from millions of vehicles. The v14.3 update upgraded the reinforcement learning stage for edge cases, enhanced the vision encoder for low-visibility scenarios, and rewrote the AI compiler to cut inference latency by up to 20%. On the Q1 2026 earnings call, Musk said unsupervised FSD would reach customer vehicles "probably Q4" of this year, though Tesla's FSD timelines have been consistently optimistic.

Cybercab (robotaxi): a purpose-built autonomous vehicle with no steering wheel or pedals. Continuous production began at Giga Texas in April 2026 (the first unit rolled off in February). Tesla designed the Cybercab to comply with all existing FMVSS standards on its own, so it is not subject to NHTSA's 2,500-vehicle annual production cap for autonomous vehicles. Robotaxi service expanding to seven cities in H1 2026 (Dallas, Houston, Phoenix, Miami, Orlando, Tampa, Las Vegas) on top of existing service in Austin. Showcased at the F1 Miami Grand Prix Fan Fest (April 29 to May 3, 2026) in a glass display case.

Optimus (humanoid robot): Gen 3, the first version designed for mass production, was unveiled in Q1 2026 with an updated hand design. The AI5 inference chip was taped out on April 15, 2026, and will deploy to Optimus robots and supercomputer clusters first, not Tesla vehicles. Optimus integrates xAI's Grok for conversational AI while using FSD-derived neural networks for physical movement. Musk has stated plans to produce 1 million Optimus units annually, and SpaceX aims to send Optimus robots to Mars on Starship flights in late 2026.

Why Tesla matters here: they have the largest real-world driving dataset on Earth. The FSD neural networks are the most battle-tested Physical AI systems in consumer deployment. The Optimus-FSD convergence (same AI stack powering both cars and humanoid robots) represents a unique approach where vehicle autonomy and robot autonomy share a single development pipeline.

Tesla AI: tesla.com/AI

Waymo (Alphabet)

The market leader in fully autonomous ride-hailing. Waymo operates commercial robotaxi services in multiple US cities. Their vehicles use a sensor suite (LIDAR, cameras, radar) and proprietary AI for fully driverless operation (no safety driver). Waymo holds the most advanced autonomous vehicle permits in the US.

waymo.com

Other autonomous vehicle companies

Company	Focus	Status
Zoox (Amazon)	Purpose-built autonomous robotaxi (no steering wheel)	Free rides in San Francisco, backed by Amazon's resources
Aurora Innovation	Self-driving trucks (Aurora Driver platform)	Public; commercial deployment with FedEx and others
Mobileye (Intel)	Autonomous driving chips and software	Powers ADAS in 800+ vehicle models. Public.
Volvo Autonomous Solutions	Autonomous trucks and industrial vehicles	Active EU AI Act compliance for autonomous transport
Cruise (GM)	Robotaxi service	Restructured after incidents; pivoting to personal vehicle autonomy
Caterpillar	Autonomous construction and mining	Using NVIDIA platform for autonomous off-highway vehicles
Tesla Semi	Autonomous electric trucking	Up to 500 autonomous Tesla Semis in grocery supply chain
NVIDIA Alpamayo 2	Open platform for Level 4 robotaxi development	Announced GTC Taipei June 2026. Reference architecture for AV companies building on NVIDIA hardware. Won COMPUTEX 2026 Vehicle Technology category award.

The data advantage

What sets autonomous vehicles apart from other Physical AI domains: they generate vast quantities of real-world training data continuously. Every Tesla on the road feeds data back. Every Waymo ride adds to the dataset. This creates a compounding advantage: more data produces better models, better models attract more users, more users generate more data. This flywheel does not exist in humanoid robotics yet, which is why vehicle autonomy is years ahead of robot autonomy in real-world deployment.

9. Companies building Physical AI

The NVIDIA partner ecosystem

As of GTC Taipei (June 2026), NVIDIA has 150+ ecosystem partners, up from 110+ at GTC San Jose in March. The major categories:

Robot brain developers (generalized AI for any robot body)

Skild AI: generalized robot brains using Cosmos world models and Isaac simulation.
FieldAI: same approach as Skild, using Cosmos for data generation.
World Labs: using Isaac Sim to validate generative world models.
Generalist AI: using Cosmos to explore synthetic data generation.

Humanoid robot companies

Company	Robot	NVIDIA tech used	Status
Figure	Figure 02	Isaac platform, accelerated computing, Helix VLA model	Building large-scale fleet for commercial deployment
Agility Robotics	Digit	Jetson AGX Thor (on-board compute)	Deployed in logistics facilities
Boston Dynamics	Atlas (electric)	Jetson Thor integration (2026)	Integrated Jetson Thor into existing humanoids
1X Technologies	NEO Gamma	GR00T N1 (post-trained policy)	Demonstrated autonomous domestic tidying at GTC
AGIBOT	Multiple	GR00T N models, Cosmos	Humanoids for industrial and consumer sectors
Unitree	G1, H1, Go2	GR00T N1.7 (whole-body control)	G1 is the reference embodiment for GR00T
LG Electronics	Home robot	Isaac Sim	New home robot for indoor household tasks (CES 2026)
NEURA Robotics	Multiple	GR00T N models, Cosmos, Isaac	Next-gen humanoids for industrial deployment
Mentee Robotics	Multiple	Cosmos, Isaac Sim, Isaac Lab	Next-gen humanoid development

Industrial robot giants

Company	Focus	NVIDIA integration
ABB Robotics	Industrial automation	Integrating Omniverse into RobotStudio (HyperReality release 2026)
FANUC	Industrial robots, CNC	NVIDIA platform for automation
KUKA	Industrial robotics	Building on NVIDIA technology for physical AI at scale
Universal Robots	Collaborative robots (cobots)	NVIDIA partner ecosystem
YASKAWA	Industrial automation	NVIDIA platform integration

Surgical robotics

Company	System	NVIDIA integration
CMR Surgical	Versius	Cosmos-H simulation for training and validation prior to clinical deployment
Johnson & Johnson MedTech	Monarch Platform	Isaac Sim and Cosmos-based post-training workflows
Medtronic	Surgical systems	Exploring IGX Thor for mission-critical precision and functional safety

Other notable deployments

Company	Application	NVIDIA tech
Amazon Robotics	Warehouse manipulation and mobile robots	Omniverse libraries and frameworks
Caterpillar	Autonomous construction equipment	NVIDIA partner
Disney	Olaf robotic character (BDX droids)	Jetson, Omniverse, Kamino simulation platform
Maximo (AES Corp.)	Solar installation robot fleet	Isaac Sim, Omniverse, accelerated computing. Completed 100MW installation.
Aigen	Agricultural weeding rovers	Jetson Orin for real-time crop vs weed detection
Hexagon Robotics	Industrial autonomy	Cosmos models, Isaac tools
PTC	CAD-to-simulation (Onshape to Isaac Sim)	Cloud-native design-to-simulation workflow
Siemens	Digital twin software	First company to develop software supporting the Mega Omniverse Blueprint

Source: NVIDIA GTC 2026 robotics roundup.

10. Medical and surgical AI

Surgical robotics is one of the most mature and commercially proven applications of Physical AI. The global surgical robotics market is projected to reach $14 billion by 2026.

Surgical robotics leaders

Company	System	What it does	Status
Intuitive Surgical	da Vinci	The dominant surgical robot platform. Used in urology, gynecology, thoracic, and general surgery worldwide. Over 9 million procedures performed.	FDA approved, market leader
Medtronic	Hugo RAS	Modular robotic-assisted surgery. Mobile carts, open console, flexible deployment. Competing with da Vinci on cost and flexibility.	CE marked, expanding globally. Using NVIDIA Cosmos for simulation.
CMR Surgical	Versius	Portable, scalable alternative to fixed-tower robots. Arm carts positioned as needed. Well-suited for smaller operating rooms.	CE marked, deployed in public and private hospitals. Using NVIDIA Cosmos-H for training and validation.
J&J MedTech	Monarch Platform	Endoscopy-focused robotic platform.	Using Isaac Sim and Cosmos for post-training workflows.
Stryker	Mako	Orthopedic surgery (hip and knee replacements).	FDA approved, widely deployed.
Zimmer Biomet	ROSA	Orthopedic surgical robot for knee and brain surgery.	FDA approved.
Moon Surgical	Maestro	AI-powered laparoscopic surgery assistant. Backed by NVIDIA and J&J venture arms.	FDA cleared (2025).
Edge Medical Robotics	(In R&D)	AI-powered soft-tissue surgical robots with real-time tactile sensing.	Pre-commercial, gaining traction.

Beyond the operating room

Rehabilitation and exoskeletons: AI-driven robotic exoskeletons (Ekso Bionics, ReWalk) enable paraplegic patients to walk again. ML-controlled actuators adjust resistance based on patient progress, creating personalized rehabilitation plans. This area bridges Physical AI with wearable technology.

Smart prosthetics: machine-learning-controlled prosthetic limbs learn the wearer's movement patterns and adapt. Advances in neural interfaces are connecting prosthetics directly to the nervous system for near-natural control.

AI diagnostics: medical imaging AI (radiology, pathology, dermatology) using computer vision to detect disease. Closer to software AI than Physical AI, but the diagnostic models increasingly run on edge hardware in clinics and hospitals.

The regulatory landscape for medical AI

The FDA has been actively developing frameworks for AI in medical devices:

January 2025: FDA published draft guidance on AI-enabled Device Software Functions (DSF), recommending model descriptions, data lineage, bias analysis, and human-AI workflow documentation.
2026 agenda: FDA finalizing lifecycle management rules for AI-enabled devices.
EMA + FDA (January 2026): jointly published 10 guiding principles for good AI practice in medicines development.
Key classification: real-time surgical AI predictions fall into Software as a Medical Device (SaMD). The line blurs when AI influences a robot's physical actions.

11. The defense and autonomous sector

This sector represents a massive and rapidly growing application of Physical AI. The skills overlap with commercial robotics is near-total: perception, autonomous navigation, edge AI, reinforcement learning, sim-to-real transfer.

Axon (Nasdaq: AXON)

What they are: global leader in public safety technology. Evolved from body cameras and TASERs into a full AI-powered public safety platform.

Scale: Q1 2026 revenue $807M (up 34% YoY, 9th consecutive quarter of 30%+ growth). AI revenue surged 700%. AI bookings rose 140% YoY. Annual recurring revenue $1.5B (up 35%). Platform Solutions revenue up 95% YoY to $111M. Management raised full-year 2026 guidance to 30-32% growth. Market cap ~$30B.

Product ecosystem:

TASER energy devices (TASER 10 adopting at 2x the rate of TASER 7).
Body-worn, fixed, and in-car cameras (Axon Body 4).
Axon Vision: AI-powered real-time awareness from live video (CCTV analysis at scale).
Axon Vehicle Intelligence: AI license plate recognition with vehicle descriptors.
Dedrone: counter-drone detection and response.
DFR (Drone First Responder): autonomous drones dispatched ahead of ground units.
Draft One: AI-automated police report writing from body camera footage.
Axon Evidence: cloud-based digital evidence management.

Site: axon.com | Q1 2026 earnings

Overland AI (private, Seattle)

What they are: autonomous ground vehicles for the U.S. military. Spun out of the University of Washington in 2022.

Scale: raised $100M in February 2026 (led by 8VC). 100+ employees. Total funding $140M+.

Product stack:

OverDrive: autonomy software stack. Off-road navigation in GPS-denied, comms-denied environments. Computer vision, LIDAR, proprietary AI models.
SPARK: modular sensor and compute hardware kit. Mounts on any vehicle to give it autonomous capability.
OverWatch: command and control platform. One operator controls multiple autonomous ground vehicles.
ULTRA: purpose-built autonomous ground vehicle platform.

Operational deployments (real, not demos):

82nd Airborne Division, JRTC (April 2026): four ULTRA vehicles integrated into a month-long combat training rotation. Reduced distribution timelines by ~50%.
USMC ROGUE Fires (April 2026): integrated OverDrive onto the Marine Corps Remotely Operated Ground Unit for Expeditionary Fires.
Selected for U.S. Army UxS Autonomous Maneuver Program to evaluate autonomy on Infantry Squad Vehicles.

Anduril Industries (private, $61B valuation)

What they are: the largest defense tech startup in the world. Founded 2017 by Palmer Luckey. Building autonomous weapons systems, drones, naval vessels, and the software platform that connects them all.

Scale: $5B Series H on May 13, 2026 led by Thrive Capital and Andreessen Horowitz, doubling valuation to $61B. Expected 2026 revenue ~$4.3B (doubling from 2025). The U.S. Army awarded Anduril a firm-fixed-price enterprise contract with a $20B ceiling for Lattice integration (10-year enterprise agreement). Also signed a framework agreement with the U.S. Department of War to mass-produce at least 3,000 Barracuda-500M cruise missiles over three years. Total funding $11B+.

Product ecosystem:

Lattice: autonomous software platform and network. The "operating system" for the battlefield. Connects and powers thousands of systems, analyzes sensor data, acts autonomously under human supervision.
Fury (YFQ-44A): unmanned autonomous fighter jet for the U.S. Air Force Collaborative Combat Aircraft (CCA) program.
Roadrunner: jet-powered VTOL autonomous air vehicle. Reusable. Intercepts and destroys drones and aerial threats.
Ghost: autonomous small UAS for ISR.
Altius: tube-launched loitering munitions. Deployed in Ukraine.
Barracuda-500M: surface-launched cruise missiles (3,000+ over three years).
Autonomous Surface Vessels: being built at the old Foss Shipyard in Seattle for the U.S. Navy MASC program.
Golden Dome: space-based missile defense system (contract announced May 2026).
Arsenal-1: hyperscale manufacturing campus in Ohio. $1B investment. 5M+ sq ft. 4,000+ workers.

Shield AI (private, $12.7B valuation)

What they are: autonomous piloting software for military aircraft. "The company that builds the pilot, not the plane."

Scale: $1.5B Series G (March 2026) at $12.7B valuation. Projected 2026 revenue $540M+. Total funding $3.6B+.

Core technology:

Hivemind: autonomous piloting software. Flies aircraft without human input or GPS. Selected for the U.S. Air Force CCA prototype program (February 2026).
V-BAT: VTOL drone. Replicator-eligible (DoD mass production program).
X-BAT: stealth autonomous combat aircraft. Flight testing scheduled 2026.

The Anduril-Shield AI dynamic: for the CCA program, Anduril builds the airframe (Fury) and Shield AI provides the autonomy software (Hivemind). The Air Force deliberately split hardware and software to avoid single-vendor lock-in. Anduril supplies the body, Shield AI supplies the brain.

Other companies

Company	Focus	Notes
Skydio	Small autonomous drones	Market leader in small UAS autonomy. Enterprise and government.
Kratos Defense	Attritable drone systems	Public, $3B+ market cap. Affordable autonomous combat aircraft.
Hermeus	Hypersonic unmanned fighter jets	Raised $350M at $1B+ (April 2026). Led by Khosla Ventures.
Helsing	European defense AI	Raising ~$1.2B at ~$18B valuation. The European Anduril equivalent.
Echodyne	Advanced radar systems	Seattle-area. Defense tech radar.
Armada	Portable AI data centers for military	Bellevue-based.
Palantir	Data analytics / defense intelligence	Public, deep DoD relationships. Software layer.
Oshkosh Defense	Light autonomous vehicles	Unveiled LMMAV (October 2025). Supervised autonomy for logistics and counter-drone.
General Dynamics	Robotic combat vehicles	SMET vehicles, TRX-10. Infantry support and battlefield networking.

The funding context

Pentagon FY2026 budget: ~$8.9B for autonomous and collaborative platforms.
Replicator program: ~$1B across two tranches for mass-produced autonomous systems.
CCA program: ~$1B in Air Force RDT&E (the single largest line item).
Anduril: $11B+ total funding, $61B valuation, $20B Lattice contract.
Shield AI: $3.6B+ total funding, $12.7B valuation.
Overland AI: $140M+ total funding.

The Pacific Northwest connection

Seattle and the surrounding area is emerging as a major hub for defense tech and Physical AI: Overland AI (autonomous military ground vehicles, UW spinoff), Anduril (autonomous naval vessels at the Seattle shipyard), Echodyne (radar), Armada (Bellevue, portable AI data centers), Amazon Robotics (warehouse/logistics), and NVIDIA ecosystem partners.

12. Space exploration

Space robotics is one of the oldest and most demanding applications of autonomous AI. Communication delays (up to 24 minutes to Mars one way) mean space robots must operate autonomously, without real-time human control.

NASA's AI robotics fleet (2026)

Perseverance Mars Rover: in December 2025, NASA's Perseverance completed the first AI-planned drives on another world. Generative AI analyzed high-resolution orbital images of Jezero Crater and generated waypoints that kept the rover away from hazardous terrain. Previously, human route planners handled this complex decision-making task.

Astrobee (ISS): three cube-shaped free-flying robots (Honey, Queen, Bumble) operate inside the International Space Station. They inventory supplies, record video, monitor systems. In late 2025, Stanford researchers demonstrated the first AI-controlled robotic navigation on the ISS, achieving 50-60% faster autonomous movement planning.

CIMON: an AI-powered robot about the size of a bowling ball, tested on the ISS for its ability to control other free-flying robots. Robot-to-robot communication and autonomous task delegation in space.

Robonaut 2: a humanoid robot at the ISS with 350+ sensors, 38 processors, and hands with 12 degrees of freedom. Can execute tasks autonomously. A precursor to the humanoid robots that will eventually work alongside astronauts on the Moon and Mars.

Fly Foundational Robots (FFR): a NASA mission launching a robotic arm capable of dexterous manipulation, autonomous tool use, and walking across spacecraft structures in zero gravity. Developed by Motiv Space Systems. The stepping stone to robots that assemble solar arrays, refuel satellites, and build lunar habitats.

SpaceX + Tesla Optimus on Mars

SpaceX aims to launch a small uncrewed fleet of Starships toward Mars in late 2026, carrying Tesla Optimus humanoid robots to explore the Red Planet. Whether this timeline holds is debatable, but the convergence of SpaceX launch capability and Tesla humanoid robotics represents a path where Physical AI operates on another planet within the next few years.

Why space robotics matters

Space robotics operates under extreme constraints: limited compute, harsh environments, communication delays, no ability to physically intervene when something goes wrong. The AI must be resilient, autonomous, and power-efficient. These constraints mirror edge AI challenges on Earth (limited compute, real-time requirements, no cloud fallback). Skills in autonomous navigation, computer vision on constrained hardware, and fault-tolerant AI transfer directly between space and terrestrial robotics.

13. Government regulation and policy

Physical AI regulation is fragmented, evolving rapidly, and presents a significant governance gap. This is one of the most under-developed and under-discussed areas in AI policy today.

EU AI Act and the May 2026 Omnibus simplification

The EU AI Act is the world's first unified law governing AI systems. It entered into force on August 1, 2024, with full applicability originally scheduled for August 2, 2026.

On May 7, 2026, the EU Council and Parliament agreed on the "Digital Omnibus on AI" (part of the Omnibus VII simplification package), making significant changes:

High-risk AI deadlines deferred: obligations applicable to high-risk AI systems subject to existing EU sectoral legislation (Article 6(1) coverage) now take effect on August 2, 2028 rather than 2026.
Machinery overlap resolved: machinery products now only need to comply with sectoral safety rules instead of both the AI Act and sectoral rules. Direct impact on industrial robotics, factory automation, and surgical equipment.
Transparency timeline: grace period for providers to implement transparency solutions for AI-generated content reduced from 6 months to 3 months, with the new deadline December 2, 2026.
Formal adoption expected before August 2, 2026.

Key provisions still affecting Physical AI:

Autonomous vehicles: AI systems in autonomous vehicles qualify as high-risk under Annex I (product safety component). Subject to conformity assessments, technical documentation, and human oversight requirements.
Surgical robots: real-time surgical AI falls into Software as a Medical Device (SaMD). High-risk classification. Rules for AI embedded in regulated medical products apply from August 2028.
Military exemption: the EU AI Act exempts AI systems designed for military, national defense, and national security purposes. This creates a regulatory gap where defense AI (autonomous weapons, military drones) operates outside the Act's framework entirely.
Enforcement reality: non-compliance can cost up to 7% of global annual revenue.

EU AI Act framework | May 7, 2026 Council press release

US approach (fragmented, sector-specific)

The US has no comprehensive federal AI regulation equivalent to the EU AI Act. Instead, regulation happens through existing agencies:

Agency	What they regulate
NHTSA	Autonomous vehicles. Crash reporting for self-driving cars. Investigation authority.
FDA	Medical devices including surgical robots and AI-enabled diagnostics. Draft guidance on AI-enabled Device Software Functions (January 2025). Lifecycle management rules being finalized in 2026.
FTC	Consumer protection from deceptive or unfair AI practices.
DoD	Military AI through the Responsible AI Strategy and autonomous weapons policies.
NIST	AI Risk Management Framework (AI RMF). Voluntary standards.

State-level activity is filling federal gaps:

Colorado AI Act (effective February 2026): most comprehensive US state AI legislation. Regulates high-risk AI in employment, housing, education, financial services.
California: multiple bills addressing automated employment decisions, insurance algorithms, political deepfakes.
NYC Bias Audit Law: mandates annual bias audits for automated employment decision tools.

Autonomous weapons regulation (global)

The most contested governance area in Physical AI. International efforts to regulate autonomous weapons have been ongoing for nearly a decade with limited progress:

UN Convention on Certain Conventional Weapons (CCW): the Group of Governmental Experts on Lethal Autonomous Weapons Systems (LAWS) has been meeting since 2014. Mandate extended to 2026. A "rolling text" summarizing possible regulatory elements exists, but geopolitical tensions have blocked binding agreements.
UN Secretary-General: has called for a legally binding treaty to prohibit LAWS that function without human control, to be concluded by 2026.
International Committee of the Red Cross (ICRC): urges states to negotiate new legally binding rules on autonomy in weapons systems, with both prohibitions and regulations.
Stop Killer Robots: global coalition campaigning against autonomous weapons. Active in 100+ countries.
The EU gap: the AI Act's military exemption means defense AI developed in Europe operates outside civilian AI governance.
The US reality: companies like Anduril, Shield AI, and Lockheed Martin are deploying autonomous weapons systems with human-in-the-loop oversight, but the degree of autonomy is increasing. The $20B Lattice contract (Anduril + US Army) puts autonomous decision-support across the military.

Industry response: NemoClaw (NVIDIA's first governance product)

NVIDIA announced NemoClaw at GTC 2026 as a security and governance layer for AI agents. It monitors the reasoning process of AI agents in real time and ensures their actions align with pre-defined safety guardrails. The key phrase from NVIDIA: it "inspects the intent of the AI's logic." This is the first product-level acknowledgment from a major Physical AI platform that governance of autonomous systems requires inspecting intent, not just monitoring actions.

NemoClaw works with AgenticROS, an open-source project that connects AI agents (Claude Code, Claude Desktop, Gemini, MCP) to ROS2 robots via natural language. The combination means you can now talk to a physical robot through Claude, and NemoClaw monitors whether the robot's AI is staying within its safety boundaries. This is exactly the governance-at-the-intent-layer problem.

Both are early. NemoClaw is a security product, not a full governance framework. It monitors and sandboxes; it does not address accountability, liability, auditability, or the broader policy questions. But it validates the direction.

AgenticROS GitHub | agenticros.com

The gap that nobody is filling

Physical AI governance is the least developed and most needed area of AI policy. The people building autonomous systems (engineers) and the people governing them (policy makers) rarely understand each other's domains. The governance questions in Physical AI (when can a robot act without human approval? who is liable when a surgical robot fails? how do you audit an autonomous vehicle's decision-making?) are extensions of the same questions the AI governance world is asking about software AI, but with physical consequences.

14. Skills that matter across Physical AI

The Physical AI stack rewards engineers who can move across layers: perception on the camera, planning in the model, control on the actuator, governance over the whole thing. The skills below show up in roles at every company in this document.

Skill	Where it is used	How to build it
Edge AI deployment	Every robot, drone, vehicle, camera system	Jetson Orin Nano + JetPack SDK + tutorials
Computer vision	Perception layer for all Physical AI systems	YOLO, DeepStream, Isaac ROS perception
Autonomous navigation	Ground vehicles, drones, mobile robots	JetBot/JetRacer, SLAM, path planning
Reinforcement learning	Training robots in simulation	Isaac Lab, Isaac Sim
Imitation learning	Teaching robots by demonstration	LeRobot, SO-ARM101, VR teleoperation
Sim-to-real transfer	Getting simulation-trained skills to work on real hardware	Isaac Sim + real robot
NVIDIA platform fluency	Table stakes for any role touching Physical AI	CUDA, JetPack, Isaac, Metropolis
ROS2	Standard robotics framework	Used by every serious robotics company
Foundation model fine-tuning	Adapting GR00T/Cosmos for specific tasks	GR00T GitHub, Hugging Face
Sensor fusion	Combining camera, LIDAR, IMU, force sensor data	Isaac ROS, multi-sensor projects
AI governance for physical systems	Safety, compliance, accountability for autonomous systems	Connect existing AI governance knowledge to robotics specifics

Role types

Role	Description	Companies hiring
Robotics Engineer	Full-stack robot development: perception, planning, control	Every company in this doc
Perception Engineer	Computer vision and sensor processing for robots	Figure, Agility, Boston Dynamics, Axon
Simulation Engineer	Building and maintaining digital twins and sim environments	NVIDIA, Figure, Anduril, Shield AI
Edge AI Developer	Deploying and optimizing AI models on Jetson/edge hardware	Axon, Overland AI, Aigen, every robot company
VLA / Foundation Model Researcher	Training and fine-tuning robot foundation models	NVIDIA, Skild AI, Figure, 1X
Autonomous Systems Engineer	Self-driving vehicles, drones, ground robots	Overland AI, Anduril, Shield AI, Skydio
Physical AI Governance / Strategy	Safety frameworks, compliance, ethics for autonomous systems	NVIDIA, Axon, Anduril, DoD, consulting

15. Key resources and links

Primary (start here)

Jetson AI Lab: hands-on tutorials for GenAI on Jetson. THE starting point.
Jetson Orin Nano Super (buy): the hardware entry point.
NVIDIA Robotics Platform: overview of the full platform.
Isaac GR00T GitHub: open-source humanoid foundation model.
LeRobot: open-source robotics framework.
SO-ARM100 GitHub: open-source robotic arm.

Key recent articles (verified May 2026)

Article	Date
NVIDIA GTC 2026 robotics roundup	March 2026
NVIDIA CES 2026 Physical AI announcements	January 2026
Anduril $5B Series H at $61B	May 13, 2026
EU AI Act Omnibus simplification agreed	May 7, 2026
Hugging Face Reachy Mini App Store launch	May 6, 2026
Axon Q1 2026 earnings	May 2026
Shield AI $12.7B valuation	March 2026
LeCun AMI Labs $1.03B seed	March 2026
Overland AI $100M raise	February 2026

16. Full glossary

AI and computing

Term	Definition
TOPS	Tera Operations Per Second. Measures how many trillion math operations an AI chip performs per second. More TOPS is faster inference. Jetson Orin Nano = 67 TOPS.
TFLOPS	Tera Floating-Point Operations Per Second. Like TOPS but for floating-point math. Jetson Thor = 2,070 FP4 TFLOPS.
CUDA	NVIDIA's parallel computing platform. Nearly all AI frameworks run on CUDA. Skills/code transfer between any NVIDIA GPU. The Windows of AI hardware.
NPU	Neural Processing Unit. A chip designed specifically for AI inference. Apple has one in every M-series chip.
Edge AI	Running AI models on local hardware instead of cloud servers. Low latency, privacy, no API costs.
GGUF	File format for quantized LLM models used by llama.cpp and Ollama. Runs on CPU.
MLX	Apple's machine learning framework for Apple Silicon. Runs natively via Metal GPU. Faster than GGUF on Mac.
Inference	Running a trained model to make predictions. Distinct from training. Edge AI is about fast, efficient inference on local hardware.
Quantization	Compressing a model by reducing the precision of its numbers (e.g., from 16-bit to 4-bit). Smaller and faster at slight accuracy cost.

Robotics

Term	Definition
DOF	Degrees of Freedom. The number of independent axes a robot can move along. Human arm = ~7 DOF.
ROS / ROS2	Robot Operating System. Industry-standard open-source framework for robotics software.
SLAM	Simultaneous Localization and Mapping. A robot builds a map of its environment while tracking its own position within that map.
Imitation Learning	Teaching a robot by demonstration. Move a leader arm, the follower watches and learns.
Reinforcement Learning (RL)	Robot learns by trial and error. Tries things, gets rewards/penalties, improves.
VLA	Vision-Language-Action model. Takes visual + language input and outputs motor actions. The architecture behind GR00T.
VLM	Vision-Language Model. Like an LLM but also processes images. Cosmos Reason 2 is a VLM.
Sim-to-Real Transfer	Getting skills learned in simulation to work on real hardware. THE central engineering challenge of Physical AI.
Digital Twin	A photorealistic virtual copy of a real environment. Built in Isaac Sim / Omniverse.
LIDAR	Light Detection and Ranging. A sensor that measures distance by bouncing laser light off surfaces. Creates 3D maps.
IMU	Inertial Measurement Unit. A sensor that measures acceleration and rotation. Used for balance and orientation.
CSI Camera	Camera Serial Interface. Direct ribbon-cable connection between camera sensor and board. Lower latency than USB.
End Effector	The tool at the end of a robotic arm. Could be a gripper, a welding torch, a camera, or a surgical instrument.

NVIDIA-specific

Term	Definition
Isaac	NVIDIA's robotics platform family: Isaac Sim, Isaac Lab, Isaac ROS, Isaac GR00T. Named after Isaac Asimov.
GR00T	Generalist Robot 00 Technology. NVIDIA's foundation model family for humanoid robots. Open-source. Current: N1.7.
Cosmos	NVIDIA's world foundation model family. Enables robots to "imagine" physical world outcomes.
Newton	NVIDIA's open-source GPU physics engine. Built with Google DeepMind and Disney Research. GA April 2026.
Omniverse	NVIDIA's 3D simulation and collaboration platform. Isaac Sim is built on Omniverse.
JetPack	The base SDK for all Jetson hardware. Unified across all Jetson boards.
DeepStream	NVIDIA's SDK for multi-camera video analytics pipelines.
TensorRT	NVIDIA's inference optimizer. Converts PyTorch/TensorFlow models for fast deployment.
Metropolis	NVIDIA's Vision AI platform. Tools and models for video analytics applications.
NemoClaw	NVIDIA's security and governance layer for AI agents (GTC 2026). "Inspects the intent of the AI's logic."
AgenticROS	Open-source project connecting AI agents (Claude Code, Claude Desktop, Gemini, MCP) to ROS2 robots via natural language.
Coral TPU	Google's Edge TPU accelerator. 4 TOPS. Not NVIDIA, but commonly used alongside Jetson in the edge AI ecosystem.

LeCun / AMI Labs

Term	Definition
JEPA	Joint Embedding Predictive Architecture. LeCun's alternative to LLMs and generative models. Predicts in latent space rather than predicting pixels or tokens.
AMI Labs	Advanced Machine Intelligence Labs. Paris-based startup founded by LeCun in late 2025. Raised $1.03B at $3.5B valuation in March 2026.
V-JEPA 2	Video JEPA version 2. 1.2B parameter world model trained on 1M+ hours of video. Zero-shot robot control 65-80%.
V-JEPA 2.1	March 2026 release. Dense, spatially-precise features. All tokens contribute to the self-supervised training loss.
Latent Space	An abstract mathematical representation of data learned by a model. JEPA predicts in latent space rather than surface space.
World Model	An AI system that predicts how the physical world will change in response to actions. Cosmos and JEPA are competing approaches.

Autonomous vehicles

Term	Definition
FSD	Full Self-Driving. Tesla's autonomous driving software. End-to-end neural networks trained on fleet driving data. Version 14.3 (April 2026).
ADAS	Advanced Driver Assistance Systems. Lane keeping, adaptive cruise control, AEB. Mobileye powers ADAS in 800+ vehicle models.
Robotaxi	A fully autonomous vehicle operating as a taxi with no human driver. Waymo operates commercial robotaxis. Tesla Cybercab (no steering wheel) began continuous production April 2026.
Sensor Fusion	Combining data from multiple sensors (cameras, LIDAR, radar, ultrasonic) to build a unified understanding of the environment.

Defense

Term	Definition
CCA	Collaborative Combat Aircraft. U.S. Air Force program to pair manned fighter jets with autonomous drone wingmen. Anduril builds the airframe, Shield AI provides the autonomy.
Replicator	Pentagon program to mass-produce affordable autonomous systems. $1B across two tranches.
ISR	Intelligence, Surveillance, and Reconnaissance. Primary mission for military drones.
UAS / UxS	Unmanned Aerial System / Unmanned Systems. Drones and autonomous vehicles of all types.
Counter-UAS	Technology for detecting and defeating enemy drones. Axon's Dedrone subsidiary.
GPS-Denied	Environments where GPS signals are jammed or unavailable. Overland AI specializes in this.
Lattice	Anduril's autonomous software platform. The "operating system" connecting all of Anduril's hardware and third-party systems on the battlefield. $20B U.S. Army contract (March 2026).
Hivemind	Shield AI's autonomous piloting software. Flies aircraft without human input or GPS.
Barracuda-500M	Anduril's surface-launched cruise missile. Framework agreement with U.S. Department of War for 3,000+ units over three years (May 2026).