Physical AI AV Reasoning Challenge banner
2026 · NVIDIA Physical AI · Alpamayo Research

Physical AI AV Reasoning Challenge

A long-tail benchmark for chain-of-causation reasoning over rare, ambiguous, and safety-critical driving scenarios — pedestrian-dense interactions, work zones, complex intersections, emergency scenes, and more.

01 Overview

Autonomous driving stacks behave well on the head of the distribution, but the long tail — rare, ambiguous, or interactive scenarios — is where most safety-critical failures occur. This challenge invites the research community to build models that can reason about these long-tail scenarios in natural language, building on NVIDIA's public PAI-AV Dataset.

Models will be evaluated on a curated out-of-distribution test set mined from a large physical-AI autonomous-driving corpus. Each scenario is anchored at a precise keyframe and annotated with a chain-of-causation describing the relevant agents, interactions, and the appropriate ego behavior.

02 Challenge Tracks

The 2026 edition has two tracks: a chain-of-causation reasoning-generation track, and an open auto-labeling leaderboard for the research community.

Track 1 · Reasoning

Chain-of-Causation Generation

Input: a multi-camera driving clip and an event window.
Output: a free-form natural-language explanation that identifies the relevant agents, the interactions that make the scenario challenging, and the recommended ego behavior at the keyframe.

Submission format, evaluation protocol, and scoring details will be released on June 15, 2026.

Track 2 · Auto-Labeling

Reasoning Auto-Labeling

Input: the validation clips of the out-of-distribution reasoning set.
Output: automatically generated chain-of-causation reasoning labels for each clip.

Submission format and details released on June 15, 2026.

03 Timeline

  1. 2026-06-04 Challenge announcement (CVPR 2026) Public preview at the CVPR / Computex venue.
  2. 2026-06-15 Evaluation server opens Submission portal goes live with full evaluation methodology, tracks, data, and submission details.
  3. 2026-10-31 Submission deadline Final submissions accepted; the public leaderboard closes.
  4. 2026-11-15 Final results released Final results and award decisions released to participants.
  5. Early December 2026 Winners announced Top entries highlighted and final analysis shared.

Dates are tentative and subject to update.

04 Prize

The winner of Track 1 will be awarded an NVIDIA DGX Spark. Track 2 runs as a leaderboard-only benchmark for the community.

DGX

NVIDIA DGX Spark

Awarded to the Track 1 winner. Personal AI supercomputer for desktop development and inference.

05 Organizers

This competition is hosted by NVIDIA's Autonomous Vehicle Research Group.

Host

NVIDIA Autonomous Vehicle Research Group

Interdisciplinary NVIDIA Research team advancing vehicle autonomy across perception, prediction, planning, control, simulation, foundation models, and AI safety.

Bookmark this page — full challenge details, evaluation methodology, and submission instructions will be published when the evaluation server opens.