Foundation Model-Enabled Scene Understanding, Reasoning, and Decision-Making for Autonomous Driving and ITS

Songyan Zhang, Wenhui Huang, Yuxiao Chen, Peng Hang,
Zhiyu Huang, Jian Sun, Xiaoyu Mo, Chen Lv

IEEE International Conference on Intelligent Transportation Systems (ITSC 2025)

Call for Papers (Code:4315u)

Scope


This year's invited session continues the focus on foundation models, with a particular emphasis on large language models (LLMs) and vision-language models (VLMs) as emerging tools for advancing scene understanding, reasoning, and decision-making in autonomous driving and intelligent transportation systems (ITS). Recent developments in LLMs and VLMs have shown promising abilities in visual-linguistic reasoning, structured decision-making, and interpreting complex, multimodal inputs. These capabilities open new possibilities for improving the robustness, interpretability, and generalization of autonomous systems in diverse traffic environments. These models offer new pathways to incorporate general, human-like knowledge into machine perception and cognition, enabling more capable and context-aware autonomous systems. The session will explore a broad set of topics, including, but not limited to, multimodal perception, knowledge-driven planning, the development of traffic- and driving-specific LLMs/VLMs, integration of these models with reinforcement learning, and knowledge-assisted continual and lifelong learning. In addition, the session will highlight emerging directions such as vision-language-action (VLA) models, hybrid learning frameworks, and the integration of foundation models with traditional ITS methods.

Topics

  1. LLMs/LMs enhanced situational awareness, scene understanding, and reasoning.
  2. Foundation-model enhanced prediction, decision-making and control.
  3. Knowledge-distillation for autonomous driving.
  4. Embodied intelligence in autonomous driving and ITS.
  5. Explainable and interpretable autonomous driving.
  6. Human-in-the-loop AI and human–vehicle interactions.
  7. Driving safety validation and closed-loop evaluation.
  8. System safety and cyber security of foundation model-based systems.
  9. Simulation and real-world deployment of learning-based driving systems.

Potential Papers

Organizer

Songyan Zhang
NTU

MY ALT TEXT

Wenhui Huang
Harvard

MY ALT TEXT

Yuxiao Chen
Nvidia

MY ALT TEXT

Peng Hang
Tongji

MY ALT TEXT

Zhiyu Huang
UCLA

MY ALT TEXT

Jian Sun
Tongji

MY ALT TEXT

Xiaoyu Mo
KTH

MY ALT TEXT

Chen Lv
NTU

MY ALT TEXT
MY ALT TEXT
MY ALT TEXT
MY ALT TEXT
MY ALT TEXT
MY ALT TEXT
MY ALT TEXT