About Me

Ciao! I am Wenhui (Oscar) Huang, a Postdoctoral Fellow in Harvard Computational Robotics Group and Harvard Embodied Minds Lab, co-advised by Prof. Heng Yang and Prof. Yilun Du. I was a PhD and Research Associate in Automated Driving and Human-Machine System Lab at Nanyang Technological University (NTU), advised by Prof. Chen Lv, and a Visiting PhD Scholar in Harvard Computational Robotics Group. Previously, I had the pleasure of collaborating with Prof. Francesco Braghin during my time at the i.Drive Lab.

Research Interests

Research Field Research Application
Reinforcement Learning Intelligent Tranporatation Systems
Continual Learning Robotics
VLM and VLA Autonomous Vehicles
Human-in-the-Loop AI Visual Navigation

🔥 News

  • Feb. 2026:  🏅🏅 I am honored to be recognized as a featured Postdoctoral Researcher by MAE department, NTU!
  • Dec. 2025:  ☃️☃️ Our work on enhancing open-ended reasoning of AD-tailored VLM through reinforced fine-tuning (RFT) is now available on arXiv! OpenREAD
  • Oct. 2025:  🍁🍁 Our work on Mixture-of-Transformers-based VLA with unified Fast-Slow reasoning is now available on arXiv! MoTVLA
  • Aug. 2025:  🏅🏅 I am awarded as a Winner of Asia-Pacific Region in Future Leaders Program of ITS World Congress 2025, and will give an invited talk in Atlanta, see you there!
  • May. 2025:  🏅🏅 Our team secured the Champion in 𝗜𝗧𝗦𝗦 𝘅 𝗟𝗧𝗔 𝗛𝗮𝗰𝗸𝗮𝘁𝗵𝗼𝗻 2025!
  • Apr. 2025:  🏅🏅 I am honored to be recognized as a featured PhD candidate in ITS field by IEEE Intelligent Transportation Systems Society (ITSS)!
  • Mar. 2025:  🌱🌱 Our paper on deep imitative reinforcement learning-based decision-making for autonomous driving has been accepted by Transportation Research Part C!
  • Jan. 2025:  ☃️☃️ Our paper on integrated hybrid prediction and planning for autonomous driving has been accepted by TPAMI!
  • Dec. 2024:  🎄🎄 Our work on VLM-based closed-loop autonomous driving is now available on arXiv! WiseAD
  • Jul. 2024:  🏖️🏖️ Our ITSC 2025 invited session on Learning-empowered Intelligent Transportation Systems: Foundation Vehicles and Coordination Technique received 11 paper submissions, and 10 papers were accepted. Congratulations to all the authors! Looking forward to seeing you in Edmonton, Canada!
  • Jun. 2024:  🏖️🏖️ Our paper on safety human-in-the-loop RL for autonomous driving has been accepted by T-ITS!
  • Jun. 2024:  🏆🏆 Our team secured first place in the Waymo Open Dataset Occupancy Flow Challenge and second place in the Sim Agents Challenge! Check out our technical reports on the Waymo challenge website and CVPR 2024 Workshop on Autonomous Driving.
  • May. 2024:  🌱🌱 Visited Gran Turismo Sophy team at Sony AI, Japan, and gave a talk on RL for autonomous racing!
  • Apr. 2024:  🌱🌱 Our paper on robust RL for autonomous driving has been accepted by Transportation Research Part C: Emerging Technologies!
  • Jan. 2024:  ☃️☃️ Our paper on uncertainty-aware RL with human guidance for autonomous driving has been accepted by ICRA! See you in Yokohama, Japan!
  • Nov. 2023:  ☃️☃️ Our paper on robust RL for decision-making has been accepted by Engineering!
  • Oct. 2023:  🍁🍁 Our paper on learning from demonstration for RL-based eco-driving has been accepted by Energy!
  • Sep. 2023:  🍁🍁 Our paper on sampling efficient preference-guided DQN (PGDQN) algorithm has been accepted by TNNLS!
  • Sep. 2023:  🍁🍁 Our paper on goal-oriented transformer-based autonomous navigation has been accepted by T-ITS!
  • Sep. 2023:  🏅🏅 We won the best paper runner-up award in ITSC 2023!
  • Dec. 2022:  🥈🥈 Our team secured second place at Intelligent Algorithm Final of Alibaba Global Future Vehicle Challenge!

📝 Publications

Highlights

arXiv'25
动态演示>

MoTVLA: A Vision-Language-Action Model with Unified Fast-Slow Reasoning

Wenhui Huang, Changhe Chen, Han Qi, Chen Lv, Yilun Du, Heng Yang

Project

  • We propose a Mixture-of-Transformers–based vision–language–action (VLA) model, termed MoTVLA, featuring unified fast–slow reasoning to enable language-steerable action diffusion with motion decomposition. This design enhances both semantic understanding (general & robotics reasoning) and manipulation performance (Simulation & Real-World) in long-horizon tasks.

The source code and model checkpoints will be released by March 2026.

arXiv'25
sym

OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic

Songyan Zhang1, Wenhui Huang1, Zhen Chen, Collister Chua, Qihang Huang, Chen Lv
1Equal Contributions

Project

  • We enhance open-ended reasoning through reinforcement fine-tuning (RFT) by introducing an LLM-as-Critic mechanism, and achieve joint training with trajectory planning for VLM-based autonomous driving.
arXiv'24
sym

WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model

Songyan Zhang1, Wenhui Huang1, Zihui Gao, Hao Chen, Chen Lv
1Equal Contributions

Project

  • we investigate the effects of the depth and breadth of fundamental driving knowledge on closed-loop driving performance and introduce WiseAD, a specialized VLM tailored for end-to-end autonomous driving capable of driving reasoning, action justification, object recognition, risk analysis, driving suggestions, and trajectory planning across diverse scenarios.
T-ITS'24
动态演示>

Safety-aware human-in-the-loop reinforcement learning with shared control for autonomous driving

Wenhui Huang, Haochen Liu, Zhiyu Huang, Chen Lv

Project

  • we propose a safe human-in-the-loop RL (SafeHiL-RL), combined with curriculum guidance, to enable safe and efficient decision-making policies for autonomous vehicles.
ICRA'24
sym

Uncertainty-aware Reinforcement Learning for Autonomous Driving with Multimodal Digital Driver Guidance

Wenhui Huang, Zitong Shan, Shanhe Lou, Chen Lv

Video: Youtube | BiliBili

Project

  • We propose an uncertainty-aware RL (UnaRL) framework, incorporating multimodal human guidance, to achieve robust decision-making policies for autonomous vehicles.
T-ITS'23
动态演示>

Goal-guided Transformer-enabled Reinforcement Learning for Efficient Autonomous Navigation

Wenhui Huang, Yanxin Zhou, Xiangkun He, Chen Lv

Sim to Real Video: Youtube | BiliBili

Project

  • We propose a goal-oriented transformer-based RL algorithm to realize efficient and adaptative visual navigation.
TNNLS'23
动态演示>

Sampling Efficient Deep Reinforcement Learning Through Preference-Guided Stochastic Exploration

Wenhui Huang, Cong Zhang, Jingda Wu, Xiangkun He, Jie Zhang, Chen Lv

Project

  • We propose a sampling efficient stochastic exploration policy, preference-guided DQN (PGDQN), to enhance the performance of reinforcement learning algorithms.

Selected Publications

🎖 Honors and Awards

📚 Academic Services

Program Committee

Journal Reviewer

  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • IEEE Transactions on Intelligent Transportation Systems (T-ITS)
  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
  • IEEE Transactions on Vehicular Technology (TVT)
  • IEEE Transactions on Intelligent Vehicles (TIV)
  • IEEE Transactions on Automation Science and Engineering (T-ASE)
  • IEEE Robotics and Automation Letters (RA-L)
  • The International Journal of Robotics Research (IJRR)
  • Transportation Research Part C: Emerging Technologies (Part C)
  • Engineering

Conference Reviewer

  • Forty-Third International Conference on Machine Learning (ICML) 2026
  • IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2025
  • IEEE International Conference on Robotics and Automation (ICRA) 2023 – 2026
  • IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023 - 2025
  • IEEE Intelligent Vehicles Symposium (IV) 2023 – 2026
  • IEEE Intelligent Transportation Systems Conference (ITSC) 2023 – 2026

💬 Invited Talk

  • Sep. 2025, Toward Country-scale Traffic Foundation Models: A Safe Vision-Language-Transportation System
    ITS World Congress 2025, Atlanta, USA.
  • May. 2025, Collective Continual Evolution of Decision-Making Systems with Policy-Language Alignment
    MIT CLEAR Lab , Boston, USA
    UCLA Mobility Lab, LA, USA.
  • Mar. 2025, Reinforced Continual Evolution of Decision-Making Systems
    School of Vehicle and Mobility, Tsinghua University, Beijing, China.
    Traffic Operations and Simulations Lab, Tongji University, Shanghai, China.
  • Feb. 2025, Reinforced Continual Evolution of Decision-Making Systems
    Engineering Research Center for Intelligent Electrified Mobility, SouthEast University, Nanjing, China.
    Prof. Chunhua Shen’s Lab, Zhejiang University, Hangzhou, China.
  • Nov. 2024, Reinforced Continual Evolution of Decision-Making Systems, Harvard Computational Robotics Group, Harvard University, Cambridge, USA.
  • Sep. 2024, Reinforcement Learning-empowered Decision Making for Autonomous Driving under Knowledge Guidance, Michigan Traffic Lab, University of Michigan, Michigan, USA.
  • May. 2024, Uncertainty-aware Reinforcement Learning for Autonomous Driving and Autonomous Racing, Gran Turismo Sophy team, Sony AI, Tokyo, Japan.

👀 Visits: Loading...