Rui Liu

University of Maryland, College Park
Ph.D. in Computer Science, working with Prof. Pratap Tokekar and Prof. Ming Lin

August 2022 - Now

University of Maryland, College Park
Master in Computer Science

May 2024

Shanghai Jiao Tong University
B.S. in Mechanical Engineering

June 2020

Tencent AI Lab
Research Intern. Working on Multimodal LLM reasoning, post-training with RL.

May 2025 - Nov 2025
Bellevue, WA

Apple
PhD Intern. Worked on ML algorithms development and data analysis.

May 2023 - Aug 2023
Cupertino, CA

University of Maryland, College Park
Graduate Research Assistant. Working on AI, ML and Robotics.

Aug 2022 - Now
College Park, MD

Tencent Robotics X
Research Intern. Worked on quadruped robotics algorithms development and gait planning.

Jun 2020 - Nov 2020
Shenzhen, China

The Chinese Unversity of Hong Kong
Research Intern. Worked on surgical robotics motion planning.

Jul 2019 - Sep 2019
Hong Kong, China

Shanghai Jiao Tong University
Research Assistant. Worked on electric vehicle heat pump systems.

Mar 2019 - May 2020
Shanghai, China

Dual-Uncertainty Guided Policy Learning for Multimodal Reasoning

R. Liu, D. Yu, T. Zheng, R. Dai, Z. Li, W. Yu, Z. Liang, L. Song, H. Mi, P. Tokekar, D. Yu

A method that guides policy learning using measured dual-uncertainty (output and perceptual) as feedback signals to encourage exploration and enhance reasoning in multimodal LLMs.

Preprint, 2026

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

R. Liu, D. Yu, L. Ke, H. Liu, Y. Zhou, Z. Liang, H. Mi, P. Tokekar, D. Yu

A group-free RLVR approach that achieves both stable optimization and effective multimodal reasoning performance.

CVPR, 2026

Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty

R. Liu, P. Tokekar, M. Lin

We propose Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty (A2MAML), a principled approach for uncertainty-aware, modality-level collaboration.

Preprint, 2026

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

T. Zheng, C. Huang, R. Dai, Y. He, R. Liu, X. Ni, H. Bao, K. Wang, H. Zhu, J. Huang, F. Huang, H. Huang

A training-free controller for efficient parallel thinking by using consensus-based early stopping and deviation-based branch pruning to reduce computational costs while maintaining accuracy.

Preprint, 2026

Save the Good Prefix: Precise Error Penalization via Process-Supervised RL to Enhance LLM Reasoning

H. Liu, D. Yu, S. Lu, Y. Zhou, R. Liu, Z. Liang, H. Mi, C. Wei, D. Yu

An LLM training framework that improves reasoning by using Process Reward Models (PRMs) to identify the first error in a chain-of-thought, penalizing only the subsequent incorrect steps while rewarding the preceding good prefix.

Preprint, 2026

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Z. Li, W. Yu, C. Huang, R. Liu, Z. Liang, F. Liu, J. Chen, D. Yu, J. Boyd-Graber, H. Mi, D. Yu

Vision-SR1 uses RL to enhance reasoning in vision-language models by decomposing the process into visual perception and language reasoning stages, improving accuracy and reducing hallucinations.

ICLR, 2026

Adaptive Conformal Guidance for Learning under Uncertainty

R. Liu, P. Gao, Y. Shen, M. Lin, P. Tokekar

A broadly applicable framework that dynamically modulates guidance signals based on associated uncertainty, providing a simple yet effective solution for incorporating uncertainty-aware guidance across diverse machine learning systems.

ICLR, 2026

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

T. Zheng, H. Zhang, W. Yu, X. Wang, R. Dai, R. Liu, H. Bao, C. Huang, H. Huang, D. Yu

A RL framework that enhances LLMs reasoning capabilities by enabling parallel thinking through a progressive curriculum.

ICLR, 2026

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

R. Dai, L. Song, H. Liu, Z. Liang, D. Yu, H. Mi, Z. Tu, R. Liu, T. Zheng, H. Zhu, D. Yu

CDE enhances Reinforcement Learning with Verifiable Rewards (RLVR) by using intrinsic curiosity signals from the actor and critic to improve exploration and reduce premature convergence in LLMs.

ICLR, 2026

CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems

R. Liu, Y. Shen, P. Gao, P. Tokekar, M. Lin

A multi-modal multi-agent framework that enables agents to collaborate and share multi-modal data during training while allowing inference with reduced modalities during testing, which is especially beneficial for deployment in resource-constrained environments.

NeurIPS, 2025

MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation

R. Liu, P. Gao, Y. Shen, P. Tokekar, M. Lin

A multi-modal collaborative decision-making approach for connected autonomy.

IROS, 2025

IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition

R. Liu, Z. Mahammad, A. Bhaskar, P. Tokekar

A representation learning approach for robust imitation learning in robotic manipulation for food acquisition.

ICRA, 2025

Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis

R. Liu, A. Gupta, E. Noorani, P. Tokekar

A thorough iteration complexity analysis for the risk-sensitive policy gradient method.

Preprint, 2025

LAVA: Long-horizon Visual Action based Food Acquisition

A. Bhaskar, R. Liu, V. Sharma, G. Shi, P. Tokekar

A long-horizon visual action learning approach for robotic manipulation in food acquisition of liquid, semisolid, and deformable foods.

IROS, 2024

Data-Driven Distributionally Robust Optimal Control with State-Dependent Noise

R. Liu, G. Shi, P. Tokekar

A data-driven technique for estimating the uncertainty distribution and corresponding KL divergence bound for distributionally robust optimal control (DROC).

IROS, 2023

Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across Varied Bowl Configurations and Food Types

R. Liu, A. Bhaskar, P. Tokekar

An adaptive visual imitation learning approach for robotic scooping tasks in assistive feeding.

ICRA, 2024

Rui Liu

Ph.D. student

Education

Employment

Publications