I’m currently a junior undergraduate student (enrolled since Fall 2023) at IIIS (Yao Class), Tsinghua University, pursuing a Bachelor’s degree in Computer Science and Technology.
My research goal is to develop fundamental models with an intrinsic understanding of the world and to apply them to achieve general decision intelligence. Currently, my research interests include:
World Models: Visual World Models, Object-Centric World Models, Grounding Foundation Models (e.g., Video Diffusion Models, LLMs) to World Models.
Generalist Robot Policies: VLAs, Humanoid Foundation Models, Video-Centric Generalist Policies.
Recently, I have become very interested in the theoretical foundations of machine learning and robotics, especially generative modeling, sequence prediction, and robot learning.
News
[Nov. 2025] 🏆 Honored to receive the Comprehensive Excellence Award of Tsinghua.
[Aug. 2025] 🐋 SURGE was accepted to EMNLP 2025 Main, with a top 0.3% meta score!
[May 2025] 🔥 I became a member of the Sparking Program, the most prestigious and selective academic organization for students at Tsinghua University (top 1%).
[May 2025] 📈 TrajWorld was accepted to ICML 2025.
[Nov. 2024] 🏆 Honored to receive the Comprehensive Excellence Award of Tsinghua.
[Nov. 2024] 🏆 Glad to receive the Outstanding Sports Scholarship of Tsinghua.
Education
B.S. in Computer Science, Tsinghua University, 2023-2027 (expected).
Institute for Interdisciplinary Information Sciences (Yao Class), Tsinghua University.
GPA: 3.93/4.00, Rank: 9/91.
Selected Courses: Natural Language Processing (A+), Algebra and Computation (A+, Top 1), Fundamentals of Programming (A+), Multi-modal Machine Learning (A), Deep Learning (A), Computer Vision (A), Introduction to Computer Systems (A).
More Selected Courses: Basic Principles of Marxism (A+), The History of Western Music (A+), Discrete Mathematics II (A), Fundamentals of Computer Science (A), Advanced Topics in Linear Algebra (A), Calculus-A II (A), Physics I (A).
A curated list of awesome works in world modeling, aiming to serve as a one-stop resource for everyone interested in world modeling. [🧵 Twitter Thread]
1. Designed and implemented a PostgreSQL-based course-sharing platform, using Scala for the backend and React for the frontend.
2. Used the Stable Diffusion 2 and Llama 2 APIs to enhance the user experience.
A 2D "Stickman vs. CAD"-themed game developed in Unity. Players, taking the form of stick figures, explore a world inside CAD software through movement, skills, and various interactions.
We propose “Watch-and-Learn”, a multimodal framework that efficiently enhances MLLMs' reasoning abilities in counting tasks by integrating function calls.