Siqiao Huang | Tsinghua University

About Me

I’m currently a junior undergraduate student (from 2023 Fall) in IIIS (Yao Class), Tsinghua University, pursuing a Bachelor’s degree in Computer Science and Technology.

In the summer of 2024, I joined THUML under the supervision of Prof. Mingsheng Long.

I welcome any collaboration or discussion, whether with seniors or peers. Please feel free to reach out!

Some picture options: ( I'll try to keep this up to date)

Inspired by Pieter Abbeel's homepage. Photos are taken within the past year.

Giving a talk on my recent work (first from the right) 🗣️

Eating 😋

Hanging out with friends (second from left) 🤣

Cuddling my dog at home 🐕

Research Interests

My research goal is to develop Fundamental models with intrinsic understandings of the world and apply these to obtain general decision intelligence. Currently, my research interests include:

World Models: State-based World Models, Visual World Models, Grounding Foundation Models(e.g. Video Diffusion Models, LLMs) to World Models.
Data-driven reinforcement learning & decision making: Model-Based Reinforcement Learning, Offline Reinforcement Learning, Imitation Learning.

News

[May. 2025] I became a member of the Sparking Program, the most prestigious and selective academic organization for students at Tsinghua University (top 1% in university).
[May. 2025] 📈TrajWorld is accepted by ICML, 2025.
[Jan. 2025] My personal blog website is officially online!
[Nov. 2024] Honored to receive Comprehensive Excellence Award of Tsinghua.
[Nov. 2024] Glad to receive Outstanding Sports Scholarship of Tsinghua.

Education

B.S. in Computer Science, Tsinghua University, 2023-2027 (expected).
Institute for Interdisciplinary Information Sciences (Yao Class), Tsinghua University.
GPA: 3.93/4.00, Rank: 12/93.
Selected Courses: Natural Language Processing (A+), Algebra and Computation (A+, Top 1), Fundamentals of Programming (A+), Multi-modal Machine Learning (A), Deep Learning (A), Computer Vision (A), Introduction to Computer Systems (A).
More Selected Courses:
Basic Principles of Marxism (A+), The History of Western Music (A+), Discrete Mathematics II (A), Fundamentals of Computer Science (A), Advanced Topics in Linear Algebra (A), Calculus-A II (A), Physics I (A).

Publications

arXiv

Vid2World: Crafting Video Diffusion Models to Interactive World Models

Siqiao Huang*, Jialong Wu*, Qixing Zhou, Shangchen Miao, Mingsheng Long

arXiv preprint, 2025.

arXiv PDF Project Page BibTex Huggingface

arXiv

SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors

Bohan Lyu*, Siqiao Huang*, Zichen Liang*, Qian Sun,
Jiaming Zhang

arXiv preprint, 2025.

arXiv PDF BibTex Huggingface

ICML

Trajectory World Models for Heterogeneous Environments

Shaofeng Yin*, Jialong Wu*, Siqiao Huang, Xingjian Su, Xu He, Jianye Hao, Mingsheng Long#

International Conference on Machine Learning (ICML), 2025.

arXiv PDF BibTex

* Equal Contribution, # Corresponding Author

Projects

A Survey on K-means Clustering Algorithms: Theoretical Analysis and Performance Comparison

Elucidated the computational complexity and convergence properties of K-means clustering algorithms and its variants.

PDF Project Page

DreamFactory : Grounding Language Models to World Models

We investigated the feasibility of utilizing language models as text-based world models. Through empirical study, we found that the performance is greatly hindered by overlengthy CoTs, and we proposed DreamFactory, a novel architecture to address this issue.

PDF Code

MANIGEN: generative simulation pipeline with maniskill2

We introduce ManiGen, a generative simulation pipeline using ManiSkill to automate task creation. It utilizes the power of LLMs to propose tasks, generate scenes, and produce task-specific code for rewards, parameters, and metrics.

PDF Code Project Page

Course Sharing Platform

1. Designed and implemented a PostgreSQL-based course sharing platform using Scala for backend and React for frontend 2. Utilized Stable Diffusion 2 and Llama 2 API to enhance users experiences

Language: Scala, HTML, CSS, JavaScript

Code Video

CAD Escape Game

A 2D Stickman vs CAD-themed game, developed using Unity. In this game, players, taking form as stick figures, explore a world within a CAD software through movement, skills, and various interactions.

Language: C#, Unity

PDF Code THU Software Design Contest 2nd Prize(2024)

Watch And Learn: Empowering MLLMs to Count Like Humans

We propose “Watch-and-Learn”, a multimodal framework that efficiently enhances MLLMs' reasoning abilities in counting tasks by integrating function calls.

PDF

Honors & Awards

[2025] Spark Scientific and Technological Innovation Fellowship (top 1%, 30 out of 3000)
[2024] Outstanding Sports Scholarship of Tsinghua University
[2024] Comprehensive Excellence Award of Tsinghua University

Professional Services

Teaching Assistant

Introduction to Artificial Intelligence, Spring 2025. Instructor: Prof. Mingsheng Long.

Reviewer

Language

TOEFL: Total Score 117/120 (On First Trial, Speaking 30/30).
CET-4: 688/710, CET-6: 685/710.

Miscs

I love playing the piano and singing songs. I’m a member of Tsinghua University Chorus. I can also play the chinese flute quite well.
I’m also really into sports, especially basketball and running. I’m a member of the IIIS basketball team and a huge fan of the Golden State Warriors.
I love animals (especially dogs). My parents and I raised three lovely Poodles.
In high school, I was quite into Physics & Chemistry, and participated in Olympiad in Physics and Olympiad in Chemistry.
I am also a tech blogger, here’s a link to my blog website.