Xu Guo

Hi! I'm currently a second-year Master student at Tsinghua University. Previously, I received my B.Eng. degree in Electronic Information Engineering from Tianjin University.

Email  /  Google Scholar  /  Github  / 

profile photo

Research

My primary research interests lie in reinforcement learning and generative models, with a specific focus on long video generation and world models.

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Xu Guo*, Fulong Ye*, Qichao Sun*, Liyang Chen, Bingchuan Li, Pengze Zhang, Jiawei Liu, Songtao Zhao, Qian He, Xiangwang Hou
ICML 2026
Project Page  /  Paper  /  Code 🔥  GitHub stars
DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Xu Guo*, Fulong Ye*, Xinghui Li*, Pengqi Tu, Pengze Zhang, Qichao Sun, Songtao Zhao, Xiangwang Hou, Qian He
Preprint 2026
Project Page  /  Paper  /  Code 🔥  GitHub stars
X2Edit
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
Jian Ma*, Xujie Zhu*, Zihao Pan, Qirong Peng, Xu Guo, Chen Chen, Haonan Lu
AAAI 2026
Paper  /  Code  GitHub stars
X2I
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Jian Ma*, Qirong Peng*, Xu Guo, Chen Chen, Haonan Lu, Zhenyu Yang
ICCV 2025
Paper  /  Code  GitHub stars
AMADP
Adaptive AUV Hunting Policy with Covert Communication via Diffusion Models
Xu Guo, Xiangwang Hou, Minrui Xu, Jianrui Chen, Jingjing Wang, Jun Du, Yong Ren
ICC 2025 (best paper award 17/2500(Top 0.68%))
Paper  /  Code

Experience

kling
Research Intern, Kling Kuaishou, Beijing, China
2026.03 - present
Collaborators: Xintao Wang, Xinghui Li, etc.
Bytedance
Research Intern, Bytedance Intelligent Creation Lab, Beijing, China
2025.04 - 2026.03
Collaborators: Qian He, Fulong Ye, etc.
OPPO
Research Intern, OPPO, Shenzhen, China
2024.12 - 2025.04
Collaborators: Jian Ma, Haonan Lu, etc.
Galaxea AI
Research Intern, Galaxea AI, Beijing, China
2024.04 - 2024.08
Collaborators: Prof. Huazhe Xu, Prof. Hang Zhao, etc.

Design and source code from Jon Barron's website.