Research
My primary research interests lie in reinforcement learning and generative models, with a specific focus on long video generation and world models.
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Xu Guo*, Fulong Ye*, Qichao Sun*, Liyang Chen, Bingchuan Li, Pengze Zhang, Jiawei Liu, Songtao Zhao, Qian He, Xiangwang Hou
ICML 2026
Project Page / Paper / Code 🔥
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Xu Guo*, Fulong Ye*, Xinghui Li*, Pengqi Tu, Pengze Zhang, Qichao Sun, Songtao Zhao, Xiangwang Hou, Qian He
Preprint 2026
Project Page / Paper / Code 🔥
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
Jian Ma*, Xujie Zhu*, Zihao Pan, Qirong Peng, Xu Guo, Chen Chen, Haonan Lu
AAAI 2026
Paper / Code
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Jian Ma*, Qirong Peng*, Xu Guo, Chen Chen, Haonan Lu, Zhenyu Yang
ICCV 2025
Paper / Code
Adaptive AUV Hunting Policy with Covert Communication via Diffusion Models
Xu Guo, Xiangwang Hou, Minrui Xu, Jianrui Chen, Jingjing Wang, Jun Du, Yong Ren
ICC 2025 (Best Paper Award, 17/2500, top 0.68%)
Paper / Code
Research Intern, Kling Kuaishou, Beijing, China
2026.03 - present
Collaborators: Xintao Wang, Xinghui Li, etc.
Research Intern, ByteDance Intelligent Creation Lab, Beijing, China
2025.04 - 2026.03
Collaborators: Qian He, Fulong Ye, etc.
Research Intern, OPPO, Shenzhen, China
2024.12 - 2025.04
Collaborators: Jian Ma, Haonan Lu, etc.
Research Intern, Galaxea AI, Beijing, China
2024.04 - 2024.08
Collaborators: Prof. Huazhe Xu, Prof. Hang Zhao, etc.