π¦ About me
My name is Junhao Cheng (η¨ι§θ±ͺ). I am currently an undergraduate student at Sun Yat-sen University (SYSU) supervised by Prof. Xiaodan Liang (ζ’ε°δΈΉ). I am an upcoming MPhil student at Prof. Jing Liao (ε»θ)βs lab. Before this, I had the privilege of interning in Ming-Hsuanβs lab and working closely with him.
I am currently a long-term intern at ARC Lab, Tencent PCG. My research interests lie in interactive and generative AI. Now I focus on designing novel applications for video generation and reasoning and other downstream tasks to make AI serve humans.
I am looking for research collaborations and PhD opportunities. If you think there is anything interesting we can discuss, feel free to email me!
π₯ News
- 2024.10: Β ππ One paper as the first author is accepted by Energy (JCR Q1).
- 2024.06: Β ππ Release AutoStudio (400+Starsβ¨) for comic book generation.
- 2024.05: Β ππ One paper as the second author is accepted by ACL 2024.
- 2024.04: Β ππ Release TheaterGen for benchmarking multi-turn image generation.
π» Internships
- 2024.10 - now, Tencent, ARC Lab, Shenzhen.
- 2023.02 - 2024.10, Lenovo, Research Institute, Shenzhen.
- 2023.08 - 2024.02, Pengcheng Laboratory, Shenzhen.
- 2023.03 - 2023.08, Chinese Institute of Brain Research (CIBR), Liu Lab, Beijing.
![]() |
2021-now Studying as an Undergraduate Student at Sun Yat-sen University Supervisor: Xiaodan Liang (ζ’ε°δΈΉ) |
![]() |
AnimeGamer: Infinite Anime Life Simulation with Next Game State PredictionJunhao Cheng, Yuying Ge, Yixiao Ge, Jing Liao, Ying Shan arXiv 2025 / Paper / Code |
![]() |
Object Isolated Attention for Consistent Story VisualizationXiangyang Luo, Junhao Cheng, Yifan Xie, Xin Zhang, Tao Feng, Zhou Liu, Fei Ma, Fei Yu ICME 2025 (CCF-B) / Paper |
![]() |
BD-Diff: Generative Diffusion Model for Image Deblurring on Unknown Domains with Blur-Decoupled LearningJunhao Cheng, Wei-Ting Chen, Xi Lu, Ming-Hsuan Yang arXiv 2025 / Paper / Code |
![]() |
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image GenerationJunhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang arXiv 2024 / Paper / Code |
![]() |
VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language ModelsQingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin ACL 2024 (CCF-A) / Paper / Code |
![]() |
TheaterGen: Character Management with LLM for Consistent Multi-turn Image GenerationJunhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang arXiv 2024 / Paper / Code |
![]() |
Integrating Domain Knowledge into Transformer for Short-Term Wind Power ForecastingJunhao Cheng, Xing Luo, Zhi Jin Energy (JCR-Q1) / Paper |