👦 About me

My name is Junhao Cheng (程钧豪). I am currently an undergraduate student at Sun Yat-sen University (SYSU), supervised by Prof. Xiaodan Liang (梁小丹), and an incoming MPhil student in Prof. Jing Liao (廖菁)’s lab. Before this, I had the privilege of interning in Prof. Ming-Hsuan Yang’s lab and working closely with him.

I am currently a long-term intern at ARC Lab, Tencent PCG. My research interests lie in interactive and generative AI. My current focus is designing novel applications of video generation and reasoning, along with other downstream tasks, so that AI can better serve people.

I am looking for research collaborations and PhD opportunities. If there is anything you would like to discuss, feel free to email me!

🔥 News

  • 2024.10: 🎉🎉 One first-author paper accepted by Energy (JCR Q1).
  • 2024.06: 🎉🎉 Released AutoStudio (400+ stars ✨) for comic book generation.
  • 2024.05: 🎉🎉 One second-author paper accepted by ACL 2024.
  • 2024.04: 🎉🎉 Released TheaterGen for benchmarking multi-turn image generation.

💻 Internships

🎓 Education

2021-now

Undergraduate student at Sun Yat-sen University

Supervisor: Xiaodan Liang (梁小丹)


πŸ“ Publications

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction


Junhao Cheng, Yuying Ge, Yixiao Ge, Jing Liao, Ying Shan

arXiv 2025 / Paper / Code

Object Isolated Attention for Consistent Story Visualization


Xiangyang Luo, Junhao Cheng, Yifan Xie, Xin Zhang, Tao Feng, Zhou Liu, Fei Ma, Fei Yu

ICME 2025 (CCF-B) / Paper

BD-Diff: Generative Diffusion Model for Image Deblurring on Unknown Domains with Blur-Decoupled Learning


Junhao Cheng, Wei-Ting Chen, Xi Lu, Ming-Hsuan Yang

arXiv 2025 / Paper / Code

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation


Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

arXiv 2024 / Paper / Code

VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models


Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin

ACL 2024 (CCF-A) / Paper / Code

TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation


Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

arXiv 2024 / Paper / Code

Integrating Domain Knowledge into Transformer for Short-Term Wind Power Forecasting


Junhao Cheng, Xing Luo, Zhi Jin

Energy (JCR-Q1) / Paper