Junhao Cheng's Homepage

👋 About me [Updated 25/11/2025]

My name is Junhao Cheng (程钧豪). I received my bachelor's degree from Sun Yat-sen University (SYSU) in , supervised by Prof. Xiaodan Liang (梁小丹). Now I am an MPhil student at Prof. Jing Liao (廖菁)'s lab. Before this, I had the privilege of interning in Prof. Ming-Hsuan Yang's lab and working closely with him.

I am currently a research intern at Kuaishou Kling team. My research interests lie in Interactive AI. Now I focus on foundation models and novel applications for image/video understanding and generation.

I am open to research collaborations, PhD opportunities (27 Fall), and industry/startup roles. If you're interested in discussing potential synergies, I'd be happy to 📧connect.

🔥 News

2025.11: Release Video-as-Answer, extending next-event prediction to video answer.

2025.06: 🎉🎉 One paper is accepted by ICCV 2025.

2025.06: Release Video-Holmes, evaluating MLLMs for complex video reasoning.

2025.04: Release AnimeGamer (300+Stars✨), transforming characters from anime films into interactive entities with an MLLM.

2024.06: Release AutoStudio (400+Stars✨), generating comic book with multi-character, multi-turn consistency.

2024.05: 🎉🎉 One paper is accepted by ACL 2024.

🎓 Educations

MPhil Student @ City University of Hong Kong

2025 - 2027 (expected)

Supervisor: Jing Liao (廖菁)

Undergraduate Student @ Sun Yat-sen University

2021 - 2025

Supervisor: Xiaodan Liang (梁小丹)

💻 Internships

Research Intern @ Kuaishou, Kling Team

2025.07 - now

Mentor: Liang Hou(侯良), Xin Tao (陶鑫)

Research Intern @ Tencent, ARC Lab

2024.09 - 2025.06

Mentor: Yuying Ge (葛玉莹)

Research Intern @ Lenovo, Research Institute

2023.09 - 2024.08

Research Intern @ Chinese Institute for Brain Research

2023.03 - 2023.08

Mentor: Yunzhe Liu (柳昀哲)

📝 Selected Publications

AnimeGamer Poster

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Junhao Cheng, Yuying Ge, Yixiao Ge, Jing Liao, Ying Shan

ICCV 2025 Paper Code

Isolate Poster

Object Isolated Attention for Consistent Story Visualization

Xiangyang Luo, Junhao Cheng, Yifan Xie, Xin Zhang, Tao Feng, Zhou Liu, Fei Ma, Fei Yu

ICME 2025 Paper

VisDiaHalBench Poster

VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models

Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin

ACL 2024 Paper Code

DKFormer Poster

Integrating Domain Knowledge into Transformer for Short-Term Wind Power Forecasting

Junhao Cheng, Xing Luo, Zhi Jin

Energy (JCR-Q1) Paper