Google Scholar  |   Github
Email: menghanxyz@hust.edu.cn
Email: menghanxyz@gmail.com

Menghan Xia

Associate Professor
Huazhong University of Science and Technology

I am now an associate professor in School of Software Engineering, Huazhong University of Science and Technology (HUST). Previously, I worked as a Senior Researcher at Kling Team of Kuaishou Technology (2024~2025), and Tencent AI Lab (2021~2024), leading an effort for AIGC research and technical application to products. I received my Ph.D. in Computer Science and Engineering from The Chinese University of Hong Kong (CUHK) in 2021, supervised by Prof. Tien-Tsin Wong. Before that, I obtained a B.Eng. degree in Photogrammetry and Remote Sensing in 2014 and M.Eng. degree in Pattern Recognition and Intelligent System in 2017, both from Wuhan University, under the supervision of Prof. Jian Yao. During my doctoral studies, I collaborated with Adobe Research for a year, and completed a research internship at Microsoft Research Asia (MSRA).

My research interest lies in Computer Vision and Deep Learning, especially image/video processing & generation. Currently I focus on visual generative foundation models and its multimodal controllability for AIGC applications.


🔔Now Looking for !!!

  • 2026 incoming Mphil (quota available): image/video generative foundation model, multimodal conditional generation, AIGC applications for 4D modeling, digital human, etc.
  • Undergraduate who are passionate in research, hardworking, and self-motivated for high-quality research, are also welcome to contact.

  • Publications

    Selected
       •   

    ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

    Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang
    IEEE International Conference on Computer Vision (ICCV), 2025
    Webpage   Code  

    PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

    Shian Du, Menghan Xia, Chang Liu, Xintao Wang, Jing Wang, Pengfei Wan, Di Zhang, Xiangyang Ji
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025

    SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

    Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Xiao Fu, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang
    International Conference on Learning Representations (ICLR), 2025
    Webpage   Code  

    ToonCrafter: Generative Cartoon Interpolation

    Jinbo Xing, Hanyuan Liu, Menghan Xia, Yong Zhang, Xintao Wang, Ying Shan, Tien-Tsin Wong
    SIGGRAPH Asia (special issue of ACM Transactions on Graphics), 2024
    Webpage  •   Code  •   Demo

    StyleCrafter: Taming Artistic Video Diffusion with Reference-Augmented Adapter Learning

    GongyeLiu, Menghan Xia, Yong Zhang, Haoxin Chen, Jinbo Xing, Xintao Wang, Ying Shan, Yujiu Yang
    SIGGRAPH Asia (special issue of ACM Transactions on Graphics), 2024
    Webpage  •   Code  •   Demo

    DynamiCrafter: Animating Open-Domain Images with Video Diffusion Priors

    Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Xintao Wang, Ying Shan, Tien-Tsin Wong
    European Conference on Computer Vision (ECCV), 2024
    Webpage  •   Code  •   Demo

    FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

    Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu
    International Conference on Learning Representations (ICLR), 2024
    Webpage  •   Code  •   Demo

    Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance

    Jinbo Xing, Menghan Xia, Yuxin Liu, Yuechen Zhang, Yong Zhang, Yingqing He, Hanyuan Liu, Haoxin Chen, Xiaodong Cun, Xintao Wang, Ying Shan, Tien-Tsin Wong
    IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024 Early access
    Webpage  •   Code

    VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

    Haoxin Chen*, Menghan Xia*, Yingqing He*, Yong Zhang*, Xiaodong Cun*, Shaoshu Yang, Jinbo Xing, Yaofang Liu, Qifeng Chen, Xintao Wang, Chao Weng, Ying Shan
    preprint arXiv:2310.19512, 2023
    Webpage  •   Code  •   Discord

    Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

    Yingqing He*, Menghan Xia*, Haoxin Chen*, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen
    European Conference on Computer Vision Workshops (ECCVW), 2024
    Webpage  •   Code

    Codetalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior

    Jinbo Xing, Menghan Xia, Yuechen Zhang, Xiaodong Cun, Jue Wang, Tien-Tsin Wong
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    Webpage  •   Code

    Disentangled Image Colorization via Global Anchors

    Menghan Xia, Wenbo Hu, Tien-Tsin Wong, Jue Wang
    SIGGRAPH Asia (special issue of ACM Transactions on Graphics), 2022
    Webpage  •   Video  •   Code  •   Demo

    LF2MV: Learning An Editable assets-View Towards Light Field Representation

    Menghan Xia, Jose Echevarria, Minshan Xie, Tien-Tsin Wong
    IEEE Transactions on Visualization and Computer Graphics (TVCG), 2022 Early access
    Video  •   Code

    Deep Halftoning with Reversible Binary Pattern

    Menghan Xia, Wenbo Hu, Xueting Liu, Tien-Tsin Wong
    IEEE International Conference on Computer Vision (ICCV), 2021
    Webpage  •   Code  •   Demo

    Exploiting Aliasing for Manga Restoration

    Minshan Xie*, Menghan Xia*, Tien-Tsin Wong
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
    Webpage  •   Code

    Invertible Grayscale

    Menghan Xia, Xueting Liu, Tien-Tsin Wong
    SIGGRAPH Asia (special issue of ACM Transactions on Graphics), 2018
    Webpage  •   Video  •   Code

    Color Consistency Correction Based on Remapping Optimization for Image Stitching

    Menghan Xia, Jian Yao, Renping Xie, Mi Zhang, Jinsheng Xiao
    IEEE International Conference on Computer Vision Workshops (ICCVW), 2017
    Code  •   Journal Extension [ISPRS]

    Globally Consistent Alignment for Planar Mosaicking via Topology Analysis

    Menghan Xia, Jian Yao, Renping Xie, Mi Zhang, Jinsheng Xiao
    Pattern Recognition (PR), 66:239-252, 2017
    Webpage  •   Code

    Services

    •  Area Chair/Program Committee

    •  Conference Review

    •  Journal Review

    ©Menghan Xia