Google Scholar  /   GitHub
Email: menghanxyz@gmail.com

Menghan Xia

I have been a researcher at Tencent AI Lab since 2021. Prior to this, I received my PhD in Computer Science and Engineering from The Chinese University of Hong Kong (CUHK) in 2021, supervised by Prof. Tien-Tsin Wong. Before that, I obtained a B.Eng. degree in Photogrammetry and Remote Sensing in 2014 and an M.Eng. degree in Pattern Recognition and Intelligent System in 2017, both from Wuhan University, under the supervision of Prof. Jian Yao. During my doctoral studies, I collaborated with Adobe Research for a year, starting in March 2019, and completed a research internship at Microsoft Research Asia (MSRA) in the summer of 2021.

My research interests lie in computer vision and deep learning, especially image/video generation and translation. Currently, I focus on generative foundation models (AIGC), multimodal learning, and talking human synthesis.

Research Works

Representative

DynamiCrafter: Animating Open-Domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Xintao Wang, Ying Shan, Tien-Tsin Wong.
preprint arXiv:2310.12190, 2023.
Webpage  •   Code  •   Demo

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu.
International Conference on Learning Representations (ICLR), 2024.
Webpage  •   Code  •   Demo

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance

Jinbo Xing, Menghan Xia, Yuxin Liu, Yuechen Zhang, Yong Zhang, Yingqing He, Hanyuan Liu, Haoxin Chen, Xiaodong Cun, Xintao Wang, Ying Shan, Tien-Tsin Wong.
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024 Early access.
Webpage  •   Code

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Haoxin Chen*, Menghan Xia*, Yingqing He*, Yong Zhang*, Xiaodong Cun*, Shaoshu Yang, Jinbo Xing, Yaofang Liu, Qifeng Chen, Xintao Wang, Chao Weng, Ying Shan.
preprint arXiv:2310.19512, 2023.
Webpage  •   Code  •   Discord

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

Yingqing He*, Menghan Xia*, Haoxin Chen*, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen.
preprint arXiv:2307.06940, 2023.
Webpage  •   Code

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior

Jinbo Xing, Menghan Xia, Yuechen Zhang, Xiaodong Cun, Jue Wang, Tien-Tsin Wong.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
Webpage  •   Code

Disentangled Image Colorization via Global Anchors

Menghan Xia, Wenbo Hu, Tien-Tsin Wong, Jue Wang.
SIGGRAPH Asia (special issue of ACM Transactions on Graphics), 2022.
Webpage  •   Video  •   Code  •   Demo

LF2MV: Learning An Editable Meta-View Towards Light Field Representation

Menghan Xia, Jose Echevarria, Minshan Xie, Tien-Tsin Wong.
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2022 Early access.
Video  •   Code

Deep Halftoning with Reversible Binary Pattern

Menghan Xia, Wenbo Hu, Xueting Liu, Tien-Tsin Wong.
IEEE International Conference on Computer Vision (ICCV), 2021.
Webpage  •   Code  •   Demo

Exploiting Aliasing for Manga Restoration

Minshan Xie*, Menghan Xia*, Tien-Tsin Wong.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
Webpage  •   Code

Invertible Grayscale

Menghan Xia, Xueting Liu, Tien-Tsin Wong.
SIGGRAPH Asia (special issue of ACM Transactions on Graphics), 2018.
Webpage  •   Video  •   Code

Color Consistency Correction Based on Remapping Optimization for Image Stitching

Menghan Xia, Jian Yao, Renping Xie, Mi Zhang, Jinsheng Xiao.
IEEE International Conference on Computer Vision Workshops (ICCVW), 2017.
Code  •   Journal Extension [ISPRS]

Globally Consistent Alignment for Planar Mosaicking via Topology Analysis

Menghan Xia, Jian Yao, Renping Xie, Mi Zhang, Jinsheng Xiao.
Pattern Recognition (PR), 66:239-252, 2017.
Webpage  •   Code

Academic Services

•  Conference Review

•  Journal Review
