Biography
I am a PhD student of MMLab@CUHK advised by Prof. Xiangyu Yue. Recently, I focus on efficient and unified multimodal LLMs, such as LLaMA-Adpater, OneLLM and Tar. I received my Master and Bachelor degree from Wuhan University and Central South University, respectively. I interned at Bytedance Seed, Shanghai AI Lab and Tencent YouTu Lab.
News
Selected Publications

Bridge: Growing Visual Generative Capacity for Pre-Trained MLLMs

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Multimodal Long Video Modeling Based on Temporal Dynamic Context

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Retrieval-Augmented Personalization for Multimodal Large Language Models

OneLLM: One Framework to Align All Modalities with Language


Experience



Educations

09/2023 - Present
PhD. The Chinese University of Hong Kong

09/2019 - 06/2022
M.E. Wuhan University
09/2015 - 06/2019
B.E. Central South University
Activities
- Reviewer of CVPR’21-24, ICCV’21-23, NIPS’23, ICLR’24, WACV’22, ECCV’22-24, AAAI’23-24
- Reviewer of TPAMI, IJCV, TIP, ISPRS, TGRS, TNNLS