I am a fourth-year PhD student in the Language Computing and Machine Learning Group, MOE Key Laboratory of Computational Linguistics, School of EECS, Peking University, supervised by Prof. Xu Sun.
Before that, I was an undergraduate student in Software Engineering at Huazhong University of Science and Technology, under the guidance of Prof. Kun He.
My research interests lie within (1) Vision-Language Foundation Models, (2) Video Understanding and Generation, and (3) Open-Ended Visual Recognition.
I’m currently seeking a job in areas such as multi-modal LLMs and video understanding and generation. Feel free to reach out if you are interested!