I am currently a researcher at Shanghai AI Lab, focusing on the research of multimodal large language models and multimodal data algorithms. Prior to joining Shanghai AI Lab, I was engaged in data algorithm research at SenseTime Group Inc. (2020-2022). I obtained my Ph.D. from the University of Chinese Academy of Sciences in 2020. Between 2018 and 2019, I participated in the National Natural Science Foundation of China’s joint Ph.D. training program at the University of Central Florida, under the supervision of Professors Yongdong Zhang and Guo-Jun Qi.