About Me [中文]

  • I'm a third-year Ph.D. student in Computer Science at the Information School, Renmin University of China, where I have been advised by Prof. Xirong Li since 2019. I received my B.S. and M.E. degrees from Chongqing University of Posts and Telecommunications, Chongqing, China, in 2016 and 2019, respectively.
  • My research focuses on cross-modal computing, specifically image/video captioning, text-to-video retrieval, and attacks on cross-modal retrieval. I mainly take part in TRECVID Ad-hoc Video Search, a leading international benchmark evaluation for text-based video retrieval funded by NIST. Recently, I have also been exploring how large-scale pre-trained vision-language models can improve performance on these downstream tasks.
  • Contact me: [email protected] or [email protected]

Publications

Competition experience

  • MM’21 Grand Challenge: Pretraining for video captioning, top 3
  • 2021 TREC Video Retrieval Evaluation (TRECVID) AVS, top 3
  • 2020 TREC Video Retrieval Evaluation (TRECVID) AVS, top 2

Honors

  • National Scholarship, 2018