๐Ÿ‘ฆ About Me

I am currently an Assistant Professor in Visual Information Processing and Learning (VIPL) group at the Institute of Computing Technology (ICT), Chinese Academy of Sciences (CAS). I received my Ph.D. from ICT, CAS in 2024, under the supervision of Prof. Xilin Chen. I also had close collaboration with Prof. Jie Zhang and Prof. Xiujuan Chai. My research interests mainly focus on human behavior analysis and understanding from sequential data, with a particular emphasis on gesture and sign language. I am also interested in efficient data utilization in deep learning and the trustworthiness of multimodal large language models. Currently, I am actively working to promote the real-world deployment of sign language technologies. Feel free to reach out if youโ€™re interested in any of these topics or potential collaboration.

๐Ÿ”ฅ News

  • 2025.06: ย ๐ŸŽ‰๐ŸŽ‰ Our team won the 1st Multimodal Sign Language Recognition Challenge Challenge at ICCVโ€™25 in signer-independent and unseen sentence sub-tasks. Congratulations to everyone involved!
  • 2025.01: ย ๐ŸŽ‰๐ŸŽ‰ Our team won the Cross-View Isolated Sign Language Recognition Challenge at WWWโ€™25 in both the RGB and RGB-D tracks. Congratulations to everyone involved!
  • 2024.09: ย ๐ŸŽ‰๐ŸŽ‰ One paper on skeleton-aware sign language recognition was accepted by ACCV 2024. Congratulations to Yifan Yang and the team!
  • 2024.06: ย ๐ŸŽ‰๐ŸŽ‰ One paper on vision-language pre-training in SLT was accepted by ECCV 2024. Congratulations to Peiqi Jiao and the team!
  • 2023.12: ย ๐ŸŽ‰๐ŸŽ‰ Successfully defended my PhD dissertation.
  • 2023.10: ย ๐ŸŽ‰๐ŸŽ‰ One paper on keyframe selection in CSLR was accepted by Scientia Sinica Informationis 2023.
  • 2023.10: Present the doctoral consortium โ€œAlignment Constraints for Video-based Sign Language Understandingโ€ at the workshop on Assistive Computer Vision and Robotics at ICCV23 [pdf] [workshop]
  • 2023.07: ย ๐ŸŽ‰๐ŸŽ‰ One paper on co-occurrence signals in CSLR was accepted by ICCV 2023. Congratulations to Peiqi Jiao and the team!
  • 2022.07: ย ๐ŸŽ‰๐ŸŽ‰ One paper on sequential representation learning was accepted by ECCV 2022.

๐Ÿ“ Publications

ECCV 2022
sym

Deep Radial Embedding for Visual Sequence Learning

Yuecong Min, Peiqi Jiao, Yanan Li, Xiaotao Wang, Lei Lei, Xiujuan Chai, Xilin Chen

  • RadialCTC constrains sequence features on a hypersphere while retaining the iterative alignment mechanism of CTC, which also provides a clear geometric interpretation for CTC
  • RadialCTC controls the peaky behavior with a simple angular perturbation term
ICCV 2021
sym

Visual Alignment Constraint for Continuous Sign Language Recognition

Yuecong Min, Aiming Hao, Xiujuan Chai, Xilin Chen

|

  • VAC provides an efficient way to make CSLR models end-to-end trainable and is adopted as the baseline model by many recent works
  • Two metrics to evaluate the contributions of the feature extractor and the alignment module
CVPR 2020
sym

An Efficient PointLSTM for Point Clouds Based Gesture Recognition

Yuecong Min, Yanxiao Zhang, Xiujuan Chai, Xilin Chen

|

  • PointLSTM can leverage long-term spatio-temporal relationships in irregular sequence data (e.g., point cloud) while preserving the spatial structure for irregular sequence recognition problem
  • Evaluation results on 3D gesture recognition and action recognition show great potential for real-time applications

๐ŸŽ– Honors and Awards

  • Excellence Prize of the Chinese Academy of Sciences (CAS) President Award, 2023.
  • China National Scholarship for Ph.D., 2022

๐Ÿ“– Educations

  • 2017.09 - 2024.1, I was a Ph.D. student at Institute of Computing Technology, CAS, under the supervision of Prof. Xilin Chen.
  • 2013.09 - 2017.07, I was a college student in Shandong University, Weihai.

โœ’๏ธ Academic Services

  • Invited journal reviewer for IEEE TPAMI / IEEE TMM / IEEE TIP / PR โ€ฆ
  • Invited conference reviewer for CVPRโ€™22 /ACM MMโ€™22 / ECCVโ€™22 / CVPRโ€™23 โ€ฆ

โš™๏ธ Misc

  • A summary of papers on multimodal hallucination benchmark and detection. survey paper on arXiv
  • A summary of papers on gesture and sign language recognition.
  • A simple tool to visualize the main keywords of accepted papers for the recent Computer Vision conferences