Biography

I am a Postdoctoral Researcher at the School of Computer Science and Technology, University of Science and Technology of China (USTC), and a member of the State Key Laboratory of Cognitive Intelligence. I am a member of the China Computer Federation, a committee member of the Affective Computing Specialist Committee and the Youth Working Committee of the China National Committee for Chinese Information Processing, and the Secretary-General of the Affective Computing Specialist Committee of the Anhui Province Artificial Intelligence Society. I received my Ph.D. from the School of Computer Science and Technology, USTC, in June 2023, supervised by Prof. Enhong Chen. Before starting my Ph.D. study, I worked at MediaTek Inc. from 2017 to 2019. Earlier, I received my Bachelor's degree in 2014 and my Master's degree in 2017, both from Southwest University of Science and Technology. I am broadly interested in affective computing, multimodal understanding, and human-computer interaction.


Experiences

  • 2024.07–Now, Postdoctoral Researcher, School of Computer Science and Technology, University of Science and Technology of China
  • 2023.06–2024.07, Lecturer, School of Computer Science and Technology, Southwest University of Science and Technology
  • 2019.09–2023.06, Ph.D., School of Computer Science and Technology, University of Science and Technology of China
  • 2017.07–2019.07, Senior Software D.E., MediaTek Inc. (Chengdu)
  • 2014.09–2017.06, M.S., School of Computer Science and Technology, Southwest University of Science and Technology
  • 2010.09–2014.06, B.S., School of Information Engineering, Southwest University of Science and Technology

Awards and Honors

  • 2024   Received the Best Student Paper award at ACM SIGKDD'24
  • 2024   Received the Outstanding Paper Nomination Award at PRAI 2024
  • 2024   Achieved the Runner-up position in the detection track at the ACM MM 2024@Micro-Expression Grand Challenge
  • 2023   Achieved the Runner-up position in Track 3 at the CVPR 2023@Long Video Understanding Challenge
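  • 2023   Received the First/Second Prize at the "Tianma Cup" National College Science and Technology Innovation Competition@2D/3D Digital Human Generation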
  • 2022   Achieved the Runner-up position in the generation track at the ACM MM 2022@Micro-Expression Grand Challenge
  • 2022   Secured third place in the spotting track at the ACM MM 2022@Micro-Expression Grand Challenge
  • 2021   Achieved third place in the generation track at the ACM MM 2021@Micro-Expression Grand Challenge

Selected Publications

2024

[SCIS] Shukang Yin#, Chaoyou Fu#*, Sirui Zhao#*, Tong Xu, Hao Wang, Dianbo Sui, Enhong Chen*. "Woodpecker: Hallucination Correction for Multimodal Large Language Models", SCIENCE CHINA Information Sciences (SCIS), 2024, Accepted.
[National Science Review] Shukang Yin#, Chaoyou Fu#*, Sirui Zhao#*, Ke Li, Xing Sun, Tong Xu, Enhong Chen*. "A Survey on Multimodal Large Language Models", National Science Review, 2024, Accepted.
[arXiv] Chaoyou Fu, Yi-Fan Zhang, Shukang Yin, Bo Li, Xinyu Fang, Sirui Zhao, Haodong Duan, Xing Sun, Ziwei Liu, Liang Wang, Caifeng Shan, Ran He. "MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs", arXiv preprint arXiv:2411.15296, 2024.
[arXiv] Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun. "Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis", arXiv preprint arXiv:2405.21075, 2024.
[ACM MM'24] Zhengye Zhang#, Sirui Zhao#, Xinglong Mao, Shifeng Liu, Hao Wang, Tong Xu, Enhong Chen*. "A Multi-scale Feature Learning Network with Optical Flow Correction for Micro- and Macro-expression Spotting", In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM'24), Melbourne, Australia, 2024, Accepted.
[ICME'24] Shifeng Liu, Xinglong Mao, Sirui Zhao*, Chaoyou Fu, Ying Yu, Tong Xu, Enhong Chen*. "TGMAE: Self-supervised Micro-Expression Recognition with Temporal Gaussian Masked Autoencoder", In Proceedings of the 2024 IEEE International Conference on Multimedia and Expo (ICME'24), Niagara Falls, Canada, 2024, Accepted.
[ACM TOMM] Shukang Yin, Sirui Zhao*, Hao Wang, Tong Xu, Enhong Chen*. "Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval", ACM Transactions on Multimedia Computing Communications and Applications, 2024, Accepted.
[PRCV'24] Xinglong Mao, Shifeng Liu, Sirui Zhao*, Hao Wang, Tong Xu, Enhong Chen*. "H2LMER: A Cross Frame-Rate Representation Alignment Framework for Micro-Expression Recognition", Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024.
[ICMR'24] Chenxiao Liu, Zheyong Xie, Sirui Zhao, Jin Zhou, Tong Xu*, Minglei Li, Enhong Chen, "Speak From Heart: An Emotion-Guided LLM-Based Multimodal Method for Emotional Dialogue Generation", In Proceedings of the 14th International Conference on Multimedia Retrieval (ICMR'24), Dusit Thani Laguna Phuket, Thailand, 2024, Accepted.
[ACM SIGKDD'24] Mingjia Yin, Hao Wang*, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen, "Dataset Regeneration for Sequential Recommendation", The 30th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD'2024), Accepted.
[TOIS] Hao Wang, Mingjia Yin, Luankang Zhang, Sirui Zhao, Enhong Chen, "MF-GSLAE: A Multi-Factor User Representation Pre-training Framework for Dual-Target Cross-Domain Recommendation", ACM Transactions on Information Systems, Accepted.

2023

[TAFFC] Sirui Zhao, Huaying Tang, Xinglong Mao, Shifeng Liu, Hao Wang, Tong Xu, Enhong Chen*, "DFME: A New Benchmark for Dynamic Facial Micro-expression Recognition", IEEE Transactions on Affective Computing, doi: 10.1109/TAFFC.2023.3341918, 2023.
[ACM TOMM] Sirui Zhao, Hongyu Jiang, Hanqing Tao, Rui Zha, Kun Zhang, Tong Xu, Enhong Chen. "PEDM: A Multi-task Learning Model for Persona-aware Emoji-embedded Dialogue Generation", ACM Transactions on Multimedia Computing, Communications and Applications, 2023, 19(3s): 1-21.
[ICME'23] Shukang Yin, Shiwei Wu, Tong Xu, Sirui Zhao*, Enhong Chen*. "AU-aware graph convolutional network for Macro- and Micro-expression spotting", 2023 IEEE International Conference on Multimedia and Expo (ICME), IEEE, 2023: 228-233.
[ICME'23] Yiming Zhang, Hao Wang, Yifan Xu, Xinglong Mao, Tong Xu, Sirui Zhao*, Enhong Chen*. "Adaptive Graph Attention Network with Temporal Fusion for Micro-Expressions Recognition", 2023 IEEE International Conference on Multimedia and Expo (ICME), IEEE, 2023: 1391-1396.
[PRAI'23] Huaying Tang, Xiaorong Zhang, Xinglong Mao, Shifeng Liu, Sirui Zhao*, Enhong Chen*. "Global and Local Mixer for Micro-Expression Recognition", 2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Haikou, China, 2023, pp. 509-517.
[IWMCAS'23] Liu Minghao, Liu Haiyi, Zhao Sirui*, Ma Fei, Li Minglei, Dai Zonghong, Wang Hao, Xu Tong, Chen Enhong*. "STAN: Spatial-Temporal Awareness Network for Temporal Action Detection", Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023: 161-165.
[CIKM'23] Mingjia Yin, Hao Wang*, Xiang Xu, Likang Wu, Sirui Zhao, Wei Guo, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen, "APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation", Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM'2023), Accepted.
[FCS] Mingdi HU, Long BAI, Jiulun FAN, Sirui ZHAO, Enhong CHEN, "Vehicle Color Recognition Based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion", Frontiers of Computer Science, 2023, 17(3): 173321.

2022

[Neural Networks] Sirui Zhao, Huaying Tang, Shifeng Liu, Yangsong Zhang, Hao Wang, Tong Xu, Enhong Chen*. "ME-PLAN: A Deep Prototypical Learning with Local Attention Network For Dynamic Micro-Expression Recognition", Neural Networks, 2022, 153: 427-443.
[ACM MM'22] Sirui Zhao, Shukang Yin, Huaying Tang, Rijin Jin, Yifan Xu, Tong Xu, Enhong Chen*, "Fine-grained Micro-Expression Generation based on Thin-Plate Spline and Relative AU Constraint", Proceedings of the 30th ACM International Conference on Multimedia, 2022: 7150-7154.
[ACM MM'22] Wenhao Leng, Sirui Zhao#, Yiming Zhang, Shifeng Liu, Xinglong Mao, Hao Wang, Tong Xu, Enhong Chen*. "ABPN: Apex and Boundary Perception Network for Micro- and Macro-Expression Spotting", Proceedings of the 30th ACM International Conference on Multimedia, 2022: 7160-7164.
[ICIP'22] Rijin Jin, Sirui Zhao, Zhongkai Hao, Yifan Xu, Tong Xu*, Enhong Chen, "AVT: Au-Assisted Visual Transformer for Facial Expression Recognition", 2022 IEEE International Conference on Image Processing (ICIP), IEEE, 2022: 2661-2665.
[PRAI'22] Hongyi Li, Sirui Zhao, Yadong Wu, Shiwei Wu, Tong Xu and Enhong Chen*, "Supervised Contrastive Attentive Learning for Facial Expression Recognition in the wild", 2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), IEEE, 2022: 293-301.

2021

[Neurocomputing] Sirui Zhao, Hanqing Tao, Yangsong Zhang, Tong Xu, Kun Zhang, Zhongkai Hao, Enhong Chen*. "A Two-stage 3D CNN based Learning Method for Spontaneous Micro-Expression Recognition", Neurocomputing, 2021, 448: 276-289.
[Neural Networks] Yangsong Zhang, Huan Cai, Li Nie, Peng Xu, Sirui Zhao, Cuntai Guan. "An end-to-end 3D convolutional neural network for decoding attentive mental state", Neural Networks, 2021, 144: 129-137.
[ACM MM'21] Yifan Xu, Sirui Zhao, Huaying Tang, Xinglong Mao, Tong Xu*, Enhong Chen, "FAMGAN: Fine-grained AUs Modulation based Generative Adversarial Network for Micro-Expression Generation", In Proceedings of the 29th ACM International Conference on Multimedia (ACM MM'21), Chengdu, China, 2021, 4813-4817.
[Vis] Liang Fan, Cheng Chen, Sirui Zhao, Xiaorong Zhang, Yadong Wu, Fang Wang, et al., "Multi-threaded parallel projection tetrahedral algorithm for unstructured volume rendering", Journal of Visualization, 2021, 24(2): 261-274.

Hosted and Participated Research Projects

  • January 2025 - January 2028, Principal Investigator, National Natural Science Foundation of China, Young Scientists Fund
  • January 2023 - January 2024, Principal Investigator, Natural Science Foundation of Sichuan Province, China, Youth Fund
  • My research is also supported by grants from leading companies, e.g., Huawei and Huadong Photoelectric

There's always a way as long as you maintain a good state of mind: solutions always outnumber difficulties!