Sirui Zhao-赵思蕊

2025

[arxiv] Zhengye Zhang, Sirui Zhao*, Shifeng Liu, Shukang Yin, Xinlong Mao, Tong Xu, Enhong Chen*. "Unsupervised lightweight 3D convolutional network for enhanced infrared imaging in wearable devices", arXiv preprint arXiv:2505.07007, 2025.

[FCS] Biao Zhu, Jun Zhang, Sirui Zhao*, Zhengye Zhang, Enhong Chen*. "Unsupervised lightweight 3D convolutional network for enhanced infrared imaging in wearable devices", Frontiers of Computer Science, 2026, 20(1): 2001306., 2025 Accept

[IEEE TAFFC] Shifeng Liu, Xinlong Mao, Sirui Zhao*, Peiming Li, Tong Xu, Enhong Chen*. "MER-CLIP: AU-Guided Vision-Language Alignment for Micro-Expression Recognition", IEEE Transactions on Affective Computing, DOI:10.1109/TAFFC.2025.3584918, 2025.

[ICME 2025] Fangyuan Liu, Sirui Zhao*, Tong Xu, Yu Sun, Hao Wang, Suojuan Zhang, Enhong Chen*. "PhysFFTFormer: A Frequency Domain-based Vision Transformer for Efficient Remote Physiological Measurement", In Proceedings of the IEEE International Conference on Multimedia&Expo 2025 (ICME'25), Nantes, France, 2025, Accepted.

[ICIP 2025] Xiaohao Wang, Sirui Zhao*, Xinglong Mao, Yiming Zhang, Shifeng Liu, Tong Xu, Enhong Chen*, "DGRGaze: A difference-guided gaze estimation framework based on 6D rotation matrix representation", In Proceedings of the 32nd IEEE International Conference on Image Processing (ICIP'25), 2025, Anchorage Alaska, USA, Accepted.

[CVPR2025] Chaoyou Fu, Yuhan Dai, Yondong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun. "Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis", In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 (CVPR'25)Nashville TN, USA, 2025.

[ICIC 2025] Siyuan Jin, Sirui Zhao*, Yifan Xu, Shifeng Liu, Mengduo Wu, Tong Xu*, "JoyLive: Efficient audio-driven portrait animation by 3D implict keypoints", In Proceedings of the International Conference on Intelligent Computing (ICIC'25), 2025, Ningbo, China, Accepted.

[ICIC 2025] Kang Yin, Sirui Zhao*, Xinlong Mao, Shifeng Liu, Yiming Zhang, Tong Xu*, Enhong Chen, "A Phoneme-Aware Multi-Task Learning Framework with Dynamic Prioritization for Speech Emotion Recognition", In Proceedings of the International Conference on Intelligent Computing (ICIC'25), 2025, Ningbo, China, Accepted.

2024

[SCIS] Shukang Yin#, Chaoyou Fu#*, Sirui Zhao#*, Tong Xu, Hao Wang, Dianbo Sui, Enhong Chen*. "Woodpecker: Hallucination Correction for Multimodal Large Language Models", SCIENCE CHINA Information Sciences(SCIS), 2024, Accepted.

[National Science Review] Shukang Yin#, Chaoyou Fu#*, Sirui Zhao#*, Ke Li, Xing Sun, Tong Xu, Enhong Chen*. "A Survey on Multimodal Large Language Models", National Science Review, 2024, Accepted.

[arXiv] Chaoyou Fu, Yi-Fan Zhang, Shukang Yin, Bo Li, Xinyu Fang, Sirui Zhao, Haodong Duan, Xing Sun, Ziwei Liu, Liang Wang, Caifeng Shan, Ran He. "MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs", arXiv preprint arXiv:2411.15296, 2024.

[arXiv] Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun "Video-mme: The first-ever comprehensive evaluation benchmark of multi-modal llms in video analysis", arXiv preprint arXiv:2405.21075, 2024.

[ACM MM'24] Zhengye Zhang#, Sirui Zhao#, Xinglong Mao, Shifeng Liu, Hao Wang, Tong Xu, Enhong Chen*. "A Multi-scale Feature Learning Network with Optical Flow Correction for Micro- and Macro-expression Spotting", In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM'24), Melbourne, Australia, 2024, Accepted.

[ICME'24] Shifeng Liu, Xinglong Mao, Sirui Zhao*, Chaoyou Fu, Ying Yu, Tong Xu, Enhong Chen*. "TGMAE: Self-supervised Micro-Expression Recognition with Temporal Gaussian Masked Autoencoder", In Proceedings of the 2024 IEEE International Conference on Multimedia and Expo (ICME'24), Niagra Falls, Canada, 2024, Accepted.

[ACM TOMM] Shukang Yin, Sirui Zhao*, Hao Wang, Tong Xu, Enhong Chen*. "Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval", ACM Transactions on Multimedia Computing Communications and Applications, 2024, Accepted.

[PRCV'24] Xinglong Mao, Shifeng Liu, Sirui Zhao*, Hao Wang, Tong Xu, Enhong Chen*. "H2LMER: A Cross Frame-Rate Representation Alignment Framework for Micro-Expression Recognition", Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024.

[ICMR'24] Chenxiao Liu, Zheyong Xie, Sirui Zhao, Jin Zhou, Tong Xu*, Minglei Li, Enhong Chen, "Speak From Heart: An Emotion-Guided LLM-Based Multimodal Method for Emotional Dialogue Generation", In Proceedings of the 14th International Conference on Multimedia Retrieval (ICMR'24), Dusit Thani Laguna Phuket, Thailand, 2024, Accepted.

[ACM SIGKDD'24] Mingjia Yin, Hao Wang*, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen, "Dataset Regeneration for Sequential Recommendation", The 30th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD'2024), Accepted.

[TOIS] Hao Wang, Mingjia Yin, Luankang Zhang, Sirui Zhao, Enhong Chen, "MF-GSLAE: A Multi-Factor User Representation Pre-training Framework for Dual-Target Cross-Domain Recommendation", ACM Transactions on Information Systems, Accepted.

2023

[TAFFC] Sirui Zhao, Huaying Tang, Xinglong Mao, Shifeng Liu, Hao Wang, Tong Xu, Enhong Chen*, "DFME: A New Benchmark for Dynamic Facial Micro-expression Recognition", IEEE Transactions on Affective Computing, doi: 10.1109/TAFFC.2023.3341918, 2023.

[ACM TOMM] Sirui Zhao, Hongyu Jiang, Hanqing Tao, Rui Zha, Kun Zhang, Tong Xu, Enhong Chen. "PEDM: A Multi-task Learning Model for Persona-aware Emoji-embedded Dialogue Generation", ACM Transactions on Multimedia Computing, Communications and Applications, 2023, 19(3s): 1-21.

[ICME'23] Shukang Yin, Shiwei Wu, Tong Xu, Sirui Zhao*, Enhong Chen*. "AU-aware graph convolutional network for Macro- and Micro-expression spotting", 2023 IEEE International Conference on Multimedia and Expo (ICME), IEEE, 2023: 228-233.

[ICME'23] Yiming Zhang, Hao Wang, Yifan Xu, Xinglong Mao, Tong Xu, Sirui Zhao*, Enhong Chen*. "Adaptive Graph Attention Network with Temporal Fusion for Micro-Expressions Recognition", 2023 IEEE International Conference on Multimedia and Expo (ICME), IEEE, 2023: 1391-1396.

[PRAI'23] Huaying Tang, Xiaorong Zhang, Xinglong Mao, Shifeng Liu, Sirui Zhao*, Enhong Chen*. "Global and Local Mixer for Micro-Expression Recognition", 2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Haikou, China, 2023, pp. 509-517.

[IWMCAS'23] Liu Minghao, Liu Haiyi, Zhao Sirui*, Ma Fei, Li Minglei, Dai Zonghong, Wang Hao, Xu Tong, Chen Enhong*. "STAN: Spatial-Temporal Awareness Network for Temporal Action Detection", Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023: 161-165.

[CIKM'23] Mingjia Yin, Hao Wang*, Xiang Xu, Likang Wu, Sirui Zhao, Wei Guo, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen, "APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation", Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM'2023), Accepted.

[FCS] Mingdi HU, Long BAI, Jiulun FAN, Sirui ZHAO, Enhong CHEN, "Vehicle Color Recognition Based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion", Frontiers of Computer Science, 2023, 17(3): 173321.

2022

[Neural Networks] Sirui Zhao, Huaying Tang, Shifeng Liu, Yangsong Zhang, Hao Wang, Tong Xu, Enhong Chen*. "ME-PLAN: A Deep Prototypical Learning with Local Attention Network For Dynamic Micro-Expression Recognition", Neural Networks, 2022, 153: 427-443.

[ACM MM'22] Sirui Zhao, Shukang Yin, Huaying Tang, Jin Rijin, Yifan Xu, Tong Xu, Enhong Chen*, "Fine-grained Micro-Expression Generation based on Thin-Plate Spline and Relative AU Constraint", Proceedings of the 30th ACM International Conference on Multimedia, 2022: 7150-7154.

[ACM MM'22] Wenhao Leng, Sirui Zhao#, Yiming Zhang, Shiifeng Liu, Xinglong Mao, Hao Wang, Tong Xu, Enhong Chen*. "ABPN: Apex and Boundary Perception Network for Micro- and Macro-Expression Spotting", Proceedings of the 30th ACM International Conference on Multimedia. 2022: 7160-7164.

[ICIP'22] Rijin Jin, Sirui Zhao, Zhongkai Hao, Yifan Xu, Tong Xu*, Enhong Chen, "AVT: Au-Assisted Visual Transformer for Facial Expression Recognition", 2022 IEEE International Conference on Image Processing (ICIP), IEEE, 2022: 2661-2665.

[PRAI'22] Hongyi Li, Sirui Zhao, Yadong Wu, Shiwei Wu, Tong Xu and Enhong Chen*, "Supervised Contrastive Attentive Learning for Facial Expression Recognition in the wild", 2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), IEEE, 2022: 293-301.

2021

[Neurocomputing] Sirui Zhao, Hanqing Tao, Yangsong Zhang, Tong Xu, Kun Zhang, Zhongkai Hao, Enhong Chen*. "A Two-stage 3D CNN based Learning Method for Spontaneous Micro-Expression Recognition", Neurocomputing, 2021, 448(2021), 276-289.

[Neural Networks] Yangsong Zhang, Huan Cai, Li Nie, Peng Xu, Sirui Zhao, Cuntai Guan. "An end-to-end 3D convolutional neural network for decoding attentive mental state", Neural Networks, 2021, 144: 129-137.

[ACM MM'21] Yifan Xu, Sirui Zhao, Huaying Tang, Xinlong Mao, Tong Xu*, Enhong Chen, "FAMGAN: Fine-grained AUs Modulation based Generative Adversarial Network for Micro-Expression Generation", In Proceedings of the 29th ACM International Conference on Multimedia (ACM MM'21), Chengdu, China, 2021, 4813-4817.

[Vis] Liang Fan, Cheng Chen, Sirui Zhao, Xiarorong Zhang, Yadong Wu, Fang Wang, et al., "Multi-threaded parallel projection tetrahedral algorithm for unstructured volume rendering", Journal of Visualization, 2021, 24(2): 261-274.

Patent application

一种非接触心率检测方法、系统及相关设备，2025-05-02，中国，ZL202510240309.6（授权）
一种基于AU引导的微表情识别方法及装置，2025-02-18，中国，ZL202411783466.3（授权）
一种基于音素感知的语音情感识别方法及装置，2024-12-08，中国，ZL202411505238.X（授权）
基于小波变换混合增强对比学习的微动作识别方法及装置，2024-7-15，中国，ZL202410938994.5（授权）
跨帧率微表情识别方法及装置，2024-7-16，中国，ZL202410592967.7（授权）
一种微表情识别模型的训练方法、识别方法及相关设备，2024-7-26，中国，ZL202410649574.5（授权）
视频检索方法、系统、设备及存储介质，2023-10-16，中国，ZL202311331941.9（授权）
一种自发微表情识别方法，2022-9-30, 中国，ZL202011559343.3（授权）
自然场景下人脸表情识别方法、系统、设备及存储介质，2022-9-06, 中国，ZL202210546946.2（授权）
微表情峰值自动检测方法、系统、设备及存储介质，2022-4-14，中国，ZL202210387781.9（授权）
人脸表情识别方法、系统、设备及存储介质，2022-4-28, 中国，CN202210459722.8
微表情检测方法、系统、设备及存储介质，2023-04-03，中国，CN202310345351 .5
文本情感原因的识别方法、系统、设备及存储介质，申请日期：2022-8-26, 中国， CN202211032385.0
一种基于可视交互的三维人体动作关键帧提取方法，申请日期：2022-11-23, 中国， CN202211476480.X

Hosted and Participated Research Projects

January 2025 - January 2028, 主持, The Natural Science Foundation of China, 国家自然科学基金青年基金
January 2023 - January 2024, 主持, The Natural Science Foundation of Sichuan, China, 省自然科学基金青年基金
My research is also supported by grants from these leading companies, e.g., Huawei, and Huadong Photoelectric

只要思想不滑坡,办法总比困难多！

There's always a way as long as you maintain in good a state of mind!

Biography

Awards and Honors

Selected Publications