Home
Fei Ma is a Researcher and Principal Investigator at Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), also known as Guangming Laboratory. He leads a research team of over 10 members, focusing on cutting-edge areas including affective computing, multimodal large models, AIGC, and human motion generation. Before joining the Guangming Laboratory, he served as a Senior Engineer at Huawei Cloud. Additionally, he had internship experiences at Tencent and OPPO. These experiences have equipped him with the ability to advance both research and implementation together. He received the B.S. degree from the University of Electronic Science and Technology of China (UESTC) in 2017, ranking first among 363 students in his major. He earned his Ph.D. degree from Tsinghua University in 2022 under the supervision of Prof. Lin Zhang.
Currently, he is actively looking for interns and visiting students (online or offline) who are passionate about the above research areas. Interested candidates are welcome to contact via email.
📣 Recent News
[Jul. 2025] One paper is accepted by TPAMI. Congratulations to Haiwei!
[Jul. 2025] One paper is accepted by ECAI.
[Jul. 2025] Two papers are accepted by ACM MM. Congratulations to Yihong, one of my first PhD students!
[Jul. 2025] One paper is accepted by ICML 2025 R2-FM Workshop.
[Jun. 2025] Two papers are accepted by IROS. Congratulations to Yifan and Zixuan!
[May 2025] One paper is accepted by IEEE Transactions on Affective Computing. Congratulations to Yifan and Yukan!
[May 2025] One paper is accepted by ACL 2025. Congratulations to Congzhi!
[May 2025] One paper is accepted by Information Fusion.
[Apr. 2025] Three papers are accepted by IJCAI 2025. Congratulations to Haiwei and Chenyang!
[Apr. 2025] One paper is accepted by ICMR 2025. Congratulations to Yifan!
[Apr. 2025] One papers is accepted by Information Fusion.
[Mar. 2025] Three papers are accepted by ICME 2025. Congratulations to Tao Feng, Xin Zhang, and Yang Xiang!
[Mar. 2025] One papers is accepted by IEEE TPAMI. Congratulations to Hongwei!
📝 Publications
-
* represents the first author, # represents the corresponding author.
-
For more paper information, please refer to the Google Scholar page.
Selected Journal Papers
1) F. Ma*, Y. Xie, Y. Li, Y. He, Y. Zhang, H. Ren, Z. Liu, W. Yao, F. Ren, F. Yu, S. Ni. A Review of Human Emotion Synthesis Based on Generative Technology. IEEE Transactions on Affective Computing, 2025. (IF: 9.6)
2) H. Hou, F. Ma*, Z. Li, F. Yu. VisualRWKV-HM: Enhancing Linear Visual-Language Models via Hybrid Mixing. Information Fusion, 2025. (IF: 14.8)
3) H. Ren, Y. Zhou, J. Zhu, X. Lin, H. Fu, Y. Huang, Y. Fang, F. Ma, H. Yu, B. Cheng. Rethinking Efficient and Effective point-based Networks for Event Camera Classification and Regression. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025. (IF: 20.8)
4) S. Chen, Z. Wu, K. Zhang, C. Li, B. Zhang, F. Ma, F. Yu, Q. Li. Exploring embodied multimodal large models: Development, datasets, and future directions. Information Fusion, 2025. (IF: 14.8)
5) F. Ma*, Y. Yuan, Y. Xie, H. Ren, I. Liu, Y. He, F. Ren, F. Yu, S. Ni. Generative Technology for Human Emotion Recognition: A Scoping Review. Information Fusion, 2024. (IF: 14.8)
6) C. Wang, H. Yu, X. Li, F. Ma, X. Wang, T. Taleb, V. Leung. Dependency-Aware Microservice Deployment for Edge Computing: A Deep Reinforcement Learning Approach with Network Representation. IEEE Transactions on Mobile Computing, 2024. (IF: 7.7)
7) Y. Liu, H. Hou, F. Ma #, S. Ni, F. Yu. MLLM-TA: Leveraging Multimodal Large Language Models for Precise Temporal Video Grounding. IEEE Signal Processing Letters, 2024. (IF: 3.2)
Selected Conference Papers
1) C. Zhang, J. Peng, Z. Wang, Y. Lai, H. Sun, H. Chang, F. Ma, W. Yu. VReST: Enhancing Reasoning in Large Vision-Language Models through Tree Search and Self-Reward Mechanism. ACL 2025. (CCF A)
2) H. Xue, Z. Zhang, M. Li, Z. Dai, F. Yu, F. Ma #, Z. Wu. VideoHumanMIB: Unlocking Appearance Decoupling for Video Human Motion In-betweening. IJCAI 2025. (CCF A)
3) W. Feng, Y. Zhu, R. Zhang, C. Wang, F. Ma, X. Wang, X. Li. Active Multimodal Distillation for Few-shot Action Recognition. IJCAI 2025. (CCF A)
4) G. Chen, Y. He, M. Yu, F. Yu, G. Xu, F. Ma, M. Li, G. Zhou. Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction. IJCAI 2025. (CCF A)
5) Y. Xie, F. Ma*, Y. Bin, Y. He, F. Yu. Audio-Driven Talking Face Video Generation with Joint Uncertainty Learning. ACM ICMR 2025.
6) Y. Xie, T. Feng, X. Zhang, X. Luo, Z. Guo, W. Yu, H. Chang, F. Ma #, F. Yu. PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis. AAAI 2025. (CCF A)
7) X. Xiang, Z. Dai, H. Xue, D. Wang, M. Li, Y. Yue, F. Ma #, W. Yu, H. Chang, F. Yu. ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters. AAAI 2025. (CCF A)
8) L. Wang, S. Shi, F. Ma, F. Yu, P. Li, Y. He. Subgraph Invariant Learning towards Large-scale Graph Node Classification. AAAI 2025. (CCF A)
9) X. Luo, X. Zhang, Y. Xie, X. Tong, W. Yu, H. Chang, F. Ma #, F. Yu. CodeSwap: Symmetrically Face Swapping Based on Prior Codebook. ACM MM 2024. (CCF A)
10) L. Xiong, X. Cheng, J. Tan, X. Wu, X. Li, L. Zhu, F. Ma, M. Li, H. Xu, Z. Hu. SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing. ACM MM 2024. (CCF A)
Selected Chinese Patents
1) 马飞*,徐洪波,谢长岭,卓一瑶,罗奕明,李阳,纪奕泓。一种基于多模态大模型的跌倒检测方法、系统、终端及存储介质。202510512556.7
2) 马飞*,卓一瑶,施斯,董淳光。一种个性化数字人预问诊平台。202411030663.8
3) 马飞*,卓一瑶,侯皓文,尹东富,李海鹏。一种智能陪护方法、智能陪护系统及计算机存储介质。202411159025.6
4) 马飞*,徐洪波,卓一瑶,董淳光,施斯。一种基于多智能体的微短剧自动化生成方法、系统及终端。202411315872.7
5) 马飞*,彭亮,李明磊,怀宝兴。数字人视频的生成方法、装置、设备及存储介质。202310429308.7
6) 马飞*,彭亮,李明磊,怀宝兴。数字人多媒体资源的生成方法、装置、设备及存储介质。202310389438.2
7) 马飞*,李明磊,刘辉,杨昌鹏。一种评估方法、装置及设备。202310934271.3
8) 马飞*,陈志毅,李明磊,怀宝兴。一种虚拟形象的管理方法及相关系统。202310486823.9
9) 马飞*,李明磊,怀宝兴,戴宗宏。一种数字人绑定评估方法。202311218488.0
10) 李国健,马飞*,徐洪波,卓一瑶,谢长岭,朱海俊,赵豫鄂,胡赫。一种基于大模型的水利智慧语音交互方法、系统、终端及存储介质。202510512558.6
11) 张鑫,马飞*,卓一瑶,花霖。一种基于扩散模型生成语义掩码的多模态人脸编辑方法。202411056500.7
12) 谢奕凡,马飞*,卓一瑶,田甄。一种基于神经辐射场的语音驱动数字人视频生成方法。202411071122.X
13) 罗向阳,马飞*,徐洪波,卓一瑶,刘洲,董君心。一种具有一致性故事插画生成的框架。202411242121.7
14) 董君心,马飞*,贺颖,董淳光,施斯,侯皓文。网站网页主题风格切换方法、装置、计算机设备及存储介质。202411989061.5
15) 彭亮,马飞*,李明磊,怀宝兴。一种虚拟对象的动作图像数据生成方法、装置及相关设备。202310489294.8
👥 Team
Research Engineers:
Hongbo Xu
Minghui Li
Interns/Mentored Students:
PhD Students: Yifan Xie, Hongwei Ren, Jiyue Jiang, Jian Chen, Yi Zhang, Ziheng Ye, Zixuan Guo, Hu Hu, Yihong Ji, Zebang Cheng
Master Students: Haiwei Xue, Xunzhi Xiang, Ledong An, Yiling Tao, Yihong Huang, Jingtao Zhou, Wenhao Zhang, Jintao Guan, Junhao Chen, Guojian Li
🌐 Other Activities
- Program Committee Member / Reviewer:
Journals:
Pattern Recognition
IEEE Transactions on Human-Machine Systems
IEEE Signal Processing Letters
Neurocomputing
Computers in Industry
Ocean Engineering
Signal Processing: Image Communication
Software: Practice and Experience
Behaviour & Information Technology
IEEE ACCESS
Conferences:
NeurIPS(2025, 2023, 2022)
ICML 2022
ICLR(2025-2022)
CVPR (2025)
AAAI 2026
ACM MM (2025-2023)
ICME (2023-2019)
ICASSP 2023
WACV 2026
ICIP 2021
ICONIP 2019
- Teaching:
Machine Learning, Spring 2025
Social Psychology and Behavioral Big Data, Spring 2023