Meng Xiao (肖濛)
肖濛
|

About Me
Passionate researcher dedicated to advancing AI for scientific discovery and life sciences
Meng Xiao received joint doctoral training from the University of Chinese Academy of Sciences and Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR), Singapore in June 2023.
Currently, he is an associate professor at the Computer Network Information Center, Chinese Academy of Sciences. He is also a research fellow at DUKE-NUS Medical School, National University of Singapore.
Meng Xiao has published over 40 papers, including iMeta, the Innovation Life, NeurIPS, ICLR, ICML, IEEE TKDE, IEEE ICDE, ACM SIGKDD, AIJ, and ACM TKDD. He actively contributes to the academic community as a (s)PC member or reviewer on many premier international conferences and journals.
Research Interests
Latest News
One paper "Towards Data-Centric AI: A Comprehensive Survey of Traditional, Reinforcement, and Generative Approaches for Tabular Data Transformation" accepted by ACM TKDD!
One paper "A Cross-Modal Hierarchical Contrastive Learning Framework for Protein-Protein Interaction Prediction" accepted by DASFAA 2026!
Key Achievements
CAS President Award - Special Prize (中国科学院院长奖学金特别奖)
The highest award for graduate students of the CAS
2023Young Talent Support Project by BAST (北京市科协青年人才托举工程)
Batch of 2024-2026
2024Doctoral Degree (博士学位)
University of Chinese Academy of Sciences (中国科学院大学)
2023Published 40+ Research Articles
Includes top journals and conferences
To DateTimeline
Associate Professor
2026-PresentComputer Network Information Center, CAS
Scientific Data Mining, Data-centric AI
Research Fellow
2025-PresentDUKE-NUS Medical School, NUS
Single-cell Data Mining, AI for lifescience
Assistant Researcher
2023-2025Computer Network Information Center, CAS
Scientific Data Mining, Data-centric AI
PhD Student
2019-2023University of Chinese Academy of Sciences
Scientific Literature Mining, Interdisciplinary Knowledge Graph
Get in Touch
Interested in collaboration, research opportunities, or academic discussions?
Contact Information
Work Email
Academic Email
Computer Network Information Center, CAS
Duke-NUS Medical School, NUS
Academic Profiles
Research Collaboration
I'm always interested in collaborating with researchers and practitioners in:
- Foundation models for biomedical scientific discovery
- Agentic AI applications in life sciences and healthcare
- Scientific literature data mining and knowledge discovery
- Data-centric AI and efficient machine learning systems
Academic Services
Area Chair
AISTATS
Workshop Organizer
Data-Centric AI (ICDM 2023/2024, CIKM 2024)
Reviewer
NeurIPS, ICML, ICLR, SIGKDD, and more
Journal Reviewer
Cell Research, Scientific Report, TKDD, and more
Response Time: I typically respond to emails within 2-3 business days. For urgent matters, please mention "URGENT" in the subject line.
Academic Impact
Quantifying research contributions and scholarly influence
Last updated: 3/11/2026
Featured Projects
Leading collaborative research initiatives and open-source contributions

SciHorizon: AI-for-Science Benchmark
A comprehensive benchmarking platform for evaluating AI systems in scientific applications, from data processing to large language model integration.

scCompass: Multi-Species scRNA-seq Database
An integrated database and analysis platform for single-cell RNA sequencing data across multiple species, designed for AI-ready applications.

GeneCompass: Gene Regulatory Foundation Model
A knowledge-informed cross-species foundation model for understanding universal gene regulatory mechanisms.

Data-Centric AI Workshop Series
Organizing workshops at top-tier conferences (ICDM, CIKM) to advance data-centric approaches in AI research.
Collaborative Research
Most projects are collaborative efforts with talented researchers from leading institutions. I believe in open science and actively contribute to open-source initiatives that benefit the broader research community.
Selected Publications
28 selected publications • First author, co-first author, and corresponding author works
2026

A Cross-Modal Hierarchical Contrastive Learning Framework for Protein-Protein Interaction Prediction
Ran Zhang, Yihong Wang, Xuezhi Wang, QingQing Long, Jianghua Zhao, Meng Xiao

A Comprehensive Survey on Artificial Intelligence for Biomolecule Design
Guang Yang, Jianing Li, Qiong Wang, Lianlian Wu, Zhengshan Chen, Peng-Cheng Zhao, Haoyang Wang, Qinglong Wang, Bing Wen, Chao Song, Wenxin Xu, Xiongfei He, Xiao Zhang, Sophia Tsoka, Siu-Ming Yiu, Fang-Xiang Wu, Meng Xiao, Shirui Pan, Hui Yu, Song He, Xiaoli Li, Xiaochen Bo, Min Wu, Jian-Yu Shi
2025

Adapting Graph Models via Target Integrity Assessment and Source Distribution Hypothesis
Ziyue Qiao, Xiaomin Yu, Weiyu Guo, Xiao Luo, Meng Xiao, Hui Xiong

Knowledge-Guided Gene Panel Selection for Label-Free Single-Cell RNA-Seq Data: A Reinforcement Learning Perspective
Meng Xiao, Weiliang Zhang, Xiaohan Huang, Hengshu Zhu, Min Wu, Xiaoli Li, Yuanchun Zhou

SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models
Chuan Qin, Xin Chen, Chengrui Wang, Pengmin Wu, Xi Chen, Yihang Cheng, Jingyi Zhao, Meng Xiao, Xiangchao Dong, Qingqing Long, Boya Pan, Han Wu, Chengzan Li, Yuanchun Zhou, Hui Xiong, Hengshu Zhu

scCompass: An Integrated Multi-Species scRNA-seq Database for AI-Ready
Pengfei Wang, Wenhao Liu, Jiajia Wang, Yana Liu, Pengjiang Li, Ping Xu, Wentao Cui, Ran Zhang, Qingqing Long, Zhilong Hu, Chen Fang, Jingxi Dong, Chunyang Zhang, Yan Chen, Chengrui Wang, Guole Liu, Hanyu Xie, Yiyang Zhang, Meng Xiao, Shubai Chen, Haiping Jiang, The X-Compass Consortium, Yiqiang Chen, Ge Yang, Shihua Zhang, Zhen Meng, Xuezhi Wang, Guihai Feng, Xin Li, Yuanchun Zhou

Gut microbiota and tuberculosis
Yanhua Liu, Ling Yang, Maryam Meskini, Anjana Goel, Monique Opperman, Sagar Singh Shyamal, Ajay Manaithiya, Meng Xiao, Ruizi Ni, Yajing An, Mingming Zhang, Yuan Tian, Shuang Zhou, Zhaoyang Ye, Li Zhuang, Linsheng Li, Istuti Saraswat, Ankita Kar, Syed Luqman Ali, Shakir Ullah, Syed Yasir Ali, Shradha Kaushik, Tianmu Tian, Mingyang Jiao, Shujun Wang, Giulia Ghisleni, Alice Armanni, Sara Fumagalli, WenYu Wang, Chao Cao, Miguel Prieto Lage, Marla Carpena Rodriguez, Antonia Bruno, Chanyuan Jin, Hanqing Hu, Yuhang Zhang, Ilse du Preez, Ashok Aspatwar, Lingxia Zhang, Wenping Gong

FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies
Tianqi He, Xiaohan Huang, Yi Du, Qingqing Long, Ziyue Qiao, Min Wu, Yanjie Fu, Yuanchun Zhou, Meng Xiao

Reinforcement Learning-based Feature Generation Algorithm for Scientific Data
Meng Xiao, Junfeng Zhou, Yuanchun Zhou

GCAL: Adapting Graph Models to Evolving Domain Shifts
Ziyue Qiao, Qianyi Cai, Hao Dong, Jiawei Gu, Pengyang Wang, Meng Xiao, Xiao Luo, Hui Xiong

Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training
Xunxin Cai, Chengrui Wang, Qingqing Long, Yuanchun Zhou, Meng Xiao

COMAE: COMprehensive Attribute Exploration for Zero-shot Hashing
Yuqi Li, Qingqing Long, Yihang Zhou, Ran Zhang, Zhiyuan Ning, Zhihong Zhu, Yuanchun Zhou, Xuezhi Wang, Meng Xiao
2024

GeneCompass: Deciphering Universal Gene Regulatory Mechanisms with a Knowledge-informed Cross-species Foundation Model
Xiaodong Yang, Guole Liu, Guihai Feng, Dechao Bu, Pengfei Wang, Jie Jiang, Shubai Chen, Qinmeng Yang, Hefan Miao, Yiyang Zhang, Zhenpeng Man, Zhongming Liang, Zichen Wang, Yaning Li, Zheng Li, Yana Liu, Yao Tian, Wenhao Liu, Cong Li, Ao Li, Jingxi Dong, Zhilong Hu, Chen Fang, Lina Cui, Zixu Deng, Haiping Jiang, Wentao Cui, Jiahao Zhang, Zhaohui Yang, Handong Li, Xingjian He, Liqun Zhong, Jiaheng Zhou, Zijian Wang, Qingqing Long, Ping Xu, Hongmei Wang, Zhen Meng, Xuezhi Wang, Yangang Wang, Yong Wang, Shihua Zhang, Jingtao Guo, Yi Zhao, Yuanchun Zhou, Fei Li, Jing Liu, Yiqiang Chen, Ge Yang, Xin Li

Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective
Meng Xiao, Dongjie Wang, Min Wu, Kunpeng Liu, Hui Xiong, Yuanchun Zhou, Yanjie Fu

Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation
Meng Xiao, Min Wu, Ziyue Qiao, Yanjie Fu, Zhiyuan Ning, Yi Du, Yuanchun Zhou

SCReader: Prompting Large Language Models to Interpret scRNA-seq Data
Cong Li, Qingqing Long, Yuanchun Zhou, Meng Xiao

GUME: Graphs and User Modalities Enhancement for Long-Tail Multimodal Recommendation
Guojiao Lin, Zhen Meng, Dongjie Wang, Qingqing Long, Yuanchun Zhou, Meng Xiao

MOAT: Graph Prompting for 3D Molecular Graphs
Qingqing Long, Yuchen Yan, Wentao Cui, Zhihong Zhu, Wei Ju, Zhihong Zhu, Yuanchun Zhou, Xuezhi Wang, Meng Xiao

GENESUM: Large Language Model-based Gene Summary Extraction
Zhijian Chen, Chuan Hu, Min Wu, Qingqing Long, Xuezhi Wang, Yuanchun Zhou, Meng Xiao
2023

Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification
Meng Xiao, Ziyue Qiao, Yanjie Fu, Hao Dong, Yi Du, Pengyang Wang, Hui Xiong, Yuanchun Zhou

Beyond Discrete Selection: Continuous Embedding Space Optimization for Generative Feature Selection
Meng Xiao, Dongjie Wang, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents
Meng Xiao, Dongjie Wang, Min Wu, Ziyue Qiao, Pengfei Wang, Kunpeng Liu, Yuanchun Zhou, Yanjie Fu

Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation
Xunxin Cai, Meng Xiao, Zhiyuan Ning, Yuanchun Zhou

Reinforcement-Enhanced Autoregressive Feature Transformation: Gradient-steered Search in Continuous Space for Postfix Expressions
Dongjie Wang, Meng Xiao, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis
Xu Ye, Meng Xiao, Zhiyuan Ning, Weiwei Dai, Wenjuan Cui, Yi Du, Yuanchun Zhou


