Meng Xiao (肖濛)
肖濛
|

About Me
Passionate researcher dedicated to advancing AI for scientific discovery and life sciences
Meng Xiao received joint doctoral training from the University of Chinese Academy of Sciences and Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR), Singapore in June 2023.
Currently, he is a research fellow at DUKE-NUS Medical School, National University of Singapore. He is also an assistant researcher (postdoctoral fellow) at the Computer Network Information Center, Chinese Academy of Sciences from 2023.
Meng Xiao has published over 30 papers, including iMeta, NeurIPS, ICLR, ICML, IEEE TKDE, IEEE ICDE, ACM SIGKDD, AIJ, and ACM TKDD. He actively contributes to the academic community as a (s)PC member or reviewer on many premier international conferences and journals.
Research Interests
Latest News
Two papers accepted by CRAD DCAI Issue and BMC Bioinformatics!
Key Achievements
CAS President Award - Special Prize (中国科学院院长奖学金特别奖)
The highest award for graduate students of the CAS
2023Young Talent Support Project by BAST (北京市科协青年人才托举工程)
Batch of 2024-2026
2024Doctoral Degree (博士学位)
University of Chinese Academy of Sciences (中国科学院大学)
2023Published 30+ Research Articles
Includes top journals and conferences
To DateTimeline
Research Fellow
2025-PresentDUKE-NUS Medical School, NUS
Single-cell Data Mining, AI for lifescience
Assistant Researcher
2023-PresentComputer Network Information Center, CAS
Scientific Data Mining, Data-centric AI
PhD Student
2019-2023University of Chinese Academy of Sciences
Scientific Literature Mining, Interdisciplinary Knowledge Graph
Get in Touch
Interested in collaboration, research opportunities, or academic discussions?
Contact Information
Work Email
Academic Email
Computer Network Information Center, CAS
No. 2 South Dongsheng Road, Haidian District, Beijing
Duke-NUS Medical School, NUS
8 College Road, Singapore, 169857
Academic Profiles
Research Collaboration
I'm always interested in collaborating with researchers and practitioners in:
- Foundation models for biomedical scientific discovery
- Agentic AI applications in life sciences and healthcare
- Scientific literature data mining and knowledge discovery
- Data-centric AI and efficient machine learning systems
Academic Services
Area Chair
AISTAT 2025
Workshop Organizer
Data-Centric AI (ICDM 2023/2024, CIKM 2024)
Reviewer
NeurIPS, ICML, ICLR, SIGKDD, and more
Guest Editor
Electronics Special Issue
Response Time: I typically respond to emails within 2-3 business days. For urgent matters, please mention "URGENT" in the subject line.
Academic Impact
Quantifying research contributions and scholarly influence
Last updated: 8/23/2025
Featured Projects
Leading collaborative research initiatives and open-source contributions

SciHorizon: AI-for-Science Benchmark
A comprehensive benchmarking platform for evaluating AI systems in scientific applications, from data processing to large language model integration.

scCompass: Multi-Species scRNA-seq Database
An integrated database and analysis platform for single-cell RNA sequencing data across multiple species, designed for AI-ready applications.

GeneCompass: Gene Regulatory Foundation Model
A knowledge-informed cross-species foundation model for understanding universal gene regulatory mechanisms.

Data-Centric AI Workshop Series
Organizing workshops at top-tier conferences (ICDM, CIKM) to advance data-centric approaches in AI research.
Collaborative Research
Most projects are collaborative efforts with talented researchers from leading institutions. I believe in open science and actively contribute to open-source initiatives that benefit the broader research community.
Selected Publications
24 selected publications • First author, co-first author, and corresponding author works
2025

SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models
Chuan Qin, Xin Chen, Chengrui Wang, Pengmin Wu, Xi Chen, Yihang Cheng, Jingyi Zhao, Meng Xiao, Xiangchao Dong, Qingqing Long, Boya Pan, Han Wu, Chengzan Li, Yuanchun Zhou, Hui Xiong, Hengshu Zhu

scCompass: An Integrated Multi-Species scRNA-seq Database for AI-Ready
Pengfei Wang, Wenhao Liu, Jiajia Wang, Yana Liu, Pengjiang Li, Ping Xu, Wentao Cui, Ran Zhang, Qingqing Long, Zhilong Hu, Chen Fang, Jingxi Dong, Chunyang Zhang, Yan Chen, Chengrui Wang, Guole Liu, Hanyu Xie, Yiyang Zhang, Meng Xiao, Shubai Chen, Haiping Jiang, The X-Compass Consortium, Yiqiang Chen, Ge Yang, Shihua Zhang, Zhen Meng, Xuezhi Wang, Guihai Feng, Xin Li, Yuanchun Zhou

Gut microbiota and tuberculosis
Yanhua Liu, Ling Yang, Maryam Meskini, Anjana Goel, Monique Opperman, Sagar Singh Shyamal, Ajay Manaithiya, Meng Xiao, Ruizi Ni, Yajing An, Mingming Zhang, Yuan Tian, Shuang Zhou, Zhaoyang Ye, Li Zhuang, Linsheng Li, Istuti Saraswat, Ankita Kar, Syed Luqman Ali, Shakir Ullah, Syed Yasir Ali, Shradha Kaushik, Tianmu Tian, Mingyang Jiao, Shujun Wang, Giulia Ghisleni, Alice Armanni, Sara Fumagalli, WenYu Wang, Chao Cao, Miguel Prieto Lage, Marla Carpena Rodriguez, Antonia Bruno, Chanyuan Jin, Hanqing Hu, Yuhang Zhang, Ilse du Preez, Ashok Aspatwar, Lingxia Zhang, Wenping Gong

FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies
Tianqi He, Xiaohan Huang, Yi Du, Qingqing Long, Ziyue Qiao, Min Wu, Yanjie Fu, Yuanchun Zhou, Meng Xiao

Reinforcement Learning-based Feature Generation Algorithm for Scientific Data
Meng Xiao, Junfeng Zhou, Yuanchun Zhou

GCAL: Adapting Graph Models to Evolving Domain Shifts
Ziyue Qiao, Qianyi Cai, Hao Dong, Jiawei Gu, Pengyang Wang, Meng Xiao, Xiao Luo, Hui Xiong

Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training
Xunxin Cai, Chengrui Wang, Qingqing Long, Yuanchun Zhou, Meng Xiao

COMAE: COMprehensive Attribute Exploration for Zero-shot Hashing
Yuqi Li, Qingqing Long, Yihang Zhou, Ran Zhang, Zhiyuan Ning, Zhihong Zhu, Yuanchun Zhou, Xuezhi Wang, Meng Xiao
2024

GeneCompass: Deciphering Universal Gene Regulatory Mechanisms with a Knowledge-informed Cross-species Foundation Model
Xiaodong Yang, Guole Liu, Guihai Feng, Dechao Bu, Pengfei Wang, Jie Jiang, Shubai Chen, Qinmeng Yang, Hefan Miao, Yiyang Zhang, Zhenpeng Man, Zhongming Liang, Zichen Wang, Yaning Li, Zheng Li, Yana Liu, Yao Tian, Wenhao Liu, Cong Li, Ao Li, Jingxi Dong, Zhilong Hu, Chen Fang, Lina Cui, Zixu Deng, Haiping Jiang, Wentao Cui, Jiahao Zhang, Zhaohui Yang, Handong Li, Xingjian He, Liqun Zhong, Jiaheng Zhou, Zijian Wang, Qingqing Long, Ping Xu, Hongmei Wang, Zhen Meng, Xuezhi Wang, Yangang Wang, Yong Wang, Shihua Zhang, Jingtao Guo, Yi Zhao, Yuanchun Zhou, Fei Li, Jing Liu, Yiqiang Chen, Ge Yang, Xin Li

Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective
Meng Xiao, Dongjie Wang, Min Wu, Kunpeng Liu, Hui Xiong, Yuanchun Zhou, Yanjie Fu

Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation
Meng Xiao, Min Wu, Ziyue Qiao, Yanjie Fu, Zhiyuan Ning, Yi Du, Yuanchun Zhou

SCReader: Prompting Large Language Models to Interpret scRNA-seq Data
Cong Li, Qingqing Long, Yuanchun Zhou, Meng Xiao

GUME: Graphs and User Modalities Enhancement for Long-Tail Multimodal Recommendation
Guojiao Lin, Zhen Meng, Dongjie Wang, Qingqing Long, Yuanchun Zhou, Meng Xiao

MOAT: Graph Prompting for 3D Molecular Graphs
Qingqing Long, Yuchen Yan, Wentao Cui, Zhihong Zhu, Wei Ju, Zhihong Zhu, Yuanchun Zhou, Xuezhi Wang, Meng Xiao

GENESUM: Large Language Model-based Gene Summary Extraction
Zhijian Chen, Chuan Hu, Min Wu, Qingqing Long, Xuezhi Wang, Yuanchun Zhou, Meng Xiao
2023

Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification
Meng Xiao, Ziyue Qiao, Yanjie Fu, Hao Dong, Yi Du, Pengyang Wang, Hui Xiong, Yuanchun Zhou

Beyond Discrete Selection: Continuous Embedding Space Optimization for Generative Feature Selection
Meng Xiao, Dongjie Wang, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents
Meng Xiao, Dongjie Wang, Min Wu, Ziyue Qiao, Pengfei Wang, Kunpeng Liu, Yuanchun Zhou, Yanjie Fu

Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation
Xunxin Cai, Meng Xiao, Zhiyuan Ning, Yuanchun Zhou

Reinforcement-Enhanced Autoregressive Feature Transformation: Gradient-steered Search in Continuous Space for Postfix Expressions
Dongjie Wang, Meng Xiao, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis
Xu Ye, Meng Xiao, Zhiyuan Ning, Weiwei Dai, Wenjuan Cui, Yi Du, Yuanchun Zhou