Meng Xiao (肖濛)

肖濛

|

Computer Network Information Center, CAS
Duke-NUS Medical School, NUS
Meng Xiao (肖濛) Profile Photo

About Me

Passionate researcher dedicated to advancing AI for scientific discovery and life sciences

Meng Xiao received joint doctoral training from the University of Chinese Academy of Sciences and Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR), Singapore in June 2023.

Currently, he is a research fellow at DUKE-NUS Medical School, National University of Singapore. He is also an assistant researcher (postdoctoral fellow) at the Computer Network Information Center, Chinese Academy of Sciences from 2023.

Meng Xiao has published over 30 papers, including iMeta, NeurIPS, ICLR, ICML, IEEE TKDE, IEEE ICDE, ACM SIGKDD, AIJ, and ACM TKDD. He actively contributes to the academic community as a (s)PC member or reviewer on many premier international conferences and journals.

Research Interests

AI4SAI4DataData Mining

Latest News

2025.07Latest

Two papers accepted by CRAD DCAI Issue and BMC Bioinformatics!

2025.05

New survey about Gut microbiota and tuberculosis published!

Key Achievements

CAS President Award - Special Prize (中国科学院院长奖学金特别奖)

The highest award for graduate students of the CAS

2023

Young Talent Support Project by BAST (北京市科协青年人才托举工程)

Batch of 2024-2026

2024

Doctoral Degree (博士学位)

University of Chinese Academy of Sciences (中国科学院大学)

2023

Published 30+ Research Articles

Includes top journals and conferences

To Date

Timeline

Research Fellow

2025-Present

DUKE-NUS Medical School, NUS

Single-cell Data Mining, AI for lifescience

Assistant Researcher

2023-Present

Computer Network Information Center, CAS

Scientific Data Mining, Data-centric AI

PhD Student

2019-2023

University of Chinese Academy of Sciences

Scientific Literature Mining, Interdisciplinary Knowledge Graph

Get in Touch

Interested in collaboration, research opportunities, or academic discussions?

Contact Information

shaow@cnic.cn

Work Email

meng.xiao@nus.edu.sg

Academic Email

Computer Network Information Center, CAS

No. 2 South Dongsheng Road, Haidian District, Beijing

Duke-NUS Medical School, NUS

8 College Road, Singapore, 169857

Research Collaboration

I'm always interested in collaborating with researchers and practitioners in:

  • Foundation models for biomedical scientific discovery
  • Agentic AI applications in life sciences and healthcare
  • Scientific literature data mining and knowledge discovery
  • Data-centric AI and efficient machine learning systems

Academic Services

Area Chair

AISTAT 2025

Workshop Organizer

Data-Centric AI (ICDM 2023/2024, CIKM 2024)

Reviewer

NeurIPS, ICML, ICLR, SIGKDD, and more

Guest Editor

Electronics Special Issue

Response Time: I typically respond to emails within 2-3 business days. For urgent matters, please mention "URGENT" in the subject line.

Academic Impact

Quantifying research contributions and scholarly influence

709
Since 2019: 790
Total Citations
16
Since 2019: 15
h-index
22
Since 2019: 19
i10-index
2020: 14
2021: 8
2022: 20
2023: 59
2024: 228
2025: 370
Past 6 Years
Citation Trends

Last updated: 8/23/2025

Featured Projects

Leading collaborative research initiatives and open-source contributions

Featured
SciHorizon: AI-for-Science Benchmark thumbnail
SIGKDD 2025

SciHorizon: AI-for-Science Benchmark

A comprehensive benchmarking platform for evaluating AI systems in scientific applications, from data processing to large language model integration.

AI4ScienceBenchmarkLLM
Featured
scCompass: Multi-Species scRNA-seq Database thumbnail
Advanced Sciences 2025

scCompass: Multi-Species scRNA-seq Database

An integrated database and analysis platform for single-cell RNA sequencing data across multiple species, designed for AI-ready applications.

BioinformaticsDatabasescRNA-seq
Featured
GeneCompass: Gene Regulatory Foundation Model thumbnail
Cell Research 2024

GeneCompass: Gene Regulatory Foundation Model

A knowledge-informed cross-species foundation model for understanding universal gene regulatory mechanisms.

Foundation ModelGene RegulationCross-species
Data-Centric AI Workshop Series thumbnail
ICDM 2023/2024, CIKM 2024

Data-Centric AI Workshop Series

Organizing workshops at top-tier conferences (ICDM, CIKM) to advance data-centric approaches in AI research.

WorkshopData-Centric AICommunity

Collaborative Research

Most projects are collaborative efforts with talented researchers from leading institutions. I believe in open science and actively contribute to open-source initiatives that benefit the broader research community.

Selected Publications

24 selected publications • First author, co-first author, and corresponding author works

2025

SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models thumbnail
ACM SIGKDD2025Co-author
0 citations

SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models

Chuan Qin, Xin Chen, Chengrui Wang, Pengmin Wu, Xi Chen, Yihang Cheng, Jingyi Zhao, Meng Xiao, Xiangchao Dong, Qingqing Long, Boya Pan, Han Wu, Chengzan Li, Yuanchun Zhou, Hui Xiong, Hengshu Zhu

AI4Sciencebenchmarkingscientific datalarge language models
scCompass: An Integrated Multi-Species scRNA-seq Database for AI-Ready thumbnail
Advanced Sciences2025Co-author
0 citations

scCompass: An Integrated Multi-Species scRNA-seq Database for AI-Ready

Pengfei Wang, Wenhao Liu, Jiajia Wang, Yana Liu, Pengjiang Li, Ping Xu, Wentao Cui, Ran Zhang, Qingqing Long, Zhilong Hu, Chen Fang, Jingxi Dong, Chunyang Zhang, Yan Chen, Chengrui Wang, Guole Liu, Hanyu Xie, Yiyang Zhang, Meng Xiao, Shubai Chen, Haiping Jiang, The X-Compass Consortium, Yiqiang Chen, Ge Yang, Shihua Zhang, Zhen Meng, Xuezhi Wang, Guihai Feng, Xin Li, Yuanchun Zhou

single-cellscRNA-seqdatabasemulti-species
Gut microbiota and tuberculosis thumbnail
iMeta2025Co-first Author
0 citations

Gut microbiota and tuberculosis

Yanhua Liu, Ling Yang, Maryam Meskini, Anjana Goel, Monique Opperman, Sagar Singh Shyamal, Ajay Manaithiya, Meng Xiao, Ruizi Ni, Yajing An, Mingming Zhang, Yuan Tian, Shuang Zhou, Zhaoyang Ye, Li Zhuang, Linsheng Li, Istuti Saraswat, Ankita Kar, Syed Luqman Ali, Shakir Ullah, Syed Yasir Ali, Shradha Kaushik, Tianmu Tian, Mingyang Jiao, Shujun Wang, Giulia Ghisleni, Alice Armanni, Sara Fumagalli, WenYu Wang, Chao Cao, Miguel Prieto Lage, Marla Carpena Rodriguez, Antonia Bruno, Chanyuan Jin, Hanqing Hu, Yuhang Zhang, Ilse du Preez, Ashok Aspatwar, Lingxia Zhang, Wenping Gong

tuberculosisgut microbiotamicrobiomeAI
FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies thumbnail
IEEE ICDE2025Corresponding Author
0 citations

FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies

Tianqi He, Xiaohan Huang, Yi Du, Qingqing Long, Ziyue Qiao, Min Wu, Yanjie Fu, Yuanchun Zhou, Meng Xiao

feature transformationreinforcement learningexploration strategies
Reinforcement Learning-based Feature Generation Algorithm for Scientific Data thumbnail
Journal of Computer Research and Development2025First Author
0 citations

Reinforcement Learning-based Feature Generation Algorithm for Scientific Data

Meng Xiao, Junfeng Zhou, Yuanchun Zhou

feature generationreinforcement learningscientific data
GCAL: Adapting Graph Models to Evolving Domain Shifts thumbnail
ICML2025Corresponding Author
0 citations

GCAL: Adapting Graph Models to Evolving Domain Shifts

Ziyue Qiao, Qianyi Cai, Hao Dong, Jiawei Gu, Pengyang Wang, Meng Xiao, Xiao Luo, Hui Xiong

graph modelsdomain adaptationtransfer learning
Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training thumbnail
DASFAA2025Corresponding Author
0 citations

Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training

Xunxin Cai, Chengrui Wang, Qingqing Long, Yuanchun Zhou, Meng Xiao

dataset distillationbiomedicaldomain LLM
COMAE: COMprehensive Attribute Exploration for Zero-shot Hashing thumbnail
ACM ICMR2025Corresponding Author
0 citations

COMAE: COMprehensive Attribute Exploration for Zero-shot Hashing

Yuqi Li, Qingqing Long, Yihang Zhou, Ran Zhang, Zhiyuan Ning, Zhihong Zhu, Yuanchun Zhou, Xuezhi Wang, Meng Xiao

zero-shot hashingattribute explorationmultimedia retrieval

2024

GeneCompass: Deciphering Universal Gene Regulatory Mechanisms with a Knowledge-informed Cross-species Foundation Model thumbnail
Cell Research2024Co-author
12 citations

GeneCompass: Deciphering Universal Gene Regulatory Mechanisms with a Knowledge-informed Cross-species Foundation Model

Xiaodong Yang, Guole Liu, Guihai Feng, Dechao Bu, Pengfei Wang, Jie Jiang, Shubai Chen, Qinmeng Yang, Hefan Miao, Yiyang Zhang, Zhenpeng Man, Zhongming Liang, Zichen Wang, Yaning Li, Zheng Li, Yana Liu, Yao Tian, Wenhao Liu, Cong Li, Ao Li, Jingxi Dong, Zhilong Hu, Chen Fang, Lina Cui, Zixu Deng, Haiping Jiang, Wentao Cui, Jiahao Zhang, Zhaohui Yang, Handong Li, Xingjian He, Liqun Zhong, Jiaheng Zhou, Zijian Wang, Qingqing Long, Ping Xu, Hongmei Wang, Zhen Meng, Xuezhi Wang, Yangang Wang, Yong Wang, Shihua Zhang, Jingtao Guo, Yi Zhao, Yuanchun Zhou, Fei Li, Jing Liu, Yiqiang Chen, Ge Yang, Xin Li

gene regulationfoundation modelcross-species
Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective thumbnail
ACM TKDD2024First Author
15 citations

Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective

Meng Xiao, Dongjie Wang, Min Wu, Kunpeng Liu, Hui Xiong, Yuanchun Zhou, Yanjie Fu

feature transformationgroup-wise optimizationdual optimization
Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation thumbnail
ACM TKDD2024First Author
8 citations

Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation

Meng Xiao, Min Wu, Ziyue Qiao, Yanjie Fu, Zhiyuan Ning, Yi Du, Yuanchun Zhou

research proposaltopic inferencefairnesstransformer
SCReader: Prompting Large Language Models to Interpret scRNA-seq Data thumbnail
IEEE ICDM2024Corresponding Author
3 citations

SCReader: Prompting Large Language Models to Interpret scRNA-seq Data

Cong Li, Qingqing Long, Yuanchun Zhou, Meng Xiao

scRNA-seqlarge language modelsbioinformatics
GUME: Graphs and User Modalities Enhancement for Long-Tail Multimodal Recommendation thumbnail
ACM CIKM2024Corresponding Author
5 citations

GUME: Graphs and User Modalities Enhancement for Long-Tail Multimodal Recommendation

Guojiao Lin, Zhen Meng, Dongjie Wang, Qingqing Long, Yuanchun Zhou, Meng Xiao

multimodal recommendationlong-tailgraph neural networks
MOAT: Graph Prompting for 3D Molecular Graphs thumbnail
ACM CIKM2024Corresponding Author
2 citations

MOAT: Graph Prompting for 3D Molecular Graphs

Qingqing Long, Yuchen Yan, Wentao Cui, Zhihong Zhu, Wei Ju, Zhihong Zhu, Yuanchun Zhou, Xuezhi Wang, Meng Xiao

molecular graphs3D graphsgraph prompting
GENESUM: Large Language Model-based Gene Summary Extraction thumbnail
IEEE BIBM2024Corresponding Author
1 citations

GENESUM: Large Language Model-based Gene Summary Extraction

Zhijian Chen, Chuan Hu, Min Wu, Qingqing Long, Xuezhi Wang, Yuanchun Zhou, Meng Xiao

gene summarylarge language modelsbioinformatics
DP-CRE: Continual Relation Extraction via Decoupled Contrastive Learning and Memory Structure Preservation thumbnail
LREC-COLING2024Co-first Author
4 citations

DP-CRE: Continual Relation Extraction via Decoupled Contrastive Learning and Memory Structure Preservation

Mengyi Huang, Meng Xiao, Ludi Wang, Yi Du

continual learningrelation extractioncontrastive learning

2023

Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification thumbnail
IEEE TKDE2023First Author
25 citations

Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification

Meng Xiao, Ziyue Qiao, Yanjie Fu, Hao Dong, Yi Du, Pengyang Wang, Hui Xiong, Yuanchun Zhou

topic detectionresearch proposalhierarchical classification
Beyond Discrete Selection: Continuous Embedding Space Optimization for Generative Feature Selection thumbnail
IEEE ICDM2023First Author
28 citations

Beyond Discrete Selection: Continuous Embedding Space Optimization for Generative Feature Selection

Meng Xiao, Dongjie Wang, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

feature selectioncontinuous optimizationgenerative models
Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents thumbnail
SIAM SDM2023First Author
18 citations

Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents

Meng Xiao, Dongjie Wang, Min Wu, Ziyue Qiao, Pengfei Wang, Kunpeng Liu, Yuanchun Zhou, Yanjie Fu

feature transformationactor-critictraceable learning
Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation thumbnail
IEEE ICDM2023Corresponding Author
12 citations

Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation

Xunxin Cai, Meng Xiao, Zhiyuan Ning, Yuanchun Zhou

topic inferencedata augmentationlarge language models
Reinforcement-Enhanced Autoregressive Feature Transformation: Gradient-steered Search in Continuous Space for Postfix Expressions thumbnail
NeurIPS2023Co-first AuthorSpotlight Paper
42 citations

Reinforcement-Enhanced Autoregressive Feature Transformation: Gradient-steered Search in Continuous Space for Postfix Expressions

Dongjie Wang, Meng Xiao, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

autoregressivefeature transformationreinforcement learning
NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis thumbnail
SIAM SDM2023Co-first Author
8 citations

NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis

Xu Ye, Meng Xiao, Zhiyuan Ning, Weiwei Dai, Wenjuan Cui, Yi Du, Yuanchun Zhou

medical diagnosiseye diseaseshierarchical transformer
Semi-supervised Domain Adaptation in Graph Transfer Learning thumbnail
IJCAI2023Co-first Author
22 citations

Semi-supervised Domain Adaptation in Graph Transfer Learning

Ziyue Qiao, Luo Xiao, Meng Xiao, Hao Dong, Yuanchun Zhou, Hui Xiong

domain adaptationgraph transfer learningsemi-supervised

2021

Expert Knowledge Guided Length-Variant Hierarchical Label Generation for Proposal Classification thumbnail
IEEE ICDM2021First Author
35 citations

Expert Knowledge Guided Length-Variant Hierarchical Label Generation for Proposal Classification

Meng Xiao, Ziyue Qiao, Yanjie Fu, Yi Du, Pengyang Wang, Yuanchun Zhou

hierarchical classificationexpert knowledgeproposal classification