特约综述

“人工智能与生物医学”专题特约综述:深度学习在单细胞转录组学中的应用

展开
  • (1 中国科学院广州生物医药与健康研究院, 粤港干细胞与再生医学联合实验室, 广东省干细胞与再生医学重点实验室,数字生物医学研究中心, 广州 510530; 2 中国科学院大学, 北京 100049; 3 广东粤港澳大湾区协同创新研究院,广州 510555)
△ shaojf@jingjinji.cn; zhao_yongbing@gibh.ac.cn

收稿日期: 2025-01-03

  修回日期: 2025-03-10

  录用日期: 2025-03-12

  网络出版日期: 2025-06-25

基金资助

国家重点研发计划(2024YFA1802300)和广东省科技计划(2023B1212060050,2023B1212120009)资助课题

Deep Learning Applications in Single-Cell Transcriptomics

Expand
  • (1 Center for Biomedical Digital Science, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangdong-Hong Kong Joint Laboratory for Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China; 2 University of Chinese Academy of Sciences, Beijing 100049, China; 3 GBA Institute of Collaborative Innovation, Guangzhou 510555, China)
△ shaojf@jingjinji.cn; zhao_yongbing@gibh.ac.cn

Received date: 2025-01-03

  Revised date: 2025-03-10

  Accepted date: 2025-03-12

  Online published: 2025-06-25

摘要

单细胞转录组测序(single-cell RNA sequencing)是一种在单细胞水平上对全基因组基因表达进行高通量测序的技术,能有效解析细胞群体异质性,目前广泛应用于发育、疾病等研究领域。由于单细胞转录组数据通常存在高噪声、高维度和高稀疏性等特征,传统分析方法在处理这些数据时存在明显局限性。近年来,以自编码器、生成对抗网络为代表的深度学习模型被广泛应用到单细胞转录组数据分析中,包括表达值插补、批次效应校正、数据降维、细胞聚类和细胞类型注释等,并展现了深度学习在单细胞转录组数据分析中的优越性。特别地,基于Transformer的深度学习大模型,通过自注意力机制学习基因间隐含依赖关系以及基因表达与细胞之间的关联,为单细胞转录组数据分析提供了新路径和发展方向,并为多模态组学整合分析提供了创新的解决方案和潜在的应用前景。

本文引用格式

王天宇, 高诗铠, 邵金凤, 赵永兵 . “人工智能与生物医学”专题特约综述:深度学习在单细胞转录组学中的应用[J]. 生理科学进展, 2025 , 56(3) : 226 -234 . DOI: 10.20059/j.cnki.pps.2025.03.1008

Abstract

Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology that profiles genome-wide gene expression at the single-cell level,and can efficiently resolve cellular heterogeneity.It is widely applied in fields such as developmental biology and disease research. However,scRNA-seq data often exhibit characteristics such as high noise,high dimensionality, and high sparsity,which pose significant challenges to traditional data analysis methods.In recent years,deep learning models,represented by autoencoders and generative adversarial networks, have been extensively applied to scRNA-seq data analysis tasks,including expression imputation, batch effect correction,dimensionality reduction,cell clustering,and cell type annotation. These applications demonstrate the power of deep learning. Notably, Transformer-based deep learning models, leveraging self-attention mechanisms to capture implicit dependencies among genes and associations between gene expression and cells, offer a novel strategy and direction for scRNA-seq data analysis, and provide innovative solutions with promising applications for the integration of multimodal omics data.
Options
文章导航

/