hdWGCNA identifies co-expression networks in high-dimensional transcriptomics data

Cell Rep Methods. 2023 Jun 12;3(6):100498. doi: 10.1016/j.crmeth.2023.100498. eCollection 2023 Jun 26.

Abstract

Biological systems are immensely complex, organized into a multi-scale hierarchy of functional units based on tightly regulated interactions between distinct molecules, cells, organs, and organisms. While experimental methods enable transcriptome-wide measurements across millions of cells, popular bioinformatic tools do not support systems-level analysis. Here we present hdWGCNA, a comprehensive framework for analyzing co-expression networks in high-dimensional transcriptomics data such as single-cell and spatial RNA sequencing (RNA-seq). hdWGCNA provides functions for network inference, gene module identification, gene enrichment analysis, statistical tests, and data visualization. Beyond conventional single-cell RNA-seq, hdWGCNA is capable of performing isoform-level network analysis using long-read single-cell data. We showcase hdWGCNA using data from autism spectrum disorder and Alzheimer's disease brain samples, identifying disease-relevant co-expression network modules. hdWGCNA is directly compatible with Seurat, a widely used R package for single-cell and spatial transcriptomics analysis, and we demonstrate the scalability of hdWGCNA by analyzing a dataset containing nearly 1 million cells.

Keywords: Alzheimer's disease; Autism spectrum disorder; co-expression network; gene network; long-read RNA-seq; microglia; single-cell RNA-seq; single-cell genomics; spatial transcriptomics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alzheimer Disease* / genetics
  • Autism Spectrum Disorder* / genetics
  • Gene Expression Profiling
  • Gene Regulatory Networks / genetics
  • Humans
  • Transcriptome / genetics