Drug target inference by mining transcriptional data using a novel graph convolutional network framework

Protein Cell. 2022 Apr;13(4):281-301. doi: 10.1007/s13238-021-00885-0. Epub 2021 Oct 22.

Abstract

A fundamental challenge that arises in biomedicine is the need to characterize compounds in a relevant cellular context in order to reveal potential on-target or off-target effects. Recently, the fast accumulation of gene transcriptional profiling data provides us an unprecedented opportunity to explore the protein targets of chemical compounds from the perspective of cell transcriptomics and RNA biology. Here, we propose a novel Siamese spectral-based graph convolutional network (SSGCN) model for inferring the protein targets of chemical compounds from gene transcriptional profiles. Although the gene signature of a compound perturbation only provides indirect clues of the interacting targets, and the biological networks under different experiment conditions further complicate the situation, the SSGCN model was successfully trained to learn from known compound-target pairs by uncovering the hidden correlations between compound perturbation profiles and gene knockdown profiles. On a benchmark set and a large time-split validation dataset, the model achieved higher target inference accuracy as compared to previous methods such as Connectivity Map. Further experimental validations of prediction results highlight the practical usefulness of SSGCN in either inferring the interacting targets of compound, or reversely, in finding novel inhibitors of a given target of interest.

Keywords: deep learning; drug target inference; experimental verification; transcriptomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Drug Delivery Systems*
  • Proteins*
  • Transcriptome

Substances

  • Proteins