Naïve Bayesian Models for Vero Cell Cytotoxicity

Alexander L Perryman; Jimmy S Patel; Riccardo Russo; Eric Singleton; Nancy Connell; Sean Ekins; Joel S Freundlich

doi:10.1007/s11095-018-2439-9

Pharm Res. 2018 Jun 29;35(9):170. doi: 10.1007/s11095-018-2439-9.

Authors

Alexander L Perryman¹, Jimmy S Patel¹, Riccardo Russo², Eric Singleton², Nancy Connell², Sean Ekins³, Joel S Freundlich⁴,⁵

Affiliations

¹ Department of Pharmacology, Physiology and Neuroscience, and Medicine, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA.
² Division of Infectious Diseases, Department of Medicine, and the Ruy V. Lourenço Center for the Study of Emerging and Re-emerging Pathogens, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA.
³ Collaborations Pharmaceuticals, Inc., Main Campus Drive Lab 3510, Raleigh, North Carolina, 27606, USA.
⁴ Department of Pharmacology, Physiology and Neuroscience, and Medicine, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA. freundjs@rutgers.edu.
⁵ Division of Infectious Diseases, Department of Medicine, and the Ruy V. Lourenço Center for the Study of Emerging and Re-emerging Pathogens, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA. freundjs@rutgers.edu.

Abstract

Purpose: To advance translational research of potential therapeutic small molecules against infectious microbes, the compounds must display a relative lack of mammalian cell cytotoxicity. Vero cell cytotoxicity (CC₅₀) is a common initial assay for this metric. We explored the development of naïve Bayesian models that can enhance the probability of identifying non-cytotoxic compounds.

Methods: Vero cell cytotoxicity assays were identified in PubChem, reformatted, and curated to create a training set with 8741 unique small molecules. These data were used to develop Bayesian classifiers, which were assessed with internal cross-validation, external tests with a set of 193 compounds from our laboratory, and independent validation with an additional diverse set of 1609 unique compounds from PubChem.
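As a rough illustration of the kind of classifier described in the Methods, the sketch below fits a Bernoulli naïve Bayes model with Laplace smoothing to binary feature vectors. It is not the authors' implementation: the four-bit vectors are toy stand-ins for real molecular fingerprints, the labeling convention (1 = acceptably non-cytotoxic) is an assumption, and the function names `train_bernoulli_nb`, `log_odds`, and `predict` are hypothetical.

```python
import math

def train_bernoulli_nb(X, y, alpha=1.0):
    """Fit a Bernoulli naive Bayes model on binary feature vectors.

    X: list of equal-length 0/1 lists (toy stand-ins for fingerprint bits)
    y: list of 0/1 labels (1 = non-cytotoxic here; an assumed convention)
    alpha: Laplace smoothing constant
    """
    n_features = len(X[0])
    model = {}
    for label in (0, 1):
        rows = [x for x, lab in zip(X, y) if lab == label]
        n = len(rows)
        # Smoothed probability that each bit is set within this class
        p_bit = [(sum(r[j] for r in rows) + alpha) / (n + 2 * alpha)
                 for j in range(n_features)]
        model[label] = {"log_prior": math.log(n / len(y)), "p_bit": p_bit}
    return model

def log_odds(model, x):
    """Log-probability of class 1 minus that of class 0 for one vector."""
    scores = {}
    for label, params in model.items():
        s = params["log_prior"]
        for bit, p in zip(x, params["p_bit"]):
            s += math.log(p if bit else 1.0 - p)
        scores[label] = s
    return scores[1] - scores[0]

def predict(model, x):
    """Return 1 when the model favors class 1, else 0."""
    return 1 if log_odds(model, x) > 0 else 0

# Toy training data (fabricated for illustration only, not from the paper)
X_train = [[1, 1, 0, 0], [1, 0, 0, 0], [1, 1, 1, 0],
           [0, 0, 1, 1], [0, 1, 1, 1], [0, 0, 0, 1]]
y_train = [1, 1, 1, 0, 0, 0]

model = train_bernoulli_nb(X_train, y_train)
print(predict(model, [1, 0, 0, 0]), predict(model, [0, 0, 1, 1]))
```

In practice the feature vectors would come from a cheminformatics toolkit rather than hand-written bits, and the model would be assessed on held-out compounds, as the Methods describe.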

Results: Evaluation with independent, external test and validation sets indicated that cytotoxicity Bayesian models constructed with the ECFP_6 descriptor were more accurate than those that used FCFP_6 fingerprints. The best cytotoxicity Bayesian model displayed predictive power in external evaluations, according to conventional and chance-corrected statistics, as well as enrichment factors.
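The statistics mentioned in the Results can be illustrated with a small sketch that computes accuracy (a conventional statistic), Cohen's kappa and the Matthews correlation coefficient (two common chance-corrected statistics), and an enrichment factor from binary predictions. The abstract does not say which specific statistics or which enrichment-factor definition the authors used; the choices below, including defining enrichment as precision divided by the base rate of positives, are assumptions, and the labels are toy data.

```python
import math

def confusion_counts(y_true, y_pred):
    """Return (tp, fp, tn, fn) for binary labels where 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn

def accuracy(tp, fp, tn, fn):
    return (tp + tn) / (tp + fp + tn + fn)

def cohen_kappa(tp, fp, tn, fn):
    """Observed agreement corrected for the agreement expected by chance."""
    n = tp + fp + tn + fn
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n ** 2
    return (observed - expected) / (1 - expected)

def matthews_cc(tp, fp, tn, fn):
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

def enrichment_factor(tp, fp, tn, fn):
    """Hit rate among predicted positives relative to the overall base rate
    (an assumed definition for this sketch)."""
    n = tp + fp + tn + fn
    return (tp / (tp + fp)) / ((tp + fn) / n)

# Toy labels (fabricated for illustration only, not from the paper)
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

counts = confusion_counts(y_true, y_pred)
print(accuracy(*counts), cohen_kappa(*counts),
      matthews_cc(*counts), enrichment_factor(*counts))
```

An enrichment factor above 1 under this definition means that compounds predicted positive contain a higher fraction of true positives than the set as a whole.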

Conclusions: The results from external tests demonstrate that our novel cytotoxicity Bayesian model displays sufficient predictive power to help guide translational research. To assist the chemical tool and drug discovery communities, our curated training set is being distributed as part of the Supplementary Material.

Graphical Abstract: Naïve Bayesian models have been trained with publicly available data and offer a useful tool for chemical biology and drug discovery to select for small molecules with a high probability of exhibiting acceptably low Vero cell cytotoxicity.

Keywords: Bayesian model; machine learning; predicting mammalian cytotoxicity; translational research; Vero cell CC₅₀.

MeSH terms

Animals
Bayes Theorem*
Chlorocebus aethiops
Databases, Pharmaceutical
Drug Discovery
Information Storage and Retrieval
Models, Biological*
Models, Molecular
Small Molecule Libraries / chemistry
Small Molecule Libraries / toxicity*
Toxicity Tests / methods*
Vero Cells

Substances

Small Molecule Libraries
