A novel algorithm for scalable and accurate Bayesian network learning

Laura E Brown; Ioannis Tsamardinos; Constantin F Aliferis

A novel algorithm for scalable and accurate Bayesian network learning

Stud Health Technol Inform. 2004;107(Pt 1):711-5.

Authors

Laura E Brown¹, Ioannis Tsamardinos, Constantin F Aliferis

Affiliation

¹ Discoivery Systems Laboratory, Department of Biomedical Informatics, Vanderbilt University, 2209 Garland Avenue, Nashville, TN 37232, USA. laura.e.brown@vanderbilt.edu

PMID: 15360905

Abstract

Bayesian Networks (BN) is a knowledge representation formalism that has been proven to be valuable in biomedicine for constructing decision support systems and for generating causal hypotheses from data. Given the emergence of datasets in medicine and biology with thousands of variables and that current algorithms do not scale more than a few hundred variables in practical domains, new efficient and accurate algorithms are needed to learn high quality BNs from data. We present a new algorithm called Max-Min Hill-Climbing (MMHC) that builds upon and improves the Sparse Candidate (SC) algorithm; a state-of-the-art algorithm that scales up to datasets involving hundreds of variables provided the generating networks are sparse. Compared to the SC, on a number of datasets from medicine and biology, (a) MMHC discovers BNs that are structurally closer to the data-generating BN, (b) the discovered networks are more probable given the data, (c) MMHC is computationally more efficient and scalable than SC, and (d) the generating networks are not required to be uniformly sparse nor is the user of MMHC required to guess correctly the network connectivity

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms*
Artificial Intelligence
Bayes Theorem*
Humans
Neural Networks, Computer*

Abstract

Publication types

MeSH terms

Grants and funding