Multiple-Swarm Ensembles: Improving the Predictive Power and Robustness of Predictive Models and Its Use in Computational Biology

IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):926-933. doi: 10.1109/TCBB.2017.2691329. Epub 2017 Apr 5.

Abstract

Machine learning is an integral part of computational biology, and has already shown its use in various applications, such as prognostic tests. In the last few years in the non-biological machine learning community, ensembling techniques have shown their power in data mining competitions such as the Netflix challenge; however, such methods have not found wide use in computational biology. In this work, we endeavor to show how ensembling techniques can be applied to practical problems, including problems in the field of bioinformatics, and how they often outperform other machine learning techniques in both predictive power and robustness. Furthermore, we develop a methodology of ensembling, Multi-Swarm Ensemble (MSWE) by using multiple particle swarm optimizations and demonstrate its ability to further enhance the performance of ensembles.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Breast Neoplasms / genetics
  • Breast Neoplasms / metabolism
  • Computational Biology / methods*
  • Data Mining / methods*
  • Databases, Genetic
  • Female
  • Gene Expression Profiling
  • Humans
  • Machine Learning*