Feature selection methods on gene expression microarray data for cancer classification: A systematic review

Comput Biol Med. 2022 Jan:140:105051. doi: 10.1016/j.compbiomed.2021.105051. Epub 2021 Nov 23.

Abstract

This systematic review provides researchers interested in feature selection (FS) for processing microarray data with comprehensive information about the main research directions for gene expression classification conducted during the recent seven years. A set of 132 researches published by three different publishers is reviewed. The studied papers are categorized into nine directions based on their objectives. The FS directions that received various levels of attention were then summarized. The review revealed that 'propose hybrid FS methods' represented the most interesting research direction with a percentage of 34.9%, while the other directions have lower percentages that ranged from 13.6% down to 3%. This guides researchers to select the most competitive research direction. Papers in each category are thoroughly reviewed based on six perspectives, mainly: method(s), classifier(s), dataset(s), dataset dimension(s) range, performance metric(s), and result(s) achieved.

Keywords: Embedded techniques; Ensemble; Feature selection; Filters; Hybrid; Wrappers.