Participant flow diagrams for health equity in AI

Jacob G Ellen; João Matos; Martin Viola; Jack Gallifant; Justin Quion; Leo Anthony Celi; Nebal S Abu Hussein

doi:10.1016/j.jbi.2024.104631

J Biomed Inform. 2024 Apr:152:104631. doi: 10.1016/j.jbi.2024.104631. Epub 2024 Mar 27.

Authors

Jacob G Ellen¹, João Matos², Martin Viola³, Jack Gallifant⁴, Justin Quion⁵, Leo Anthony Celi⁶, Nebal S Abu Hussein⁷

Affiliations

¹ Harvard Medical School, Boston, MA, USA. Electronic address: jellen@hms.harvard.edu.
² Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA; Faculty of Engineering, University of Porto, Porto, Portugal; Institute for Systems and Computer Engineering, Technology and Science (INESCTEC), Porto, Portugal.
³ Harvard Medical School, Boston, MA, USA.
⁴ Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Critical Care, Guy's and St Thomas' NHS Trust, London, United Kingdom.
⁵ University of the East Ramon Magsaysay Memorial Medical School, Quezon City, Philippines.
⁶ Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.
⁷ Pulmonary, Critical Care & Sleep Medicine, Yale School of Medicine, CT, USA.

PMID: 38548006

Abstract

Selection bias can arise through many aspects of a study, including recruitment, inclusion/exclusion criteria, input-level exclusion and outcome-level exclusion, and often reflects the underrepresentation of populations historically disadvantaged in medical research. The effects of selection bias can be further amplified when non-representative samples are used in artificial intelligence (AI) and machine learning (ML) applications to construct clinical algorithms. Building on the "Data Cards" initiative for transparency in AI research, we advocate for the addition of a participant flow diagram for AI studies detailing relevant sociodemographic and/or clinical characteristics of excluded participants across study phases, with the goal of identifying potential algorithmic biases before their clinical implementation. We include both a model for this flow diagram and a brief case study explaining how it could be implemented in practice. Through standardized reporting of participant flow diagrams, we aim to better identify potential inequities embedded in AI applications, facilitating more reliable and equitable clinical algorithms.
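
As a rough illustration of the kind of tally such a flow diagram could report, the Python sketch below applies exclusion steps in order and counts, for one sociodemographic attribute, who is dropped at each step and who remains. It is not the authors' model or case study. The function name `participant_flow`, the field names (`sex`, `age`, `lactate`), the two exclusion steps, and the six toy records are all hypothetical and were invented for this example.

```python
from collections import Counter


def participant_flow(cohort, steps, attribute):
    """Apply exclusion steps in order and tabulate one attribute at each step.

    cohort:    list of dicts, one per participant (toy records here)
    steps:     list of (label, keep_predicate) pairs, applied sequentially
    attribute: key to break counts down by (hypothetical, e.g. "sex")
    """
    remaining = list(cohort)
    flow = [{
        "step": "Initial cohort",
        "n": len(remaining),
        "remaining": dict(Counter(p[attribute] for p in remaining)),
        "excluded": {},
    }]
    for label, keep in steps:
        kept = [p for p in remaining if keep(p)]
        dropped = [p for p in remaining if not keep(p)]
        flow.append({
            "step": label,
            "n": len(kept),
            "remaining": dict(Counter(p[attribute] for p in kept)),
            "excluded": dict(Counter(p[attribute] for p in dropped)),
        })
        remaining = kept
    return flow


# Synthetic records for illustration only; not data from any study.
cohort = [
    {"id": 1, "sex": "F", "age": 34, "lactate": 1.2},
    {"id": 2, "sex": "M", "age": 17, "lactate": 2.0},
    {"id": 3, "sex": "F", "age": 70, "lactate": None},
    {"id": 4, "sex": "M", "age": 55, "lactate": 3.1},
    {"id": 5, "sex": "F", "age": 48, "lactate": None},
    {"id": 6, "sex": "M", "age": 62, "lactate": 0.9},
]

# Hypothetical steps: an inclusion criterion, then an input-level exclusion.
steps = [
    ("Age >= 18", lambda p: p["age"] >= 18),
    ("Lactate recorded", lambda p: p["lactate"] is not None),
]

flow = participant_flow(cohort, steps, attribute="sex")
for row in flow:
    print(row["step"], "| n =", row["n"],
          "| remaining:", row["remaining"],
          "| excluded:", row["excluded"])
```

In this toy cohort the missing-input step removes only female participants. A count of total exclusions would hide that imbalance, while a per-group breakdown at each step makes it visible.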

Keywords: Data cards; Flow diagram; Health equity; Machine learning; Selection bias.

MeSH terms

Algorithms
Artificial Intelligence
Biomedical Research*
Health Equity*
Humans
Machine Learning