Assessing the Performance of Machine Learning Methods Trained on Public Health Observational Data: A Case Study From COVID-19

Davide Pigoli; Kieran Baker; Jobie Budd; Lorraine Butler; Harry Coppock; Sabrina Egglestone; Steven G Gilmour; Chris Holmes; David Hurley; Radka Jersakova; Ivan Kiskin; Vasiliki Koutra; Jonathon Mellor; George Nicholson; Joe Packham; Selina Patel; Richard Payne; Stephen J Roberts; Björn W Schuller; Ana Tendero-Cañadas; Tracey Thornley; Alexander Titcomb

doi:10.1002/sim.10211

Assessing the Performance of Machine Learning Methods Trained on Public Health Observational Data: A Case Study From COVID-19

Stat Med. 2024 Nov 10;43(25):4861-4871. doi: 10.1002/sim.10211. Epub 2024 Sep 5.

Authors

Davide Pigoli^{1

2}, Kieran Baker^{1

2}, Jobie Budd³, Lorraine Butler⁴, Harry Coppock^{2

5}, Sabrina Egglestone⁴, Steven G Gilmour^{1

2}, Chris Holmes^{2

6}, David Hurley⁴, Radka Jersakova², Ivan Kiskin⁷, Vasiliki Koutra^{1

2}, Jonathon Mellor⁴, George Nicholson^{2

6}, Joe Packham⁴, Selina Patel^{3

4}, Richard Payne⁴, Stephen J Roberts⁸, Björn W Schuller^{2

5}, Ana Tendero-Cañadas^{4

9}, Tracey Thornley¹⁰, Alexander Titcomb⁴

Affiliations

¹ Department of Mathematics, King's College London, UK.
² The Alan Turing Institute, London, UK.
³ Division of Medicine, University College London, UK.
⁴ UK Health Security Agency, London, UK.
⁵ Group on Language Audio & Music, Imperial College London, UK.
⁶ Department of Statistics, University of Oxford, UK.
⁷ Centre for Vision, Speech and Signal Processing, University of Surrey, UK.
⁸ Department of Engineering Science, University of Oxford, UK.
⁹ Centre for Lifelong Health, University of Brighton, UK.
¹⁰ Pharmacy Practice and Policy Division, University of Nottingham, UK.

PMID: 39237100
DOI: 10.1002/sim.10211

Abstract

From early in the coronavirus disease 2019 (COVID-19) pandemic, there was interest in using machine learning methods to predict COVID-19 infection status based on vocal audio signals, for example, cough recordings. However, early studies had limitations in terms of data collection and of how the performances of the proposed predictive models were assessed. This article describes how these limitations have been overcome in a study carried out by the Turing-RSS Health Data Laboratory and the UK Health Security Agency. As part of the study, the UK Health Security Agency collected a dataset of acoustic recordings, SARS-CoV-2 infection status and extensive study participant meta-data. This allowed us to rigorously assess state-of-the-art machine learning techniques to predict SARS-CoV-2 infection status based on vocal audio signals. The lessons learned from this project should inform future studies on statistical evaluation methods to assess the performance of machine learning techniques for public health tasks.

Keywords: UK COVID‐19 vocal audio dataset; bioacoustic markers; choice of test set; confounding; matching.

MeSH terms

COVID-19* / epidemiology
Humans
Machine Learning*
Public Health
SARS-CoV-2
United Kingdom

Abstract

MeSH terms

Grants and funding