What Patients Say: Large-Scale Analyses of Replies to the Parkinson's Disease Patient Report of Problems (PD-PROP)

Connie Marras; Lakshmi Arbatti; Abhishek Hosamath; Amy Amara; Karen E Anderson; Lana M Chahine; Shirley Eberly; Dan Kinel; Sneha Mantri; Soania Mathur; David Oakes; Jennifer L Purks; David G Standaert; Caroline M Tanner; Daniel Weintraub; Ira Shoulson

doi:10.3233/JPD-225083

What Patients Say: Large-Scale Analyses of Replies to the Parkinson's Disease Patient Report of Problems (PD-PROP)

J Parkinsons Dis. 2023;13(5):757-767. doi: 10.3233/JPD-225083.

Authors

Connie Marras¹, Lakshmi Arbatti², Abhishek Hosamath², Amy Amara³, Karen E Anderson⁴, Lana M Chahine⁵, Shirley Eberly⁶, Dan Kinel⁷, Sneha Mantri⁸, Soania Mathur⁹, David Oakes⁶, Jennifer L Purks⁷, David G Standaert⁹, Caroline M Tanner¹⁰, Daniel Weintraub¹¹, Ira Shoulson^{2

7}

Affiliations

¹ Edmond J Safra Program in Parkinson's Disease, University Health Network, University of Toronto, Toronto, Canada.
² Grey Matter Technologies, a Wholly Owned Subsidiary of Modality.ai, San Francisco, CA, USA.
³ Department of Neurology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
⁴ Departments of Psychiatry and Neurology, Georgetown University, Washington DC, USA.
⁵ Department of Neurology, University of Pittsburgh, Pittsburgh, PA, USA.
⁶ Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA.
⁷ Department of Neurology, University of Rochester, Rochester NY, USA.
⁸ Department of Neurology, Duke University, Durham, NC, USA.
⁹ PD Avengers, Toronto, Canada.
¹⁰ Department of Neurology, Weill Institute for Neurosciences, University of California - San Francisco, San Francisco, CA, USA.
¹¹ Departments of Psychiatry and Neurology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.

Abstract

Background: Free-text, verbatim replies in the words of people with Parkinson's disease (PD) have the potential to provide unvarnished information about their feelings and experiences. Challenges of processing such data on a large scale are a barrier to analyzing verbatim data collection in large cohorts.

Objective: To develop a method for curating responses from the Parkinson's Disease Patient Report of Problems (PD-PROP), open-ended questions that asks people with PD to report their most bothersome problems and associated functional consequences.

Methods: Human curation, natural language processing, and machine learning were used to develop an algorithm to convert verbatim responses to classified symptoms. Nine curators including clinicians, people with PD, and a non-clinician PD expert classified a sample of responses as reporting each symptom or not. Responses to the PD-PROP were collected within the Fox Insight cohort study.

Results: Approximately 3,500 PD-PROP responses were curated by a human team. Subsequently, approximately 1,500 responses were used in the validation phase; median age of respondents was 67 years, 55% were men and median years since PD diagnosis was 3 years. 168,260 verbatim responses were classified by machine. Accuracy of machine classification was 95% on a held-out test set. 65 symptoms were grouped into 14 domains. The most frequently reported symptoms at first report were tremor (by 46% of respondents), gait and balance problems (>39%), and pain/discomfort (33%).

Conclusion: A human-in-the-loop method of curation provides both accuracy and efficiency, permitting a clinically useful analysis of large datasets of verbatim reports about the problems that bother PD patients.

Keywords: Parkinson’s disease; Patient-reported outcome; machine learning; measurement.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Aged
Algorithms
Cohort Studies
Female
Humans
Machine Learning
Male
Parkinson Disease* / complications
Parkinson Disease* / diagnosis
Tremor