Interpretation of Thoracic Radiography Shows Large Discrepancies Depending on the Qualification of the Physician: Quantitative Evaluation of Interobserver Agreement in a Representative Emergency Department Scenario

Jan Rudolph; Nicola Fink; Julien Dinkel; Vanessa Koliogiannis; Vincent Schwarze; Sophia Goller; Bernd Erber; Thomas Geyer; Boj Friedrich Hoppe; Maximilian Fischer; Najib Ben Khaled; Maximilian Jörgens; Jens Ricke; Johannes Rueckel; Bastian Oliver Sabel

doi:10.3390/diagnostics11101868

Diagnostics (Basel). 2021 Oct 11;11(10):1868. doi: 10.3390/diagnostics11101868.

Authors

Jan Rudolph¹, Nicola Fink¹ ², Julien Dinkel¹ ² ³, Vanessa Koliogiannis¹, Vincent Schwarze¹, Sophia Goller¹, Bernd Erber¹, Thomas Geyer¹, Boj Friedrich Hoppe¹, Maximilian Fischer⁴, Najib Ben Khaled⁵, Maximilian Jörgens⁶, Jens Ricke¹, Johannes Rueckel¹, Bastian Oliver Sabel¹

Affiliations

¹ Department of Radiology, University Hospital, LMU Munich, Marchioninistr. 15, 81377 Munich, Germany.
² Comprehensive Pneumology Center (CPC-M), German Center for Lung Research, Max-Lebsche-Platz 31, 81377 Munich, Germany.
³ Department of Radiology, Asklepios Fachklinik München, Robert-Koch-Allee 2, 82131 Gauting, Germany.
⁴ Department of Medicine I, University Hospital, LMU Munich, Marchioninistr. 15, 81377 Munich, Germany.
⁵ Department of Medicine II, University Hospital, LMU Munich, Marchioninistr. 15, 81377 Munich, Germany.
⁶ Department of Orthopaedics and Trauma Surgery, Musculoskeletal University Center Munich (MUM), University Hospital, LMU Munich, Marchioninistr. 15, 81377 Munich, Germany.

Abstract

(1) Background: Chest radiography (CXR) remains a key diagnostic tool in the emergency department (ED). Correct interpretation is essential because some pathologies require urgent treatment. This study quantifies potential discrepancies in CXR interpretation between radiologists and non-radiology physicians in training with ED experience. (2) Methods: Nine physicians with different qualifications (three board-certified radiologists [BCR], three radiology residents [RR], and three non-radiology residents working in the ED [NRR]) evaluated a series of 563 posteroanterior CXR images by rating their suspicion for four relevant pathologies: pleural effusion, pneumothorax, pneumonia, and pulmonary nodules. Readings were recorded separately for each hemithorax on a Likert scale (0-4; 0: no suspicion of pathology, 4: pathology definitely present), adding up to a total of 40,536 reported suspicion ratings (563 images × 2 hemithoraces × 4 pathologies × 9 readers). Interrater reliability/correlation analyses and Kruskal-Wallis tests were performed for statistical analysis. (3) Results: While interrater reliability was good among radiologists, major discrepancies between radiologists' and non-radiologists' readings were observed for all pathologies. Overall interrater agreement was highest for pneumothorax detection and lowest for raising suspicion of nodules suspicious for malignancy. Pleural effusion and pneumonia were often rated with intermediate scores (1-3), whereas for pneumothorax all readers mainly chose a definite score (0 or 4). Interrater reliability was usually higher for the right hemithorax (all pathologies except pneumothorax). (4) Conclusions: Quantified analysis of CXR interrater reliability reveals general uncertainty and a strong dependence on medical training. NRR can benefit from radiology reporting in terms of time efficiency and diagnostic accuracy. CXR evaluation by ED specialists with long-term training has not been tested.

Keywords: chest radiography; clinicians; emergency department; interrater reliability; radiologists.
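
The rating count and the group comparison named in the Methods can be illustrated with a short sketch. The first part reproduces the total of 40,536 ratings from the figures stated in the abstract. The second part is a plain-Python Kruskal-Wallis H statistic with the standard tie correction; it is not the authors' analysis code, the function names are hypothetical, and the example Likert scores are invented for illustration only and are not study data.

```python
from collections import Counter

# Figures stated in the abstract.
N_IMAGES = 563
N_HEMITHORACES = 2
N_PATHOLOGIES = 4
N_READERS = 9

total_ratings = N_IMAGES * N_HEMITHORACES * N_PATHOLOGIES * N_READERS


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Tie-corrected Kruskal-Wallis H statistic for a list of samples."""
    pooled = [v for group in groups for v in group]
    n = len(pooled)
    ranks = average_ranks(pooled)

    weighted_rank_sums = 0.0
    start = 0
    for group in groups:
        rank_sum = sum(ranks[start:start + len(group)])
        weighted_rank_sums += rank_sum ** 2 / len(group)
        start += len(group)
    h = 12.0 / (n * (n + 1)) * weighted_rank_sums - 3 * (n + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (n^3 - n).
    tie_term = sum(c ** 3 - c for c in Counter(pooled).values())
    correction = 1 - tie_term / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all values are identical; H is undefined")
    return h / correction


# Invented Likert scores (0-4) for one pathology; NOT data from the study.
hypothetical_scores = {
    "BCR": [0, 0, 4, 4, 0, 4, 0, 0],
    "RR": [0, 1, 4, 3, 0, 4, 0, 1],
    "NRR": [1, 2, 3, 2, 1, 3, 2, 2],
}

if __name__ == "__main__":
    print("total ratings:", total_ratings)
    print("H statistic:", kruskal_wallis_h(list(hypothetical_scores.values())))
```

Under the null hypothesis, H is compared against a chi-squared distribution with (number of groups − 1) degrees of freedom; a statistics library would normally supply the p-value.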