Multicenter assessment of reliability of cranial MRI

M Ewers; S J Teipel; O Dietrich; S O Schönberg; F Jessen; R Heun; P Scheltens; L van de Pol; N R Freymann; H-J Moeller; H Hampel

doi:10.1016/j.neurobiolaging.2005.05.032

Multicenter assessment of reliability of cranial MRI

Neurobiol Aging. 2006 Aug;27(8):1051-9. doi: 10.1016/j.neurobiolaging.2005.05.032. Epub 2005 Sep 15.

Authors

M Ewers¹, S J Teipel, O Dietrich, S O Schönberg, F Jessen, R Heun, P Scheltens, L van de Pol, N R Freymann, H-J Moeller, H Hampel

Affiliation

¹ Department of Psychiatry, Dementia and Neuroimaging Section, Alzheimer Memorial Center D2, Ludwig Maximilian University of Munich, Nussbaumstr. 7, 80336 Munich, Germany.

PMID: 16169126
DOI: 10.1016/j.neurobiolaging.2005.05.032

Abstract

Clinical utility of magnetic resonance imaging (MRI) for the diagnosis and assessment of neurodegenerative diseases may depend upon the reliability of MRI measurements, especially when applied within a multicenter context. In the present study, we assessed the reliability of MRI through a phantom test at a total of eleven clinics. Performance and entry criteria were defined liberally in order to support generalizability of the results. For manual hippocampal volumetry, automatic segmentation of brain compartments and voxel-based morphometry, multicenter variability was assessed on the basis of MRIs of a single subject scanned at ten of the eleven sites. In addition, cranial MRI scans obtained from 73 patients with Alzheimer's disease (AD) and 76 patients with mild cognitive impairment were collected at subset of six centers to assess differences in grey matter volume. Results show that nine out of eleven centers tested met the reliability criteria of the phantom test, where two centers showed aberrations in spatial resolution, slice thickness and slice position. The coefficient of variation was 3.55% for hippocampus volumetry, 5.02% for grey matter, 4.87% for white matter and 4.66% for cerebrospinal fluid (CSF). The coefficient of variation was 12.81% (S.D.=9.06) for the voxel intensities within grey matter and 8.19% (S.D.=6.9) within white matter. Power analysis for the detection of a difference in the volumes of grey matter between AD and MCI patients across centers (d=0.42) showed that the total sample size needed is N=180. In conclusion, despite minimal inclusion criteria, the reliability of MRI across centers was relatively good.

Publication types

Multicenter Study
Research Support, Non-U.S. Gov't

MeSH terms

Adult
Alzheimer Disease / diagnosis*
Brain / pathology*
Cognition Disorders / diagnosis*
Female
Germany
Humans
Imaging, Three-Dimensional / methods*
Magnetic Resonance Imaging / methods*
Male
Netherlands
Phantoms, Imaging
Reproducibility of Results
Sensitivity and Specificity