Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

Morten Aa Petersen; Mogens Groenvold; Jakob B Bjorner; Neil Aaronson; Thierry Conroy; Ann Cull; Peter Fayers; Marianne Hjermstad; Mirjam Sprangers; Marianne Sullivan; European Organisation for Research and Treatment of Cancer Quality of Life Group

doi:10.1023/a:1023488915557

Qual Life Res. 2003 Jun;12(4):373-85.

Authors

Morten Aa Petersen¹, Mogens Groenvold, Jakob B Bjorner, Neil Aaronson, Thierry Conroy, Ann Cull, Peter Fayers, Marianne Hjermstad, Mirjam Sprangers, Marianne Sullivan; European Organisation for Research and Treatment of Cancer Quality of Life Group

Affiliation

¹ Department of Palliative Medicine, Bispebjerg Hospital, Copenhagen, Denmark. map01@bbh.hosp.dk

PMID: 12797710
DOI: 10.1023/a:1023488915557

Abstract

In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages, we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale compared with the original English version. We tested for DIF using two different methods in parallel: a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original, we discuss how findings of DIF can be interpreted and distinguish linguistic DIF from DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.
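
As an illustration of what a contingency-table DIF test can look like, the sketch below computes a Mantel-Haenszel common odds ratio and a continuity-corrected chi-square statistic. The abstract does not say which contingency table method the authors used, so Mantel-Haenszel is an assumption chosen because it is a widely used option, not a reproduction of the paper's analysis. The sketch also assumes the item response has been dichotomized (the QLQ-C30 items themselves have more than two response categories) and that respondents are stratified by a matching variable such as the scale score. The function name `mantel_haenszel_dif`, the helper `expand`, and the group labels "ref" and "focal" are hypothetical, and the counts are made up purely for demonstration.

```python
from collections import defaultdict


def mantel_haenszel_dif(records):
    """Mantel-Haenszel DIF check for one dichotomized item.

    records: iterable of (group, stratum, response) where group is "ref"
    (e.g. the original-language version) or "focal" (a translation),
    stratum is the matching level, and response is 0 or 1.
    Returns (common_odds_ratio, chi_square with continuity correction).
    """
    # One 2x2 table per stratum: rows = ref/focal, columns = response 1/0.
    tables = defaultdict(lambda: [[0, 0], [0, 0]])
    for group, stratum, response in records:
        row = 0 if group == "ref" else 1
        col = 0 if response == 1 else 1
        tables[stratum][row][col] += 1

    num = den = 0.0
    observed = expected = variance = 0.0
    for (a, b), (c, d) in tables.values():
        n = a + b + c + d
        if n < 2:
            continue  # a stratum this small carries no information
        num += a * d / n
        den += b * c / n
        n_ref, n_focal = a + b, c + d
        n_yes, n_no = a + c, b + d
        observed += a
        expected += n_ref * n_yes / n
        variance += n_ref * n_focal * n_yes * n_no / (n * n * (n - 1))

    odds_ratio = num / den if den > 0 else float("inf")
    if variance > 0:
        chi_square = max(0.0, abs(observed - expected) - 0.5) ** 2 / variance
    else:
        chi_square = 0.0
    return odds_ratio, chi_square


def expand(group, stratum, n_yes, n_no):
    """Hypothetical helper: turn cell counts into individual records."""
    return [(group, stratum, 1)] * n_yes + [(group, stratum, 0)] * n_no


# Made-up counts: within each stratum the reference group endorses the
# item more often than the focal group, which is the pattern DIF produces.
demo = (
    expand("ref", "low", 10, 10) + expand("focal", "low", 5, 15)
    + expand("ref", "high", 15, 5) + expand("focal", "high", 10, 10)
)
odds_ratio, chi_square = mantel_haenszel_dif(demo)
print(odds_ratio, chi_square)
```

A common odds ratio near 1 suggests the item behaves similarly in both language groups at the same scale level. The chi-square statistic is compared with a chi-square distribution with one degree of freedom (critical value about 3.84 at the 5% level). A logistic regression approach would instead model the item response on the scale score plus a language indicator and test the indicator's coefficient.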

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adult
Aged
Culture
Female
Health Status Indicators
Humans
Logistic Models
Male
Middle Aged
Reproducibility of Results
Surveys and Questionnaires*
Translating*