Self-reported exposure data in epidemiologic studies: measurement errors and missing values
Appears in the following Collection
AbstractIn observational epidemiology many studies rely on self-reported exposure information, which is prone to measurement errors and missing values. This may subsequently lead to bias in relative risk estimates of disease or other outcome measures. Therefore, it is important to assess data quality. The subject of this thesis in epidemiology was measurement error and missing data analysis with emphasis on self-reported data on diet and melanoma risk factors. The data were collected by postal health- and lifestyle questionnaires used in the Norwegian Women and Cancer Study (NOWAC), an ongoing population-based cohort study. The aspects of measurement error studied were reproducibility and recall bias, based on two separate data collections: a test-retest study (3 month time interval) in a sample of 1,995 women, and a nested case-control study of 208 cases of malignant melanoma and 2,080 matched controls. Reproducibility was studied for food frequency questionnaire (FFQ) data, and melanoma risk factors. The effect of different imputation methods for missing values in FFQ data was also investigated. A common procedure for microarray data, k nearest neighbors imputation (KNN), was adapted and applied to FFQ data. Recall bias was studied for melanoma risk factors. The observed level of reproducibility for the FFQ data was within the reported range for similar questionnaires, but may attenuate disease risk estimates, as illustrated in an example using regression calibration. Comparing different imputation methods, the largest differences were seen for KNN compared to the null value (no consumption), and for food items with a high proportion of missing (>30%). The overall reproducibility of melanoma risk factors was found to be acceptable. Indications of recall bias in the reporting of melanoma risk factors were found, but the results were not consistent for the different exposures.
List of papers
|I. Parr CL, Veierod MB, Laake P, Lund E, Hjartaker A. Test-retest reproducibility of a food frequency questionnaire (FFQ) and estimated effects on disease risk in the Norwegian Women and Cancer Study (NOWAC). Nutrition Journal 2006;5:4. The paper is not available in DUO. The published version is available at: https://doi.org/10.1186/1475-2891-5-4|
|II. Parr CL, Hjartaker A, Scheel I, Lund E, Laake P, Veierod MB. Comparing methods for handling missing values in food-frequency questionnaires and proposing k nearest neighbours imputation: effects on dietary intake in the Norwegian Women and Cancer study (NOWAC). Public Health Nutrition 2008;11:361-70. The paper is not available in DUO. The published version is available at: https://doi.org/10.1017/S1368980007000365|
|III. Veierod MB, Parr CL, Lund E, Hjartaker A. Reproducibility of self-reported melanoma risk factors in a large cohort study of Norwegian women. Melanoma Research 2008;18:1-9. The paper is not available in DUO. The published version is available at: https://doi.org/10.1097/CMR.0b013e3282f120d2|
|IV. Parr CL, Hjartaker A, Laake P, Lund E, Veierod MB. Recall bias in melanoma risk factors and measurement error effects: a nested case-control study within the Norwegian Women and Cancer Study. American Journal of Epidemiology 2008; (resubmitted). The paper is not available in DUO.|