International Epidemiology Databases to Evaluate AIDS

Publication

Author(s):

Schomaker M, Hogger S, Johnson LF, Hoffmann CJ, Bärnighausen T, Heumann C.

Pub Title:

Simultaneous Treatment of Missing Data and Measurement Error in HIV Research Using Multiple Overimputation.

Pub Date:

Sep 1 2015

Pub Region(s):

Southern Africa

Page Number:
628-636

Journal:

Title: Epidemiology
Link: http://journals.lww.com/epidem/pages/articleviewer.aspx?year=2015&issue=09000&article=00002&type=abstract

PubMed: 26214336

BACKGROUND:

Both CD4 count and viral load in HIV-infected persons are measured with error. There is no clear guidance on how to deal with this measurement error in the presence of missing data.

METHODS:

We used multiple overimputation, a method recently developed in the political sciences, to account for both measurement error and missing data in CD4 count and viral load measurements from four South African cohorts of a Southern African HIV cohort collaboration. Our knowledge about the measurement error of ln CD4 and log10 viral load is part of an imputation model that imputes both missing and mismeasured data. In an illustrative example, we estimate the association of CD4 count and viral load with the hazard of death among patients on highly active antiretroviral therapy by means of a Cox model. Simulation studies evaluate the extent to which multiple overimputation is able to reduce bias in survival analyses.
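The core idea can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it assumes a simple normal model: a mismeasured value is treated as a noisy observation with a known error standard deviation and is combined, by precision weighting, with a prediction from the imputation model, while a missing value is drawn from the model alone. Estimates from the resulting datasets are then pooled with Rubin's rules, as in ordinary multiple imputation. The function names `overimpute_draws` and `pool_rubin` and all numbers are hypothetical.

```python
import numpy as np


def overimpute_draws(w, sd_error, mu_model, sd_model, m, rng):
    """Draw m plausible true values for a single cell.

    w         : observed (possibly mismeasured) value, or NaN if missing
    sd_error  : assumed measurement-error SD (> 0); ignored if w is NaN
    mu_model  : imputation-model prediction for the cell (hypothetical input)
    sd_model  : imputation-model predictive SD (> 0)
    """
    if np.isnan(w):
        # Missing cell: no measurement information, draw from the model only.
        return rng.normal(mu_model, sd_model, size=m)
    # Mismeasured cell: precision-weighted combination of the observation
    # and the model prediction (normal-normal posterior).
    prec_obs = 1.0 / sd_error**2
    prec_mod = 1.0 / sd_model**2
    post_var = 1.0 / (prec_obs + prec_mod)
    post_mean = post_var * (prec_obs * w + prec_mod * mu_model)
    return rng.normal(post_mean, np.sqrt(post_var), size=m)


def pool_rubin(estimates, variances):
    """Pool m point estimates and their variances with Rubin's rules.

    Returns the pooled estimate and its total variance.
    """
    est = np.asarray(estimates, dtype=float)
    var = np.asarray(variances, dtype=float)
    m = est.size
    pooled = est.mean()
    within = var.mean()            # average within-imputation variance
    between = est.var(ddof=1)      # between-imputation variance
    total = within + (1.0 + 1.0 / m) * between
    return pooled, total


rng = np.random.default_rng(0)

# Hypothetical ln CD4 cell: observed 5.0 with assumed error SD 0.3,
# model prediction 5.4 with predictive SD 0.6.
mismeasured = overimpute_draws(5.0, 0.3, 5.4, 0.6, m=5, rng=rng)
missing = overimpute_draws(np.nan, 0.3, 5.4, 0.6, m=5, rng=rng)

# Hypothetical log hazard ratios and variances from 5 overimputed datasets.
log_hr, log_hr_var = pool_rubin(
    [-1.50, -1.62, -1.55, -1.58, -1.49],
    [0.010, 0.011, 0.009, 0.010, 0.012],
)
```

In this sketch a larger assumed error standard deviation pulls the draws towards the model prediction, and a missing value is the limiting case in which the observation carries no information.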

RESULTS:

Multiple overimputation emphasizes more strongly the influence of having high baseline CD4 counts compared to both a complete case analysis and multiple imputation (hazard ratio for >200 cells/mm³ vs. <25 cells/mm³: 0.21 [95% confidence interval: 0.18, 0.24] for multiple overimputation vs. 0.38 [0.29, 0.48] for the complete case analysis and 0.29 [0.25, 0.34] for multiple imputation). Similar results are obtained when varying assumptions about measurement error, when using p-splines, and when evaluating time-updated CD4 count in a longitudinal analysis. The estimates of the association with viral load are slightly more attenuated when using multiple imputation instead of multiple overimputation. Our simulation studies suggest that multiple overimputation is able to reduce bias and mean squared error in survival analyses.

CONCLUSIONS:

Multiple overimputation, which can be used with existing software, offers a convenient approach to account for both missing and mismeasured data in HIV research.

Citation:

Schomaker M, Hogger S, Johnson LF, Hoffmann CJ, Bärnighausen T, Heumann C. Simultaneous Treatment of Missing Data and Measurement Error in HIV Research Using Multiple Overimputation. Epidemiology. 2015 Sep;26(5):628-36. doi: 10.1097/EDE.0000000000000334. PubMed PMID: 26214336.