Abstract
BACKGROUND: Few studies have compared methods to correct for retest effects or practice effects in settings where an acute event could influence test performance, such as major surgery. Our goal in this study was to evaluate the use of different methods to correct for the effects of practice or retest on repeated test administration in the context of an observational study of older adults undergoing elective surgery.
METHODS: In a cohort of older surgical patients (N = 560) and a non-surgical comparison group (N = 118), we compared changes on repeated cognitive testing using a summary measure of general cognitive performance (GCP) between patients who developed post-operative delirium and those who did not. Surgical patients were evaluated pre-operatively and at 1, 2, 6, 12, and 18 months following surgery. Inferences from linear mixed effects models using four approaches were compared: 1) no retest correction, 2) mean-difference correction, 3) predicted-difference correction, and 4) model-based correction.
RESULTS: Using Approaches 1 or 4, which use uncorrected data, both surgical groups appeared to improve or remain stable after surgery. In contrast, Approaches 2 and 3, which dissociate retest and surgery effects by using retest-adjusted GCP scores, revealed an acute decline in performance in both surgical groups followed by a recovery to baseline. Relative differences between delirium groups were generally consistent across all approaches: the delirium group showed greater short- and longer-term decline compared to the group without delirium, although differences were attenuated after 2 months. Standard errors and model fit were also highly consistent across approaches.
CONCLUSION: All four approaches would lead to nearly identical inferences regarding relative mean differences between groups experiencing a key post-operative outcome (delirium) but produced qualitatively different impressions of absolute performance differences following surgery. Each of the four retest correction approaches analyzed in this study has strengths and weakness that should be evaluated in the context of future studies. Retest correction is critical for interpretation of absolute cognitive performance measured over time and, consequently, for advancing our understanding of the effects of exposures such as surgery, hospitalization, acute illness, and delirium.