Abstract
Much of epidemiology and clinical medicine is focused on estimating the effects of treatments or interventions administered over time. In such settings of longitudinal treatment, time-dependent confounding is often an important source of bias. Marginal structural models (MSMs) are a powerful tool for estimating the causal effect of a treatment using observational data, particularly when time-dependent confounding is present. In recent statistical work, van der Laan et al. presented a generalized form of MSMs called “history-adjusted” MSMs (Int J Biostat 2005;1:article 4). Unlike standard MSMs, history-adjusted MSMs can be used to estimate modification of treatment effects by time-varying covariates. Estimation of time-dependent causal effect modification is frequently of great practical relevance. For example, clinical researchers are often interested in how the prognostic significance of a biomarker for treatment response can change over time. This article provides a practical introduction to the implementation and interpretation of history-adjusted MSMs. The method is illustrated using a clinical question drawn from the treatment of human immunodeficiency virus infection. Observational cohort data from San Francisco, California, collected between 2000 and 2004, are used to estimate the effect of time until switching antiretroviral therapy regimens among patients receiving a nonsuppressive regimen and how this effect differs depending on CD4-positive T-lymphocyte count.
Keywords: antiretroviral therapy, highly active, causality, confounding factors (epidemiology), HIV, longitudinal studies, observational data, structural model, time-dependent covariate
BACKGROUND AND MOTIVATION
Marginal structural models
Marginal structural models (MSMs), introduced by Robins et al. (1–3), represent a major advance in statistical methodology for causal inference. In epidemiologic and clinical research, longitudinal data on treatment and covariates are often collected from cohorts of subjects. When treatment status can change over time, conventional analytic approaches (such as multivariable regression) often fail to provide valid causal inferences about treatment effects. MSMs address this well-recognized problem of time-dependent confounding. As a result, in many research contexts, MSMs are the best available analytic method.
MSMs have traditionally been restricted to the estimation of the causal effects of treatment or exposure conditional on baseline covariates (2). Thus, it has been possible to use this method to address questions such as “What is the effect of a treatment and how does it differ between study members with different covariate values at a given time point, such as entry to the study?” but not “How does the effect of a treatment differ as a result of changing values of a covariate over the course of the study?”. In other words, it has not been possible to use MSMs to estimate modification of causal effects by time-dependent covariates. In many research settings, estimation of time-dependent effect modification is of major interest. In clinical settings in particular, treatment decisions are often modified over time as a result of changing values for a patient’s covariates. Estimation of how past values of these covariates modify the future causal effect of a treatment has important implications for understanding the mechanistic action of the treatment, as well as for guiding clinical decision-making.
In this paper, we provide a practical introduction to history-adjusted marginal structural models (HA-MSMs), a type of MSM that allows estimation of effect modification by time-varying covariates. HA-MSMs are based on the concept, first introduced by Joffe et al. (4, 5) and implemented by Feldman et al. (6), of assuming a standard MSM at each time point during a study. Covariates and treatment history up to that time point are treated as baseline data, and the causal effect of treatment after that time point is estimated. Notably, HA-MSMs can assume a common model across time points, rather than simply assuming a separate MSM at each time point. HA-MSMs provide an alternative to structural nested models, an approach proposed by Robins (7) for the estimation of effect modification by time-dependent covariates.
Van der Laan et al. (8) recently established the formal statistical theory behind HA-MSMs. In this article, we use a practical application to provide a nontechnical introduction to HA-MSMs, including the underlying assumptions, their implementation using standard software, and the interpretation of results. Using observational cohort data from human immunodeficiency virus (HIV)-infected persons with detectable viremia on antiretroviral therapy, we estimate the effect of time to treatment modification on future CD4-positive T-lymphocyte (CD4 T-cell) count and how this effect differs given current CD4 T-cell count.
Antiretroviral therapy: when to switch?
HIV evolves rapidly in the presence of selective pressure. This leads to the accumulation of mutations that confer “resistance” to antiretroviral drugs. Therefore, the optimal manner in which to avoid the rapid emergence of resistance-associated mutations is to completely suppress viral replication. This can be achieved in many patients with standard three-drug combination regimens (9). However, a substantial proportion of treated patients fail to achieve complete viral suppression. Such patients are often switched to a new treatment regimen, but this can lead to the use of all available therapeutic options. Since many patients with incomplete viral suppression continue to do well immunologically (and therefore clinically) (10–12), some clinicians choose not to switch them to a new regimen as long as their CD4 T-cell counts remain elevated. Hence, clinicians are often faced with a dilemma in patients with detectable viremia who are on antiretroviral therapy: Should therapy be switched as soon as possible, thereby risking using up all available drugs quickly and exposing patients to increasingly complicated and potentially toxic regimens? Or should patients be maintained on a partially suppressive regimen as long as they are doing well immunologically and clinically, even though this approach will allow the ongoing accumulation of drug-resistance mutations that can limit future therapeutic options? (For a review of this issue, see the paper by Deeks (13).)
In this article, we estimate the effect of nonsuppressive therapy on future CD4 T-cell count and estimate how this outcome differs depending on a patient’s current CD4 T-cell count and time since virologic failure. Data for these analyses were drawn from the Study on the Consequences of the Protease Inhibitor Era (SCOPE), an observational study of a cohort of HIV-infected patients in San Francisco, California. Participants in SCOPE are seen at 4-month intervals. At each study visit, they complete interviewer- and self-administered questionnaires examining numerous domains, including socioeconomic status (housing, income, employment), antiretroviral medication use and adherence, occurrence of opportunistic infection or malignancy, and recreational drug use. Plasma HIV RNA levels and CD4/CD8 T-cell counts are measured at each visit, as well as between visits, according to clinician discretion. Importantly, decisions as to when and how to modify therapy are made by primary care providers.
We retrospectively identified all subjects who experienced virologic failure while being observed in SCOPE. Subjects became eligible for our analyses (t = 0) if they failed to achieve an undetectable HIV RNA level (<75 copies/ml) by week 24 on a new regimen or if they rebounded from an undetectable level. The exposure of interest was time to modification of the antiretroviral regimen, defined as switching or interrupting the use of at least one drug. This exposure was summarized as a binary variable at each time point, indicating whether or not a subject had switched from his original non-suppressive antiretroviral therapy regimen. Subjects were only allowed to switch once in our analyses. The method can be easily extended, however, to encompass more complex treatment patterns.
Below, we rely on this data structure to illustrate our method. We then present the results of our analyses and discuss their clinical significance.
THE METHOD
The counterfactual framework
The causal effect of a treatment on an individual can be defined as the difference between that person’s outcomes with and without the treatment. Such outcomes are termed “counterfactual,” because only one outcome is observed for each individual. MSMs are models of how the population distribution of these counterfactual outcomes changes as a result of changes in treatment.
We begin by introducing some standard notation. Treatment over the course of the study (t = 0, …, K) is denoted Ā(K) = (A(0), …, A(K)), and covariates are denoted L̄ (K + 1), where treatment occurs after covariates are measured at a given time point and K + 1 is the end of follow-up. In our HIV example, Ā(K) is a vector of binary variables, equal to 1 for each time point up to the point at which a subject switches therapy and 0 thereafter. L̄ā (K + 1) denotes the counterfactual CD4 T-cell count and other covariates that would have been observed over time if the subject had switched therapy at the time implied by Ā = ā. The set of these counterfactuals under each possible treatment history (switch time) is the full data set, X = (L̄ā (K + 1), ā ∈ A), where A denotes the set of possible switch times. The outcome for a given time point t is the counterfactual CD4 T-cell count measured 8 months in the future (m = 8) under the switching time indicated by ā, denoted Yā (t + m).
If we observed the counterfactual CD4 T-cell counts for each subject under each possible switch time, we could estimate the causal effect of waiting to switch therapy by simply comparing the counterfactual outcomes under different switch times. However, we only observe the CD4 T-cell counts for each person under a single (nonrandom) switch time. As a result, in order to estimate the effect of time to switching treatments on CD4 T-cell count using the observed data, we must assume that the covariates we measured are sufficient to control for confounding. For example, within strata defined by our measured confounders, there must be no unmeasured variables that predict, at any time point, both the probability of switching treatment and CD4 T-cell count 8 months in the future. More formally, this assumption is stated as the sequential randomization assumption, which says that the treatment assignment at each time point depends only on the observed past:
where g(A(t)|X) = P(A(t)|X). Under the sequential randomization assumption, the treatment mechanism, which describes how treatment is assigned at each time point, can be written
In addition, we assume experimental treatment assignment. Experimental treatment assignment states that treatment is not assigned deterministically on the basis of a patient’s covariates. For instance, regardless of her covariate values, a patient who has not yet switched treatments must have a positive probability of both switching and not switching treatments at each time point.
Under these assumptions, we could use standard MSMs to ask, for example, “At baseline (when virologic failure occurs), how does time until switching to a new regimen affect CD4 T-cell count 8 months later? How does this effect differ depending on a patient’s CD4 T-cell count at baseline?” However, an MSM assumed at a single time point does not allow us to estimate how the effect of future non-suppressive therapy may change as a result of changes in a patient’s CD4 T-cell count. HA-MSMs directly address this question.
History-adjusted MSMs
HA-MSMs rely on the identical causal framework as standard MSMs but estimate a different parameter of interest. HA-MSMs assume a standard MSM at each time point during the study and model the counterfactual outcome indexed by treatment that occurs after that time point, conditional on some subset of the observed history up to that time point. Importantly, HA-MSMs allow the use of a common model across time points. In other words, HA-MSMs model some parameter of the counterfactual outcome if the study population were to follow their observed treatment history up to time j, followed by a specified counterfactual future treatment history until the outcome is measured, conditional on a subset of (possibly time-varying) covariates and/or treatment history measured before time j. In this article, we focus on HA-MSMs concerned with the mean of these counterfactual outcomes; however, the same framework can be readily adopted to model any other parameter.
We denote a future longitudinal treatment regimen, beginning at time j and continuing until the outcome is measured m time points later, as a(j, j + m − 1) ≡ (a(j), a(j + 1), …a(j + m − 1)), for j = 0, …, K + 1 − m. The effect modifiers of interest are denoted V(j) ⊂ (L̄ (j), Ā (j − 1)), a subset of a subject’s treatment and covariate history up to time j. For each time point in the study for which the outcome m time points later is defined, HA-MSMs model the expectation of the counterfactual outcome YĀ(j−1),a(j, j+m−1)(j + m), conditional on V(j), under each possible future treatment regimen. Thus, HA-MSMs are concerned with estimation of the following parameter: E(YĀ(j−1),a(j, j+m−1)(j + m)|V(j)).
Applied to our example, future antiretroviral treatment from time j until the outcome is measured, denoted a(j, j + m − 1), consists of a vector of counterfactual treatment decisions a(j), …a(j + m − 1), where a(t) = 0 if a subject is assigned to switch treatment at or before time t; otherwise, a(t) = 1. This vector of future treatment decisions exists for each subject beginning at each time point j = 0, …, K + 1 − m. We summarize a(j, j + m − 1) as , which represents the future time (beginning at time point j) that the subject will spend on his/her original failing therapy before either treatment is switched or the outcome is measured. The current CD4 T-cell count at time j is denoted CD4(j), a subset of the full covariate history measured over time, L̄ (j). For each time point j, we are interested in the mean counterfactual CD4 T-cell count that would exist 8 months later (m = 8) among persons who had not yet switched therapy if they were to switch therapy at a specified counterfactual time after j.
To address this question, we might assume the following model:
(1) |
In other words, we might assume that, among persons who have not yet switched treatment (Ā (j − 1) = 1), the counterfactual CD4 T-cell count 8 months later depends on additional time until switching (c(j)), but the magnitude of this effect differs depending on the amount of time a patient has already spent on nonsuppressive therapy (j) and current CD4 T-cell count (CD4(j)).
This model allows us to estimate the effect of each additional month until treatment modification on future CD4 T-cell count, among patients who have been on their current nonsuppressive therapy for different durations and have different current CD4 T-cell counts. For example, by testing whether β4 = 0, we are testing the hypothesis that a subject’s current CD4 T-cell count modifies the effect of future time until switching.
Inverse-probability-of-treatment-weighted estimation
Several HA-MSM estimators are available (8); here, we focus on the inverse-probability-of-treatment-weighted (IPTW) estimator, which can be implemented with standard software using weighted least squares. For each time point j in the study, each subject receives a weight which is, informally, the inverse of that subject’s probability of receiving the treatment that she actually received, from time point j until the outcome is measured. If a subject has a longitudinal treatment regimen beginning at time point j that occurs frequently in the data among subjects with her covariate and treatment history, she receives a small j-specific weight. In contrast, if the subject has an unusual longitudinal treatment regimen given her covariates, the subject will receive a large weight. For example, patients whose CD4 T-cell counts have recently declined may be more likely to switch therapy. In this case, a subject who did not switch therapy despite a recent decline in CD4 T-cell count would receive a large weight.
The first step in implementing the IPTW estimator is to model the treatment mechanism or, in other words, to fit a predictive model of treatment at each time point t, given the observed past up to that time point: g(A(t)|Ā (t − 1), L̄ (t)), t = 0, …, K. For example, we can model the treatment decision (to switch therapy or not) made at every time point using logistic regression. Recall that once a subject switches therapy, he is no longer at risk of switching in the future. Thus, when fitting our model of the probability of staying on a certain regimen at a given time point (A(t) = 1), we fit the model only among subjects who have not already switched before that time point (A(t − 1) = 1). Note that, for the IPTW estimator to be consistent, the estimate of the treatment mechanism must be consistent and the covariates included in the model must be sufficient to control for confounding.
For each time point j = 0, …, K + 1 − m, the model of the treatment mechanism is used to estimate the denominator of the j-specific weight:
For subjects who do not switch therapy before the outcome is measured m months later, the denominator of the j-specific weight is
For subjects who have not switched therapy by time j but switch at some point T = j + C(j) before the outcome is measured (C(j) < m), the denominator of the j-specific weight is
Recall that subjects who have already switched therapy by time j do not contribute to our counterfactuals of interest.
The choice of a numerator for the weights will not affect the consistency of the IPTW estimator, as long as the numerator is only a function of treatment history and baseline covariates at time j. One common approach is to use “stabilized” weights (2). As applied to HA-MSMs, this approach involves estimating the following numerator for each j-specific weight:
where Ā(j,l − 1) = (A(j), …,A(l − 1)). The numerator can be estimated by simply fitting a model of the treatment decision at each time point, analogous to the model of the treatment mechanism fit for the denominator, but now including only those covariates contained in the effect modifiers of interest at time j.
Note that the same subject will have a separate weight for each time point j in the study, with denominators corresponding to the probability that the subject received her observed treatment from that time point j until the outcome is measured. Once each subject has been assigned a set of K + 1 − m weights, a weighted least-squares regression analysis is conducted using standard software, with each subject contributing K + 1 − m weighted lines of data. Because each subject contributes multiple lines of data, an approach such as the nonparametric bootstrap or robust standard error estimation using generalized estimating equations is needed to ensure accurate inference.
RESULTS
In the current analyses, we focused on the first 8 months following loss of suppression (j = 0, …, 8). Thus, the full follow-up time for a given subject consisted of either 16 months following loss of suppression or, if the subject switched therapy before 8 months, 8 months after the switch time. We identified from SCOPE a total of 167 episodes of virologic failure among 133 patients treated between 2000 and 2004. For 39 episodes, censoring due to the end of follow-up in 2004 occurred within 8 months following loss of suppression, and thus no outcome was available. Of the remaining 128 episodes, 12 were censored within 8 months and an additional six were censored between month 8 and the end of follow-up (a total of three deaths and 15 losses to follow-up). Potentially informative censoring was addressed by inverse-probability-of-censoring weighting, which involves incorporating an additional factor into the weights under the assumption of censoring at random (2, 14).
At least one outcome was available for 116 episodes occurring among 100 patients. Most patients had been on multiple treatment regimens prior to inclusion in our analysis. The median time to switching therapy after onset of failure was 6 months. Tables 1 and 2 show the characteristics of the sample at the time of confirmed virologic failure.
TABLE 1.
Characteristics of subjects at the time of virologic failure (continuous variables), sampled from a cohort of HIV*-infected persons in San Francisco, California, 2000–2004†
Characteristic | First quartile | Median | Mean | Third quartile | No. with missing data |
---|---|---|---|---|---|
Plasma HIV RNA level (copies/ml) | 365.5 | 4,317 | 34,190 | 24,940 | 0 |
CD4 T-cell count (cells/μl) | 175.5 | 261.5 | 321 | 428.8 | 0 |
CD8 T-cell count (cells/μl) | 726.8 | 1,022 | 1,168 | 1,497 | 0 |
% average adherence (self-report) | 100 | 100 | 92.4 | 100 | 0 |
Year diagnosed with HIV | 1986 | 1989 | 1989 | 1993 | 2 |
Age (years) | 44.2 | 50.5 | 49.9 | 55.5 | 0 |
Year of first antiretroviral treatment | 1991 | 1996 | 1995 | 1997 | 0 |
Peak HIV RNA level (laboratory records) (copies/ml) | 46,020 | 177,500 | 242,300 | 381,200 | 0 |
Nadir of CD4 T-cell count (laboratory records) (cells/μl) | 36.3 | 72.5 | 118.3 | 165 | 0 |
No. of protease inhibitors tried | 2 | 3 | 3.2 | 4 | 0 |
No. of nucleoside reverse transcriptase inhibitors tried | 4 | 5 | 4.8 | 6 | 0 |
No. of nonnucleoside reverse transcriptase inhibitors tried | 0 | 1 | 0.9 | 1 | 0 |
HIV, human immunodeficiency virus.
Among 100 persons (116 episodes) with a known time of viral failure and follow-up for at least 8 months following time of failure.
TABLE 2.
Characteristics of subjects at the time of virologic failure (categorical variables), sampled from a cohort of HIV*-infected persons in San Francisco, California, 2000–2004†
Characteristic | No. | % | No. with missing data |
---|---|---|---|
Treatment history | 0 | ||
Enfuvurtide | 8 | 7 | |
Tenofovir | 41 | 35 | |
Lamivudine | 115 | 99 | |
Mono- or dual antiretroviral therapy | 57 | 49 | |
Current treatment | 0 | ||
Protease inhibitor | 87 | 75 | |
Nucleoside reverse transcriptase inhibitor | 113 | 97 | |
Nonnucleoside reverse transcriptase inhibitor | 22 | 19 | |
Recency of laboratory measurements | 0 | ||
Most recent HIV RNA level >1 month prior | 49 | 42 | |
Most recent CD4 T-cell count >1 month prior | 42 | 36 | |
Subject characteristics | |||
History of intravenous drug use (ever use) | 43 | 37 | 0 |
Male | 100 | 86 | 0 |
Sexual orientation “man who has sex with men” | 79 | 69 | 1 |
Homeless within the past year | 6 | 5 | 0 |
Current diagnosis of an opportunistic disease | 25 | 22 | 0 |
Self-identified HIV risk group | 0 | ||
Man having sex with men | 79 | 68 | |
Intravenous drug use | 22 | 19 | |
Heterosexual intercourse | 8 | 7 | |
Other | 7 | 6 | |
Race/ethnicity | 0 | ||
White | 51 | 44 | |
African-American/Black | 35 | 30 | |
Latino/Hispanic/Mexican-American | 17 | 15 | |
Other | 13 | 11 | |
Use of “crack” cocaine (past 4 months) | 1 | ||
Every day | 3 | 3 | |
Once per week | 6 | 5 | |
Once per month | 3 | 3 | |
Less than once per month | 7 | 6 | |
Never | 96 | 83 | |
Use of methamphetamine (past 4 months) | 1 | ||
Once per week | 2 | 2 | |
Once per month | 4 | 3 | |
Less than once per month | 3 | 3 | |
Never | 106 | 92 | |
Alcohol drinking (past 4 months) | 1 | ||
At least once per day | 10 | 9 | |
Nearly every day | 6 | 5 | |
3–4 times per week | 7 | 6 | |
1–2 times per week | 26 | 23 | |
2 or 3 times in total | 14 | 12 | |
Once | 12 | 10 | |
Never | 40 | 35 | |
Education (highest year of schooling completed) | 0 | ||
Grades 7–11 | 16 | 14 | |
High school/General Equivalency Diploma | 24 | 21 | |
Some college | 46 | 40 | |
4 years of college | 19 | 16 | |
Any graduate school | 11 | 9 | |
Yearly household income | 0 | ||
≤$6,000 | 5 | 4 | |
$6,001–$12,000 | 50 | 43 | |
$12,001–$18,000 | 20 | 17 | |
$18,001–$24,000 | 14 | 12 | |
$24,001–$30,000 | 2 | 2 | |
$30,001–$36,000 | 4 | 3 | |
$36,001–$75,000 | 13 | 11 | |
>$75,000 | 8 | 7 |
HIV, human immunodeficiency virus.
Among 100 persons (116 episodes) with a known time of viral failure and follow-up for at least 8 months following time of failure.
We employed cross-validated data-adaptive logistic regression analysis using the deletion/substitution/addition algorithm (15; http://www.stat.berkeley.edu/~laan/Software/index.html) to model the probability of switching therapy at each time point (the treatment mechanism). The deletion/substitution/addition algorithm fits models of varying sizes (1–10 terms) and complexities (main terms only or two-way interactions), selecting the size and complexity providing the optimal bias/variance tradeoff by assessing performance in independent samples. The algorithm considered 40 candidate covariates (this included all covariates listed in tables 1 and 2 and time elapsed since loss of suppression). The fit of the treatment mechanism is shown in table 3. Figure 1 shows the resulting stabilized weights, plotted for both the standard MSM and HA-MSM analyses.
TABLE 3.
Adjusted odds ratios for switching treatment among subjects with virologic failure, sampled from a cohort of HIV*-infected persons in San Francisco, California, 2000–2004
Covariate | Adjusted odds ratio† |
---|---|
Current diagnosis with an opportunistic disease | 1.21 |
No. of protease inhibitors tried (per drug) | 1.11 |
Most recent HIV RNA level undetectable | 0.44 |
% average adherence (per 10%) | 0.92 |
Most recent CD4 T-cell count (per 100 cells/μl) | 0.92 |
Nadir of CD4 T-cell count (per 100 cells/μl) | 1.06 |
Most recent HIV RNA level >1 month prior | 0.90 |
Age (per 5 years) | 0.80 |
HIV, human immunodeficiency virus.
Based on a multivariable logistic regression model of the treatment mechanism, fitted data-adaptively using the deletion/substitution/addition algorithm (15) and fivefold cross-validation. Variables for the treatment mechanism were selected among a larger sample of nonsuppressed subjects in the SCOPE cohort (Study on the Consequences of the Protease Inhibitor Era): 255 persons and 368 episodes, including people with unknown loss of suppression time and missing outcome data. The coefficients in the selected model (corresponding to the odds ratios reported here) were then refitted in the population with known loss of suppression time. Standard errors and p values are not shown, to emphasize the role of the treatment mechanism in the construction of weights.
FIGURE 1.
Estimated weights for standard and history-adjusted marginal structural models, the latter assuming a common model across time points, among human immunodeficiency virus-infected subjects with virologic failure sampled from a cohort in San Francisco, California, 2000–2004. For marginal structural models, estimated weights are plotted separately for each distinct episode of virologic failure, represented numerically on the x-axis as an index; for history-adjusted marginal structural models, estimated weights are plotted separately for each distinct episode of virologic failure and each time point at which subjects remain on their nonsuppressive therapy, represented numerically on the x-axis as an index.
The deletion/substitution/addition algorithm was further used to model the probability of being censored given the observed past, considering the same set of candidate covariates. A separate censoring model was fitted for each censoring mechanism, the end of follow-up in 2004 and death or study drop-out. In both cases, cross-validation selected a model containing the intercept only, resulting in no change in the IPTW weights (the prior weights were each multiplied by 1). Since CD4 T-cell count was believed to be the most important potential source of informative censoring, in an additional sensitivity analysis we fitted a (non-data-adaptive) model using the prior CD4 T-cell count to predict censoring. Inverse-probability-of-censoring weighting based on this model produced minimal relative changes (≤3 percent) in all reported effect estimates.
Standard MSMs were used to estimate 1) the marginal effect of time until switching therapy (c) on CD4 T-cell count 8 months after loss of suppression (baseline) (Y(8)): E(Yc(8)) = β0 + β1c; and 2) the effect of time to switching therapy on CD4 T-cell count 8 months after baseline, conditional on CD4 T-cell count at baseline (CD4(0)): E(Yc(8)| CD4(0)) = β0 + β1c + β2CD4(0) + β3c × CD4(0). Table 4 shows the effect estimates based on these models, as well as the corresponding noncausal associations (unadjusted for confounding). The MSM results suggest that, while waiting to switch therapy is generally associated with a lower future CD4 T-cell count (4.8 cells/μl lower per additional month until switching), waiting to switch is not detrimental among patients with high CD4 T-cell counts (>226 cells/μl) at the time of virologic failure. The discrepancy between the causal coefficients, as estimated using MSMs (−4.8 for the population as a whole and −11.3 conditional on baseline CD4 T-cell count), and the noncausal associations (4.9 for the population as a whole and −9.5 conditional on baseline CD4 T-cell count) suggests the presence of time-dependent confounding.
TABLE 4.
Noncausal associations and estimated causal effects of each additional month until switching therapy on CD4 T-cell count 8 months after loss of suppression among subjects with virologic failure, sampled from a cohort of HIV*-infected persons in San Francisco, California, 2000–2004†
Model and term | Estimate (cells/μl) per month of waiting to switch therapy | 95% confidence interval‡ |
---|---|---|
Regression-based estimates (residual confounding) | ||
Model: E(Y(8)|C) = β0 + β1C | ||
β1 | 4.8 | −8.1, 17.9 |
Model: E(Y(8)|C,CD4(0) = β0 + β1C + β2CD4(0) + β3C × CD4(0) | ||
β1 | −9.5 | −19.3, 0.2 |
β3 | 0.05 | 0.02, 0.08 |
Marginal structural model-based estimates | ||
Model: E(Yc(8)) = β0 + β1c | ||
β1 | −4.8 | −20.3, 10.6 |
Model: E(Yc(8)|CD4(0) = β0 + β1c + β2CD4(0) + β3c × CD4(0) | ||
β1 | −11.3 | −21.3, −1.3 |
β3 | 0.05 | 0.02, 0.08 |
HIV, human immunodeficiency virus.
“C” and “c” denote, respectively, the observed and counterfactual numbers of months after baseline (virologic failure) at which the participant switched therapy from the original nonsuppressive regimen. “CD4(0)” denotes the observed CD4 T-cell count at baseline (time of virologic failure).
Based on robust standard error estimates using generalized estimating equations.
At each time point, HA-MSMs were used to estimate the effect of additional time to switching therapy among patients who remained on their original treatment regimen, conditional on current CD4 T-cell count and time since failure. Nineteen persons achieved resuppression of the virus during follow-up despite remaining on the same regimen (an indicator that virologic failure was not due to resistance). Since we aimed to estimate the effect of waiting to switch therapy among persons with resistant virus, HA-MSMs were fitted only among those persons with no history of resuppression (Supp(j) = 0). Our HA-MSMs aimed to replicate the results of a hypothetical randomized trial in which persons currently on a nonsuppressive treatment regimen and with no history of resuppression on this regimen were assigned to switch to a new therapy at a random time in the future.
Two sets of HA-MSM analyses were conducted. In the first, the following model was assumed, and separate coefficients were estimated for each time point j:
Based on the resulting coefficient estimates for the first 9 months (j = 0, …, 8), figure 2 plots the estimated effect of each additional month of waiting to switch therapy for four current CD4 T-cell counts.
FIGURE 2.
Separate history-adjusted marginal structural model fitted for each of 9 months beginning at the time of virologic failure, showing the estimated effect of each additional month until switching therapy on CD4 T-cell count 8 months later, given current CD4 T-cell count (CD4(j)) and elapsed months since failure occurred (j), among human immunodeficiency virus-infected subjects with virologic failure sampled from a cohort in San Francisco, California, 2000–2004. Results were estimated among persons who had not yet switched therapy and had not resuppressed the virus.
Figure 2 suggests that the effect of additional time until switching differs depending on the amount of time a person has already spent on nonsuppressive therapy, as well as on the person’s current CD4 T-cell count. Specifically, in the months immediately subsequent to loss of virologic suppression, waiting to switch therapy appears to be beneficial among persons with high current CD4 T-cell counts but detrimental in persons with low CD4 T-cell counts. In contrast, among persons who have already spent at least 5 months on their current nonsuppressive therapy, spending additional time waiting to switch has a negligible effect on future CD4 T-cell count, regardless of a person’s current CD4 T-cell count.
In the second set of analyses, a single model was fitted for the first 9 months (j = 0, …, 8), now assuming common parameters across time and including time since failure as a covariate in the model:
Using this common model, the estimated effect of each additional month until switching therapy is
Table 5 shows the estimated effect of waiting to switch based on this model (plotted for four CD4 T-cell values in figure 3). Similarly to the analysis assuming a separate model at each time point, this analysis suggests that effect modification by current CD4 T-cell count decreases over time.
TABLE 5.
Estimated effect of each additional month until switching therapy, given current CD4 T-cell count (CD4(j)) and elapsed months since failure occurred (j), among subjects with virologic failure, sampled from a cohort of HIV*-infected persons in San Francisco, California, 2000–2004
Term† | Estimate (cells/μl) per month of waiting to switch therapy‡ | 95% confidence interval§ |
---|---|---|
β1 | −9.2 | −18.1, −0.3 |
β4 | 0.05 | 0.02, 0.08 |
β5 | 1.5 | −0.4, 3.3 |
β7 | −0.01 | −0.02, −0.002 |
HIV, human immunodeficiency virus.
Based on the model E(YĀ(j−1),c(j)(j + 8)|A(j − 1) = 1, Supp(j) = 0, CD4(j)) = β0 + β1c(j)+ β2CD4(j) + β3j + β4c(j) × CD4(j)+ β5c(j) × j + β6j × CD4(j) + β7c(j) × CD4(j) × j, where c(j)denotes the counterfactual switching time in number of months after time j.
Coefficient from a history-adjusted marginal structural model for the first nine time points (j = 0, …, 8).
Based on robust standard error estimates using generalized estimating equations.
FIGURE 3.
Common history-adjusted marginal structural model fitted for the 9 months beginning at the time of virologic failure, showing the estimated effect of each additional month until switching therapy on CD4 T-cell count 8 months later, given current CD4 T-cell count (CD4(j)) and elapsed months since failure occurred (j), among human immunodeficiency virus-infected subjects with virologic failure sampled from a cohort in San Francisco, California, 2000–2004. Results were estimated among persons who had not yet switched therapy and had not resuppressed the virus.
DISCUSSION
Modification of nonsuppressive antiretroviral therapy
Our results support a clinical scenario in which the major immunologic effects of exposure to nonsuppressive therapy accrue during the early months following virologic failure. One of the primary detrimental effects of delayed switching is hypothesized to arise because of continuing evolution of the virus in the presence of the drug, resulting in the accumulation of additional mutations. It may prove to be the case that the majority of such evolution has occurred within the first 5 months of nonsuppressive therapy. The changing role of CD4 T-cell count may also be due in part to the fact that the population remaining on nonsuppressive therapy for at least 5 months may consist disproportionately of persons better able to tolerate it.
Finally, we emphasize that the clinical relevance of the results presented here is limited by several factors: 1) the sample size was small; 2) due in part to sample size, we used a broad definition of treatment modification, which included treatment interruptions as well as initiation of new regimens with the goal of viral resuppression; 3) 19 of the persons included in the baseline MSM analysis achieved viral resuppression without treatment modification, suggesting that these persons did not fail therapy because of resistance; and 4) we made the assumption that time until switching had a linear effect on the outcome, which may or may not be an interesting summary measure of the true causal relation. Because of these concerns, the results presented are intended as an illustration of how HA-MSMs can be applied to a real question, rather than as a guide to clinical practice.
History-adjusted MSMs
HA-MSMs generalize MSM methodology. Like MSMs, HA-MSMs control for time-dependent confounding; in addition, HA-MSMs can be used to estimate time-dependent causal effect modification. As we have demonstrated, such an approach can be used to address practical clinical questions.
In the data example presented, loss of virologic failure represented a clear baseline. In many analyses, however, the assignment of a given time point as baseline is essentially arbitrary (for example, when baseline simply corresponds to entry into a study). In such cases, HA-MSMs offer an additional advantage, allowing gains in the precision of coefficient estimates by assuming a common model across time points. As a result of both this gain in precision and the ability of HA-MSMs to estimate effect modification in longitudinal data by time-varying covariates, we anticipate that these models will prove useful in multiple fields of applied research.
Acknowledgments
Dr. Maya Petersen was supported by a Predoctoral Fellowship from the Howard Hughes Medical Institute. Dr. Mark van der Laan was supported by grant R01 GM071397 from the National Institute of Mental Health. The SCOPE study was supported in part by grants from the National Institute of Allergy and Infectious Diseases (AI052745, AI055273, and AI44595), the California AIDS Research Center (CC99-SF and ID01-SF-049), the UCSF-GIVI (University of California, San Francisco–Gladstone Institute of Virology and Immunology) Center for AIDS Research (P30 MH59037 and P30 AI27763), the Center for AIDS Prevention Studies (P30 MH62246), and the General Clinical Research Center at San Francisco General Hospital (5-MO1-RR00083-37).
Abbreviations
- HA-MSM
history-adjusted marginal structural model
- HIV
human immunodeficiency virus
- IPTW
inverse-probability-of-treatment-weighted
- MSM
marginal structural model
- SCOPE
Study on the Consequences of the Protease Inhibitor Era
Footnotes
Conflict of interest: none declared.
References
- 1.Robins JM. Marginal structural models versus structural nested models as tools for causal inference. In: Halloran ME, Berry D, editors. Statistical models in epidemiology, the environment, and clinical trials. New York, NY: Springer-Verlag; 1999. pp. 95–134. [Google Scholar]
- 2.Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11:550–60. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
- 3.Robins JM. 1999 Proceedings of the American Statistical Association, Section on Bayesian Statistical Science. Alexandria, VA: American Statistical Association; 2000. Robust estimation in sequentially ignorable missing data and causal inference models; pp. 6–10. [Google Scholar]
- 4.Joffe M, Santanna J, Feldman H. Partially marginal structural models for causal inference. (Abstract) Am J Epidemiol. 2001;153(suppl):S261. [Google Scholar]
- 5.Joffe MM, Knauss J, Robinson B, et al. Lagged partially marginal structural models for controlling confounding. (Abstract) Am J Epidemiol. 2004;159(suppl):S66. [Google Scholar]
- 6.Feldman HI, Joffe M, Robinson B, et al. Administration of parenteral iron and mortality among hemodialysis patients. J Am Soc Nephrol. 2004;15:1623–32. doi: 10.1097/01.asn.0000128009.69594.be. [DOI] [PubMed] [Google Scholar]
- 7.Robins JM. Structural nested failure time models. In: Armitage P, Colton T, editors. The encyclopedia of biostatistics. Chichester, United Kingdom: John Wiley and Sons Ltd; 1998. pp. 4372–89. [Google Scholar]
- 8.van der Laan MJ, Petersen ML, Joffe MM. History-adjusted marginal structural models and statically-optimal dynamic treatment regimens. Int J Biostat. 2005;1(article 4) (Electronic article). ( http://www.bepress.com/ijb/vol1/iss1/4)
- 9.Panel on Clinical Practices for Treatment of HIV infection, US Department of Health and Human Services. Guidelines for the use of antiretroviral agents in HIV-1-infected adults and adolescents. Washington, DC: US Department of Health and Human Services; 2005. [Google Scholar]
- 10.Deeks SG, Barbour JD, Martin JN, et al. Sustained CD4) T cell response after virologic failure of protease inhibitor-based regimens in patients with human immunodeficiency virus infection. J Infect Dis. 2000;181:946–53. doi: 10.1086/315334. [DOI] [PubMed] [Google Scholar]
- 11.Deeks SG, Barbour JD, Grant RM, et al. Duration and predictors of CD4 T-cell gains in patients who continue combination therapy despite detectable plasma viremia. AIDS. 2002;16:201–7. doi: 10.1097/00002030-200201250-00009. [DOI] [PubMed] [Google Scholar]
- 12.Piketty C, Weiss L, Thomas F, et al. Long-term clinical outcome of human immunodeficiency virus-infected patients with discordant immunologic and virologic responses to a protease inhibitor-containing regimen. J Infect Dis. 2001;183:1328–35. doi: 10.1086/319861. [DOI] [PubMed] [Google Scholar]
- 13.Deeks SG. Treatment of antiretroviral-drug-resistant HIV-1 infection. Lancet. 2003;362:2002–11. doi: 10.1016/S0140-6736(03)15022-2. [DOI] [PubMed] [Google Scholar]
- 14.van der Laan MJ, Robins JM. Unified methods for censored longitudinal data and causality. New York, NY: Springer-Verlag; 2002. pp. 329–47. [Google Scholar]
- 15.Sinisi SE, van der Laan MJ. Deletion/substitution/addition algorithm in learning with applications in genomics. Stat Appl Genet Mol Biol. 2004;3(article 18) doi: 10.2202/1544-6115.1069. (Electronic article).( http://www.bepress.com/sagmb/vol3/iss1/art18/) [DOI] [PubMed]