<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>1742-5573-6-4</ui>
<ji>1742-5573</ji>
<fm>
<dochead>Analytic Perspective</dochead>
<bibl>
<title><p>Identifiability, exchangeability and confounding revisited</p></title>
<aug><au ca="yes" id="A1"><snm>Greenland</snm><fnm>Sander</fnm><insr iid="I1"/><email>lesdomes@ucla.edu</email></au>
<au id="A2"><snm>Robins</snm><mi>M</mi><fnm>James</fnm><insr iid="I2"/><email>robins@hsph.harvard.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Epidemiology and Department of Statistics, University of California, Los Angeles, CA 90095-1772, USA</p></ins>
<ins id="I2"><p>Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA</p></ins>
</insg>
<source>Epidemiologic Perspectives &amp; Innovations</source>
<issn>1742-5573</issn>
<pubdate>2009</pubdate>
<volume>6</volume>
<issue>1</issue>
<fpage>4</fpage>
<url>http://www.epi-perspectives.com/content/6/1/4</url>
<xrefbib><pubidlist><pubid idtype="pmpid">19732410</pubid><pubid idtype="doi">10.1186/1742-5573-6-4</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>19</day><month>6</month><year>2009</year></date></rec><acc><date><day>4</day><month>9</month><year>2009</year></date></acc><pub><date><day>4</day><month>9</month><year>2009</year></date></pub></history><cpyrt><year>2009</year><collab>Greenland and Robins; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<p>In 1986 the International Journal of Epidemiology published "Identifiability, Exchangeability and Epidemiological Confounding". We review the article from the perspective of a quarter century after it was first drafted and relate it to subsequent developments on confounding, ignorability, and collapsibility.</p>
</sec>
</abs>
</fm>
<meta>
<classifications>
<classification id="refman" subtype="user_supplied_xml" type="bmc"/>
</classifications>
</meta>
<bdy>
<sec><st><p>Introduction</p></st>
<p>Nearly a quarter of a century ago, we published an article titled "Identifiability, exchangeability and epidemiological confounding" <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, hereafter IEEC. At the request of the editor of the present journal, we review the article in light of the extensive developments since that time. In brief, the article gave a formal definition of confounding and a logical justification for correct intuitions about confounding that existed before. Among its deficiencies were that it failed to give adequate historical context and it was not general enough in its discussion. Greater generality was achieved in subsequent articles, especially "Confounding, collapsibility, and causal inference" <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> - which in some ways was an expansion and extension of IEEC addressed to a statistical audience - and in the less technical epidemiologic article, "Estimating causal effects" <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
<p>Since then, some brief histories of confounding have appeared <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. There have been many conceptual and technical developments as well; below we will cite a few of them that have been the focus of our research in recent decades.</p>
<sec><st><p>Confounding Discussions Before IEEC</p></st>
<sec><st><p>Confounding Before the 1980s</p></st>
<p>Like nearly all work before the 1980s, IEEC dealt only with estimating effects of exposure histories that could be captured adequately by a single summary, such as "exposed" and "unexposed." IEEC is in some ways no more than a logical finale to a long history of development regarding simple exposures. Concepts of confounding can be traced far back, e.g., John Stuart Mill discussed issues of confounded comparisons in his famed treatise on inductive logic <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>; so did Yule (p. 134) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> under the heading of "fictitious association." The key concepts were well known to many observational researchers in sociology and epidemiology long before we entered the field (e.g., <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>), although not always under the rubric of "confounding" - "spurious association" was a common term for the same idea.</p>
<p>Regardless of terms, confounding is the problem of confusing or mixing of exposure effects with other "extraneous" effects: If at the time of its occurrence, exposure was associated with pre-existing risk for the outcome, its association would reflect at least in part the effect of this baseline association, not the effect of exposure itself. The portion of the association reflecting this baseline association was called confounding or "spurious association." The factors responsible for this confounding (those producing the differences in baseline risk) were called <it>confounders</it>.</p>
<p>A few issues were not completely clear to some practitioners. One problem was that many researchers were inclined to overlook the <it>pre-existing </it>(baseline) proviso in the description. It was not unusual to see causal intermediates (causes of disease affected by exposure) treated as confounders; unfortunately, this practice adjusts away part of the very effect under study and can induce selection bias even under the null hypothesis of no direct, indirect, or overall effect of exposure. This problem was especially common in cardiovascular research, where studies of diet and lifestyle often adjusted for clinical measurements (blood pressure, serum cholesterol, etc.) taken after the behaviors in question. Apparently, exhortations against adjustment for post-exposure variables in randomized experiments (e.g., <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>) had not effectively filtered from experimental statistics into epidemiology.</p>
<p>To explain the problem, a number of authors (e.g., <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>) illustrated and contrasted confounders with intermediates using causal diagrams (then known as path diagrams). IEEC used a different mode of illustration, one imported from experimental statistics: The potential-outcomes model.</p>
</sec>
<sec><st><p>Potential Outcomes</p></st>
<p>The potential-outcomes model of causation, also known as the response-schedule or "counterfactual" model, was first formalized by Neyman in 1923 <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> (who soon thereafter teamed with Egon Pearson to develop the theories of alpha-level testing and confidence intervals). Of primary interest in IEEC, the model supplied justifications for a number of intuitions about epidemiologic confounding that existed before the 1980s, as well as the nonintuitive ideas that randomization did not guarantee absence of confounding <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and that confounding did not correspond fully to the statistical notion of noncollapsibility <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Thus the model is worth reviewing in detail.</p>
<p>As described in many articles and books (e.g., <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>), the model encodes causal statements by assigning to each study unit (whether a person, cohort, or population) a different outcome variable for each exposure level. For a binary exposure indicator X, the familiar "Y" of associational (noncausal) regression analysis is replaced by a pair of variables Y<sub>1</sub>, Y<sub>0 </sub>representing the unit's outcome when exposed and the unit's outcome when not exposed, respectively: Y<sub>1 </sub>is the outcome when X = 1, Y<sub>0 </sub>is the outcome when X = 0. Unit-level effects are differences or ratios of these exposure-specific and unit-specific outcomes. For more general X (including multivariate treatments, wherein X is a vector or even more complex), Y<sub>x </sub>represents the outcome if X = x. As did IEEC, for simplicity we will focus on the binary X case, noting that our comments apply generally.</p>
<p>With this model, the problem of causal inferences devolves to how one can identify these effects when for each unit at most one of the outcomes can be observed. In other words: How can we estimate an effect such as Y<sub>1</sub>-Y<sub>0 </sub>when we cannot observe both Y<sub>1</sub>, Y<sub>0 </sub>at once? Strong assumptions are needed. Randomization was a natural assumption for experimental research, and by the 1930s the potential-outcomes model was established in experimental statistics (e.g., <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>), although without that name. It was sometimes called the "randomization model" <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>; nonetheless, randomization was an assumption for its use, not intrinsic to the model itself, and by the 1970s the model was being used for nonrandomized studies.</p>
<p>Rubin <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp> introduced the term "potential outcomes" and formalized a set of assumptions that identified average causal effects within the model. An important part of Rubin's formulation was to link the causal-inference problem to the missing-data problem in surveys: Under the model, at least one of the potential outcomes is missing. IEEC however was based instead on the formulation spelled out by Copas (p. 269) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, which had been common currency in studies of randomization-based statistics before the 1970s (e.g., <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B31">31</abbr></abbrgrp>). IEEC never used the term "potential outcome" or cited Rubin's work, which at the time was not familiar (nor apparently to the reviewers). This lapse was rectified in later updates (e.g., <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B23">23</abbr></abbrgrp>).</p>
<p>The potential-outcomes model has often caused consternation because it treats the potential outcomes Y<sub>1 </sub>and Y<sub>0 </sub>as if they were baseline covariates, fixed from the start of follow-up. The only possible effect of exposure is then on whether we see Y<sub>1 </sub>or Y<sub>0</sub>. In other words, the exposure status X of a unit is unrelated to each <it>potential </it>outcome of the unit; exposure only determines which potential outcome we may observe. If exposure occurs (X = 1), we may observe Y<sub>1 </sub>but not Y<sub>0</sub>; if exposure doesn't occur (X = 0), we may observe Y<sub>0 </sub>but not Y<sub>1</sub>. The unobservable potential outcome (Y<sub>0 </sub>if X = 1, Y<sub>1 </sub>if X = 0) is sometimes called <it>counterfactual </it>(contrary to fact). This name is inaccurate under the strictest formulations of the model, because each potential outcome is presumed to be conceivable regardless of the exposure history, and its value may be the actual outcome regardless of the actual exposure history. For example, if a person's potential outcomes are Y<sub>1 </sub>= 1, Y<sub>0 </sub>= 1 where Y is a death indicator, the person's actual outcome is death regardless of exposure; only the unrealized <it>exposure </it>history ("exposed" when no exposure occurs; "not exposed" when exposure occurs) is counterfactual.</p>
<p>These somewhat odd features helped inspire vigorous attacks on the model (e.g., see the comments following Maldonado and Greenland <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>), but those were not accompanied by evidence that the model gave misleading results in any real example. On the contrary, the idea of potential outcomes fixed at baseline brings a certain transparency to and supplies logical justification for many accepted statistical procedures <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>.</p>
<p>One limitation of IEEC, shared with most descriptions of potential outcomes, was that it confined its discussion to deterministic Y<sub>1 </sub>and Y<sub>0 </sub>(which were taken to be binary indicators). There is nothing essential about this simplification, however. Sequels (e.g., <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B35">35</abbr></abbrgrp>) made clear that Y<sub>1 </sub>and Y<sub>0 </sub>could instead be replaced by potential <it>parameters </it>&#952;<sub>1 </sub>and &#952;<sub>0 </sub>of outcome distributions to allow for stochastic outcomes. And, as could Y<sub>1 </sub>and Y<sub>0</sub>, &#952;<sub>1 </sub>and &#952;<sub>0 </sub>could be vectors or more complex structures.</p>
<p>Another shortcoming of IEEC (shared by most of the literature on causation in epidemiology, especially the graphical literature) is that it did not emphasize the importance of limiting the exposure X to a potentially changeable condition, in order to make sense of the unobserved potential outcomes <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. Importantly, this shortcoming was <it>not </it>shared by other articles appearing at the time <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Unfortunately, Holland's famous description thoroughly botched the model's history, misattributing it to Rubin.</p>
</sec>
<sec><st><p>Confounding, Collapsibility, and Exchangeability</p></st>
<p>Noncollapsibility of a measure of association over a covariate means that the measure changes upon stratification by the covariate; stated in reverse, it means we get a different measure if we collapse over (ignore) the covariate. As the above citations noted, noncollapsibility by itself could not imply confounding, since the covariate might be affected by exposure; the change would then reflect adjusting away the exposure effect or the creation of selection bias, rather than any confounding reduction. More subtly, as noted in IEEC, the change might reflect the introduction of confounding or selection bias by another, uncontrolled covariate <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B25">25</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>, or it might reflect an effect of measurement error rather than confounding <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. And of course the change might reflect only random error (the hypothesis addressed by collapsibility tests). But many authors naturally assumed that, absent these phenomena, any change must reflect removal of confounding by the covariate.</p>
<p>Miettinen and Cook <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> argued that this was not true in general: Some measures (those that were not differences or ratios of probabilities, such as odds ratios) could change upon adjustment even if no confounding or other bias were present. Even more striking, the converse could hold: confounding by a covariate could be present even if no change in such measures occurred upon adjustment for the covariate. Their intuition was lost on many statisticians, who continued to equate confounding with noncollapsibility. It thus seemed worthwhile to explain the source of the intuition in a more rigorous manner.</p>
<p>In doing so, the approach in IEEC was to take care to <it>not </it>define confounding in terms of other covariates. Rather, the strategy was to show there could be no confounding (mixing of effects) given "exchangeability" of the groups being compared (or "comparability", as Miettinen and Cook called it). The compared groups were said to be <it>exchangeable </it>with respect to an outcome measure if their outcomes would be the same whenever they were subjected to the identical exposure history. The groups were said to be only <it>partially </it>exchangeable if their outcomes would be the same when they were subjected only to certain (not necessarily all) exposure histories.</p>
<p>Using a table to contrast the distribution of potential outcomes in two groups (which we will here label A and B), IEEC focused on the example of a binary exposure variable X leading to binary potential-outcome variables Y<sub>1 </sub>and Y<sub>0</sub>, where the latter indicate the development of a disease. In this setting, the average outcome is the incidence proportion. The two groups would be exchangeable with respect to all-or-none exposure and average outcome if they had identical average values of both Y<sub>1 </sub>and Y<sub>0 </sub>(i.e., identical incidence when subject to the same exposure). They would thus have the same average outcome if they were both entirely exposed or if they were both entirely unexposed. They would be only partially exchangeable if the average of (say) Y<sub>0 </sub>was the same for both groups but the average of Y<sub>1 </sub>differed between the groups; in that case they would have the same average outcome when not exposed but a different average outcome when exposed.</p>
<p>As did Miettinen and Cook, IEEC assumed that interest focused on comparing the average outcome of an exposed (X = 1) population A to what that outcome would have been had the <it>same </it>population not been exposed (X = 0). We could observe the average of Y<sub>1 </sub>in A, but not the average of Y<sub>0 </sub>in A. We would hope to find or construct a group B that plausibly had the same average of Y<sub>0 </sub>as did A, and was not exposed so that we observe this average. By substituting the average of Y<sub>0 </sub>in B for the average of Y<sub>0 </sub>in A, we could now take the difference in the two observable quantities (the average of Y<sub>1 </sub>in A and the average of Y<sub>0 </sub>in B) as our measure of effect of exposure in A. But, if the Y<sub>0 </sub>average in B did not equal that in A, simply pretending the two were equal would create a bias in our estimate of the effect of exposure in A. That bias is what IEEC called confounding, since it mixed "baseline" differences in A and B (i.e., the difference in the Y<sub>0 </sub>averages in A and B) with the desired effect (the difference of the Y<sub>1 </sub>and Y<sub>0 </sub>averages in A).</p>
<p>As discussed subsequently to IEEC <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B6">6</abbr><abbr bid="B24">24</abbr></abbrgrp>, the concepts generalize immediately to situations in which X = 1 and X = 0 represent two different exposure <it>distributions </it>or <it>population </it>(group-level) <it>interventions </it>for the target population A, to situations in which the outcome of interest is something other than an average, and to situations in which the target A is observed under <it>neither </it>pattern of interest. In the latter situations, observable substitutes must be found for the outcome of A under X = 1 and the outcome of A under X = 0, perhaps from subsets of A (as in randomized trials) or from entirely different populations. Failure of these substitutes to equal the target outcomes would normally lead to bias, which again we would call confounding.</p>
</sec>
<sec><st><p>Induced Confounding and Illusory Confounding</p></st>
<p>Control of intermediates (often called <it>mediators </it>in the social-science literature) is sometimes promoted as a strategy to estimate effects transmitted through pathways not involving the intermediates. Judd and Kenny (<abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, p. 608-609) noted that, even in randomized trials, this strategy could be biased by failure to control for factors that affect both the intermediates and the outcome of interest. Robins <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and Robins and Greenland <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> extended the potential-outcomes framework of IEEC to illustrate this problem, and provided no-confounding criteria for estimating direct effects; see also Pearl (<abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, sec. 4.2). The problem is much more easily seen using causal diagrams, where it can be described as confounding induced by opening noncausal (biasing) paths between the exposure and the outcome due to conditioning on the intermediate <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr></abbrgrp>.</p>
<p>As mentioned earlier, IEEC and the companion paper by Robins and Greenland <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> noted that confounding can be induced by control of baseline covariates. Again, diagrams better show how this confounding arises by opening noncausal paths between the exposure and the outcome <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B41">41</abbr><abbr bid="B49">49</abbr><abbr bid="B51">51</abbr></abbrgrp>. The practical importance of the phenomenon may be limited apart from special situations <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Nonetheless, the examples show why one cannot be sure that confounding is being reduced as one adjusts for additional confounders and sees the estimate change: Even if the confounders satisfy the usual conditions for being a confounder and there is no other bias, it is theoretically possible that the change represents an increase in confounding.</p>
<p>There are other problems with examining changes in estimates to judge whether confounding is being controlled. One is that, for a common outcome, changes in odds ratios and (to a lesser extent) rate ratios may largely reflect noncollapsibility rather than confounding removal <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B24">24</abbr></abbrgrp>. Another is that adjustment may induce changes in estimates solely by increasing sparse-data bias, leading to a misimpression that confounding is being reduced <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> (p.525).</p>
</sec>
</sec>
<sec><st><p>Epidemiologic Confounding, Randomization, and Ignorability</p></st>
<p>In statistics, the word "confounding" has often been used to describe related but different concepts, and does not even play a central role in many statistical discussions of causal inference (e.g., in the theory of experimental design, where "confounding" refers to an intentional design strategy). Instead, assumptions of no confounding are replaced by <it>randomization </it>or else by <it>ignorability </it>assumptions. Ignorability assumptions are sometimes called "no unmeasured confounding" assumptions, even though this usage differs from the usual epidemiologic meaning of "no confounding" (although in very large samples the two are equivalent in practical terms).</p>
<sec><st><p>Randomization and Ignorability</p></st>
<p>Simple (complete) randomization of exposure means that exposure events occur independently of <it>every </it>event that precedes their occurrence. In particular, because the potential outcomes already exist at the times of exposure events, it implies that exposure events occur independently of the potential outcomes of interest. The latter independence condition is often called an <it>ignorability </it>assumption <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>; that is, ignorability is a narrowing of the randomization assumption to the specific outcome under study. For example, <it>weak </it>ignorability for a binary exposure event (X = 1 or X = 0) says exposure events occur independently of <it>each </it>potential outcome Y<sub>1 </sub>and Y<sub>0</sub>; <it>strong </it>ignorability says exposure events occur independently of the <it>pair </it>of potential outcomes (Y<sub>1</sub>, Y<sub>0</sub>). Strong ignorability implies weak ignorability, and randomization implies them both; thus any nonignorability implies lack of randomization. Each of these conditions may also be defined conditionally on some set of covariates (e.g., strong ignorability conditional on age and sex).</p>
<p>Ignorability of exposure events (X levels) with respect to a pair of potential outcomes Y<sub>1</sub>, Y<sub>0 </sub>says X occurs independently of Y<sub>1 </sub>and Y<sub>0</sub>. This property makes X independent of any function of Y<sub>1 </sub>or Y<sub>0 </sub>(e.g., their logs). It follows that the subpopulation defined by X = 1 and the subpopulation defined by X = 0 would be exchangeable with respect to any function or summary of the potential outcomes (e.g., the mean of Y<sub>1</sub>, the mean of Y<sub>0</sub>, or their geometric means). Thus it seemed (and still seems) to many statisticians that there should be no confounding of any measure, once we are given the ignorability condition.</p>
<p>Given a causal diagram for the data-generating process under study, we may ask if we can unbiasedly estimate the effect of X on Y conditional on a set of covariates Z in the diagram. It turns out that is the case if Z satisfies the <it>back-door criterion </it>of Pearl <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B54">54</abbr></abbrgrp>, which implies ignorability of X events with respect to (Y<sub>1</sub>, Y<sub>0</sub>) given Z. Pearl however refers to this condition as "no confounding" <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>; as discussed next, this usage diverges from our usage of "no confounding" in IEEC and at various points since <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B55">55</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Randomization and Ignorability versus No Confounding</p></st>
<p>There is a discrepancy between the concepts of randomization and ignorability as defined in statistics and the concepts of no confounding and exchangeability as defined in IEEC and in Robins and Morgenstern <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. To maintain conformity with traditional usage, IEEC defined nonexchangeability and hence confounding in terms of the <it>actual </it>rather than probabilistic exposure associations with potential outcomes. Thus confounding can be present even if the exposure assignment mechanism is completely random.</p>
<p>To see this problem, suppose the outcome is 10-year mortality and the compared groups differ by baseline age and sex. Then it is almost certain some confounding is present, because age and sex differences will almost certainly lead to mortality differences, and those differences are not due to exposure. As noted in IEEC, it does not matter if exposure was randomized or otherwise assigned in an ignorable manner; once the assignment is made and the groups are created, any outcome differences between them will be confounded (mixed) with exposure effects <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>.</p>
<p>Why do randomization and ignorability fail to capture no-confounding assumptions in epidemiology and sociology? Because randomization and ignorability refer only to the <it>mechanism </it>that generates exposure events, not to the product of that mechanism. An ignorable mechanism may by chance leave some degree of confounding in any exposure assignment it makes. As has long been recognized, most assignments made by actual ignorable mechanisms (such as simple randomizers) will have some degree of confounding in the sense of mixing of effects (e.g., <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>). Analytic adjustment for baseline covariates can remove confounding by those covariates, but will leave confounding by unadjusted covariates. This problem is sometimes called one of "unmeasured confounding" but remains a problem whether the covariate is measured or not; what matters is whether it is adjusted in a manner that removes confounding by the covariate and that does not introduce more confounding.</p>
<p>As the size of the randomized groups in a given trial increases, randomization does make it ever less probable that substantial confounding remains. This feature reflects a key statistical advantage flowing from successful randomization: The confidence limits capture uncertainty about the confounding left by randomization <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B55">55</abbr></abbrgrp>. In this sense, randomization-based confidence limits account for concerns about residual baseline confounding by unmeasured factors in randomized trials, although post-randomization events (such as censoring) may reintroduce these concerns. Note that other statistical aspects of the trial (e.g., number of participants, power) do not enter into these considerations except through confidence limits, so that a trial that is "large" and "powerful" may nonetheless be poorly informative because it produced a wide confidence interval.</p>
</sec>
<sec><st><p>Adjustment for Baseline Covariates under an Ignorable Mechanism</p></st>
<p>As noted above, there have been many intuitive and mathematical arguments as to why adjustment for baseline covariates can be important, even with an ignorable exposure-assignment mechanism such as randomization (e.g., <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B55">55</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>). This importance is reflected by the fact that in propensity-score (exposure-probability) adjustment for confounding, adjustment by the fitted score provides better frequency properties than the true score <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B60">60</abbr></abbrgrp>. For example, under simple 50% randomization to exposure, the true propensity score is constant across all individuals (it is 0.5, the chance of being assigned to exposure); its use thus produces no adjustment at all, since everyone will end up in the same score stratum. In contrast, a fitted score will adjust for the covariates it includes, although the extent of that adjustment may depend heavily on the form of the propensity model.</p>
<p>In sum, randomization (or more generally, ignorability) does <b>not </b>impose "no confounding" in the common-sense use of the term. Rather, it provides the following related properties <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B33">33</abbr><abbr bid="B55">55</abbr><abbr bid="B58">58</abbr></abbrgrp>:</p>
<p>1) <it>Unconditional expected </it>confounding of zero: This is a pre-allocation expectation, corresponding to no asymptotic bias over the randomization distribution; but once allocation occurs, it becomes secondary to properties of the allocation.</p>
<p>2) A randomization-based ("objective") derivation of a prior for <it>residual </it>confounding <it>after conditioning on all measured confounders</it>. This prior applies after allocation as well as before, and becomes more narrowly centered around zero as the sample size increases. This is a key post-allocation benefit of randomization.</p>
<p>As emphasized by R.A. Fisher <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>, it also provides a randomization-based distribution for conducting frequentist inference on effects <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>; property (2) can be viewed as a Bayesian version of this property <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Exchangeability in IEEC and Subjective Probability Theory</p></st>
<p>Property (2) above was the basis for IEEC tying the epidemiologic concept of confounding to the subjective-Bayesian concept of exchangeability. In probability, random variables are said to be <it>exchangeable </it>(under a given joint distribution) if they can be interchanged (permuted) in any statement without altering the probability of the statement <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. For example, if we consider U and V exchangeable with joint probability distribution Pr(U, V), the probabilities Pr(U&lt;V) and Pr(V&lt;U) derived from Pr(U, V) must be equal. Exchangeable random variables are in essence indistinguishable with regard to our bets about their values (whether absolutely or relative to one another). Note that exchangeability is a property of the joint distribution of the variables. In subjective probability systems, different observers may have different views on whether the variables are exchangeable because they may assign different distributions to variables.</p>
<p>In IEEC, the random variables at issue are the distributions of potential outcomes in the compared groups A and B, and the discussion focused on a binary deterministic outcome. To describe the situation more generally (as in <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>) for a binary exposure X, let U<sub>1A </sub>and U<sub>0A </sub>be some summary of the outcomes in group A when everyone in A is exposed (X = 1) and when everyone in A is unexposed (X = 0); similarly for group B, let U<sub>1B </sub>and U<sub>0B </sub>be the summaries when everyone in group B is exposed and when everyone is unexposed. X might be an individual exposure (e.g., carbohydrate consumption) or a group-level exposure (e.g., legislation, insurance). Further, assume that what happens in each group has no effect on what happens in the other, but that U<sub>0A </sub>and U<sub>0B </sub>are exchangeable given U<sub>1A</sub>. Then in drawing inferences about the effect of exposure (of X = 1 vs. X = 0) on A, say U<sub>1A</sub>-U<sub>0A</sub>, we could substitute U<sub>0B </sub>for U<sub>0A</sub>.</p>
<p>The Bayesian exchangeability connection provides one explanation (beyond common-sense and abstruse ancillarity arguments) for why we might declare confounding to be present in a study even though the mechanism that assigned the exposures was ignorable. Because Bayesians condition on all the observed data, once we observe an association of a baseline risk factor with exposure, exchangeability is lost for us: Upon observing an association, we assign different distributions to U<sub>0A </sub>and U<sub>0B</sub>, reflective of the information suggesting a baseline difference.</p>
<p>Note well that this version of exchangeability is a property of our information about the groups and the variables, hence is a subjective (observer-relative) property, not an absolute (objective) property of either. In this view, evaluation of confounding is equally subjective.</p>
</sec>
</sec>
<sec><st><p>Our Research on Confounding Since IEEC</p></st>
<p>Since IEEC we have separately pursued quite different paths to deal with problems that involve confounding by many covariates. Our divergence stems primarily from differences in the applied settings and target problems that have been our focus over the decades since IEEC. Nonetheless, it has at times raised a few interesting philosophical issues, especially when dealing with many exposures or confounders.</p>
<sec><st><p>Multiple Exposures as Multiple Confounders</p></st>
<p>Consider first a study involving single measurements of multiple exposures, as is common in case-control studies of occupation and lifestyle. In these settings each exposure may be a potential confounder for every other exposure, so it is natural to consider an outcome-regression model containing all exposures. In conventional theory, this sort of model could provide an estimate for each exposure "adjusted" for all the rest. Unfortunately, it often happens that the number of subjects available may appear large but is not large enough to produce approximately unbiased estimates from the usual maximum-likelihood (whether unconditional, conditional, or partial) or estimating-equation methods <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. These problems arise because the number of subjects needed to get approximate unbiasedness grows exponentially with the number of covariates in the model. In some applications the usual estimators fail completely due to collinearity <abbrgrp><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp>.</p>
<p>One way to address these problems is to accept that (in observational studies) some bias is unavoidable, and may even be tolerable in exchange for improved accuracy on average over all the exposures being considered. The usual methods then become unacceptable because they can incur huge bias without a compensating accuracy benefit. Methods that pursue over-all accuracy improvement include the vast array of multilevel and hierarchical techniques developed under the headings of ridge regression, shrinkage (Stein) estimation, penalized estimation, and empirical and semi-Bayes estimation (see <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> for an elementary overview). These methods have also been proposed as replacements for conventional variable-selection procedures in order to avoid the distortions produced by selection <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B69">69</abbr></abbrgrp>. Finally, the methods have been extended to study the impact of uncontrolled confounding and other biases in observational research <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Time-Varying Exposures</p></st>
<p>At the time of IEEC, one of us (Robins) was developing methods estimating effects of time-varying treatments from longitudinal data. Early accessible introductions in this work include Robins <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B62">62</abbr></abbrgrp> and Robins et al. <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>; see also Robins et al. <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Estimating effects of potentially complex treatment histories created extreme difficulties for likelihood-based methods (which were standard at the time). These difficulties led to a frequentist approach with confounder adjustment based on regression of treatment probabilities on measured confounders (as in propensity-score methods) followed by use of this fitted treatment model for adjustment. Fitting is accomplished by semi-parametric methods such as g-estimation and inverse-probability-of-treatment (IPT) weighting. These methods were further extended in many directions, including control of confounding due to treatment noncompliance <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>, estimation of direct effects <abbrgrp><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr><abbr bid="B79">79</abbr></abbrgrp>, and to the study of uncontrolled confounding <abbrgrp><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr><abbr bid="B82">82</abbr></abbrgrp>.</p>
<p>In the course of this IPT development, it was shown that familiar likelihood-based methods (including maximum-likelihood and Bayesian outcome regression) could exhibit poor large-sample frequency properties if many covariates needed adjustment and no use was made of treatment probabilities in fitting <abbrgrp><abbr bid="B83">83</abbr><abbr bid="B84">84</abbr></abbrgrp>. These observations have led to recommendations for doubly robust procedures that fit outcome regressions using IPT weighting <abbrgrp><abbr bid="B85">85</abbr><abbr bid="B86">86</abbr></abbrgrp>. This approach resembles recommendations that outcome regressions be conducted after propensity-score matching has removed gross confounder imbalances between compared groups <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The shared intuition is that if one of the regression models (for treatment or outcome) is mis-specified, the resulting adjustment failures will be compensated for by the adjustment induced by the other model.</p>
<p>A topic worthy of further research that we have long recognized but not pursued is how to blend ideas from our divergent methods and applications. Some applications of shrinkage-like methods, such as boosting to predict treatment, appear useful for taming problems due to unstable weights <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>. One logical next step might be to apply shrinkage methods to the IPT-weighted outcome regression as well as to the weight regression. We are not aware of studies of such "doubly robust double shrinkage."</p>
</sec>
</sec>
</sec>
<sec><st><p>Conclusion</p></st>
<p>Some colleagues have said that they think IEEC is among the most important methods papers from its era, and it is often, though not always, cited in reviews of epidemiologic confounding, e.g., in Nurminen <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and Pearl <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> but not in Vandenbroucke <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Regardless of its historical status, IEEC is a snapshot of a topic in transition. Using a standard causal model familiar to experimental statisticians since the 1930s, IEEC provided a formal definition of confounding and logical justifications for traditional intuitions about confounding graphed in earlier literature <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. It also provided logical justifications for less intuitive distinctions such as that between confounding and collapsibility, and that between nonconfounding and randomization or ignorability.</p>
<p>The terminology in IEEC could have been clearer; for example, it talked of "causal confounding" without ever explaining what noncausal confounding would be. The phrase "causal confounding" was a nod to the fact that some researchers also talked of "selection confounding," in which bias arose because a covariate related to exposure affected selection rather than disease; today we would simply call this phenomenon selection bias. There also could have been a clearer connection made to the topic of standardization and its relation to the target population for inference. Nonetheless, the framework in IEEC served as a basis for many of the more general and more extensive treatments that followed <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B33">33</abbr><abbr bid="B35">35</abbr><abbr bid="B46">46</abbr></abbrgrp>. We hope that its sequels rectified the shortcomings of IEEC, and that modern readers come away from it feeling equipped to move on to subsequent literature without too much difficulty.</p>
</sec>
<sec><st><p>Competing interests</p></st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>SG wrote and revised the manuscript. JMR suggested revisions. Both authors read and approved the final manuscript.</p>
</sec>
</bdy>
<bm>
<ack><sec><st><p>Acknowledgements</p></st>
<p>We thank George Maldonado, Judea Pearl, and Tyler VanderWeele for helpful comments and Karyn Heavner for editing.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Identifiability, exchangeability, and epidemiological confounding</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Int J Epidemiol</source><pubdate>1986</pubdate><volume>15</volume><fpage>413</fpage><lpage>419</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/ije/15.3.413</pubid><pubid idtype="pmpid" link="fulltext">3771081</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Confounding and collapsibility in causal inference</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Pearl</snm><fnm>J</fnm></au></aug><source>Statistical Science</source><pubdate>1999</pubdate><volume>14</volume><fpage>29</fpage><lpage>46</lpage><xrefbib><pubid idtype="doi">10.1214/ss/1009211805</pubid></xrefbib></bibl><bibl id="B3"><title><p>Estimating causal effects</p></title><aug><au><snm>Maldonado</snm><fnm>G</fnm></au><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Int J Epidemiol</source><pubdate>2002</pubdate><volume>31</volume><fpage>422</fpage><lpage>429</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/ije/31.2.422</pubid><pubid idtype="pmpid" link="fulltext">11980807</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Confounding in health research</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Morgenstern</snm><fnm>H</fnm></au></aug><source>Annu Rev Public Health</source><pubdate>2001</pubdate><volume>22</volume><fpage>189</fpage><lpage>212</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.publhealth.22.1.189</pubid><pubid idtype="pmpid" link="fulltext">11274518</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>The history of confounding</p></title><aug><au><snm>Vandenbroucke</snm><fnm>JP</fnm></au></aug><source>History of Epidemiological Methods and Concepts</source><publisher>Basel, Switzerland: Birkhaser Verlag</publisher><editor>Morabia A</editor><pubdate>2004</pubdate><fpage>313</fpage><lpage>326</lpage></bibl><bibl id="B6"><title><p>Confounding</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Encyclopedia of Epidemiology</source><publisher>Thousand Oaks, CA: Sage Publications</publisher><editor>Boslaugh S</editor><pubdate>2007</pubdate></bibl><bibl id="B7"><title><p>On the epidemiologic notion of confounding and confounder identification</p></title><aug><au><snm>Nurminen</snm><fnm>M</fnm></au></aug><source>Scand J Work Environ Health</source><pubdate>1997</pubdate><volume>23</volume><fpage>64</fpage><lpage>68</lpage><xrefbib><pubid idtype="pmpid">9098915</pubid></xrefbib></bibl><bibl id="B8"><aug><au><snm>Mill</snm><fnm>JS</fnm></au></aug><source>A System of Logic, Ratiocinative and Inductive (1843 edition, reprinted in 1956)</source><publisher>London: Longmans, Green, and Company</publisher><pubdate>1956</pubdate></bibl><bibl id="B9"><title><p>Notes on the theory of association of attributes in statistics</p></title><aug><au><snm>Yule</snm><fnm>GU</fnm></au></aug><source>Biometrika</source><pubdate>1903</pubdate><volume>2</volume><fpage>121</fpage><lpage>134</lpage><xrefbib><pubid idtype="doi">10.1093/biomet/2.2.121</pubid></xrefbib></bibl><bibl id="B10"><title><p>Smoking and lung cancer: recent evidence and a discussion of some questions</p></title><aug><au><snm>Cornfield</snm><fnm>J</fnm></au><au><snm>Haenszel</snm><fnm>W</fnm></au><au><snm>Hammond</snm><fnm>EC</fnm></au><au><snm>Lilienfeld</snm><fnm>AM</fnm></au><au><snm>Shimkin</snm><fnm>MB</fnm></au><au><snm>Wynder</snm><fnm>EL</fnm></au></aug><source>J Natl Cancer Inst</source><pubdate>1959</pubdate><volume>22</volume><fpage>173</fpage><lpage>203</lpage><xrefbib><pubid idtype="pmpid">13621204</pubid></xrefbib></bibl><bibl id="B11"><title><p>Some statistical problems in research design</p></title><aug><au><snm>Kish</snm><fnm>L</fnm></au></aug><source>Am Sociol Rev</source><pubdate>1959</pubdate><volume>24</volume><fpage>328</fpage><lpage>338</lpage><xrefbib><pubid idtype="doi">10.2307/2089381</pubid></xrefbib></bibl><bibl id="B12"><aug><au><snm>Blalock</snm><fnm>H</fnm></au></aug><source>Causal inference in nonexperimental research</source><publisher>Chapel Hill, NC: University of North Carolina Press</publisher><pubdate>1964</pubdate></bibl><bibl id="B13"><aug><au><snm>MacMahon</snm><fnm>B</fnm></au><au><snm>Pugh</snm><fnm>TF</fnm></au></aug><source>Epidemiology: Principles and Methods</source><publisher>Boston: Little, Brown and Company</publisher><pubdate>1970</pubdate></bibl><bibl id="B14"><aug><au><snm>Susser</snm><fnm>M</fnm></au></aug><source>Causal Thinking in the Health Sciences</source><publisher>New York City: Oxford University Press</publisher><pubdate>1973</pubdate></bibl><bibl id="B15"><aug><au><snm>Cox</snm><fnm>DR</fnm></au></aug><source>Planning of Experiments</source><publisher>New York City: John Wiley and Sons Inc</publisher><pubdate>1958</pubdate></bibl><bibl id="B16"><source>Statistical methods in cancer research. Vol I: the analysis of case-control data</source><publisher>Lyon, France: International Agency for Research on Cancer (IARC)</publisher><pubdate>1980</pubdate></bibl><bibl id="B17"><title><p>Control of confounding in the assessment of medical technology</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Neutra</snm><fnm>R</fnm></au></aug><source>Int J Epidemiol</source><pubdate>1980</pubdate><volume>9</volume><fpage>361</fpage><lpage>367</lpage><xrefbib><pubid idtype="pmpid">7203778</pubid></xrefbib></bibl><bibl id="B18"><aug><au><snm>Schlesselman</snm><fnm>JJ</fnm></au></aug><source>Case-Control Studies: Design, Conduct, Analysis</source><publisher>Oxford: Oxford University Press</publisher><pubdate>1982</pubdate></bibl><bibl id="B19"><title><p>On the application of probability theory to agricultural experiments. Essay on principles. Section 9 (1923)</p></title><aug><au><snm>Neyman</snm><fnm>J</fnm></au></aug><source>Stat Sci</source><pubdate>1990</pubdate><volume>5</volume><fpage>465</fpage><lpage>480</lpage></bibl><bibl id="B20"><title><p>Epidemiologic methods in clinical trials</p></title><aug><au><snm>Rothman</snm><fnm>KJ</fnm></au></aug><source>Cancer</source><pubdate>1977</pubdate><volume>39</volume><fpage>1771</fpage><lpage>1775</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/1097-0142(197704)39:4+&lt;1771::AID-CNCR2820390803&gt;3.0.CO;2-2</pubid><pubid idtype="pmpid">322841</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Confounding: essence and detection</p></title><aug><au><snm>Miettinen</snm><fnm>OS</fnm></au><au><snm>Cook</snm><fnm>EF</fnm></au></aug><source>Am J Epidemiol</source><pubdate>1981</pubdate><volume>114</volume><fpage>593</fpage><lpage>603</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">7304589</pubid></xrefbib></bibl><bibl id="B22"><aug><au><snm>Berk</snm><fnm>RA</fnm></au></aug><source>Regression Analysis: A Constructive Critique</source><publisher>Newbury Park, CA: Sage</publisher><pubdate>2004</pubdate></bibl><bibl id="B23"><title><p>An overview of methods for causal inference from observational studies</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Applied Bayesian modeling and causal inference from an incomplete-data perspective</source><publisher>New York City: John Wiley &amp; Sons</publisher><editor>Gelman A, Meng XL</editor><pubdate>2004</pubdate></bibl><bibl id="B24"><title><p>Measures of effect and association</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Rothman</snm><fnm>KJ</fnm></au><au><snm>Lash</snm><fnm>TL</fnm></au></aug><source>Modern Epidemiology</source><publisher>Philadelphia, PA: Lippincott Williams &amp; Wilkins</publisher><editor>Rothman KJ, Greenland S, Lash TL</editor><pubdate>2008</pubdate></bibl><bibl id="B25"><aug><au><snm>Pearl</snm><fnm>J</fnm></au></aug><source>Causality</source><publisher>Cambridge: Cambridge University Press</publisher><edition>2</edition><pubdate>2009</pubdate></bibl><bibl id="B26"><title><p>Causal analysis in the health sciences</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Journal of the American Statistical Association</source><pubdate>2000</pubdate><volume>95</volume><fpage>286</fpage><lpage>289</lpage><xrefbib><pubid idtype="doi">10.2307/2669548</pubid></xrefbib></bibl><bibl id="B27"><title><p>On the z-test in randomized blocks and Latin squares</p></title><aug><au><snm>Welch</snm><fnm>BL</fnm></au></aug><source>Biometrika</source><pubdate>1937</pubdate><volume>29</volume><fpage>21</fpage><lpage>52</lpage></bibl><bibl id="B28"><title><p>Randomization models for the matched and unmatched 2 &#215; 2 tables</p></title><aug><au><snm>Copas</snm><fnm>JB</fnm></au></aug><source>Biometrika</source><pubdate>1973</pubdate><volume>60</volume><fpage>467</fpage><lpage>476</lpage></bibl><bibl id="B29"><title><p>Estimating causal effects of treatments in randomized and nonrandomized treatments</p></title><aug><au><snm>Rubin</snm><fnm>DB</fnm></au></aug><source>J Educ Psychol</source><pubdate>1974</pubdate><volume>66</volume><fpage>688</fpage><lpage>701</lpage><xrefbib><pubid idtype="doi">10.1037/h0037350</pubid></xrefbib></bibl><bibl id="B30"><title><p>Bayesian inference for causal effects: the role of randomization</p></title><aug><au><snm>Rubin</snm><fnm>DB</fnm></au></aug><source>Ann Stat</source><pubdate>1978</pubdate><volume>6</volume><fpage>34</fpage><lpage>58</lpage><xrefbib><pubid idtype="doi">10.1214/aos/1176344064</pubid></xrefbib></bibl><bibl id="B31"><title><p>The randomization analysis of a generalized randomized block design</p></title><aug><au><snm>Wilk</snm><fnm>M</fnm></au></aug><source>Biometrika</source><pubdate>1955</pubdate><volume>42</volume><fpage>70</fpage><lpage>79</lpage></bibl><bibl id="B32"><aug><au><snm>Rosenbaum</snm><fnm>PR</fnm></au></aug><source>Observational Studies</source><publisher>New York City: Springer</publisher><edition>2</edition><pubdate>2002</pubdate></bibl><bibl id="B33"><title><p>Randomization, statistics, and causal inference</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Epidemiology</source><pubdate>1990</pubdate><volume>1</volume><fpage>421</fpage><lpage>429</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/00001648-199011000-00003</pubid><pubid idtype="pmpid">2090279</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>On the logical justification of conditional tests for two-by-two-contingency tables</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>The American Statistician</source><pubdate>1991</pubdate><volume>45</volume><fpage>248</fpage><lpage>251</lpage><xrefbib><pubid idtype="doi">10.2307/2684304</pubid></xrefbib></bibl><bibl id="B35"><title><p>Interpretation and choice of effect measures in epidemiologic analyses</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Am J Epidemiol</source><pubdate>1987</pubdate><volume>125</volume><fpage>761</fpage><lpage>768</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">3551588</pubid></xrefbib></bibl><bibl id="B36"><title><p>Epidemiologic measures and policy formulation: lessons from potential outcomes</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Emerg Themes Epidemiol</source><pubdate>2005</pubdate><volume>2</volume><fpage>5</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1742-7622-2-5</pubid><pubid idtype="pmcid">1180843</pubid><pubid idtype="pmpid">15921514</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Invited commentary: hypothetical interventions to define causal effects--afterthought or prerequisite?</p></title><aug><au><snm>Hernan</snm><fnm>MA</fnm></au></aug><source>Am J Epidemiol</source><pubdate>2005</pubdate><volume>162</volume><fpage>618</fpage><lpage>620</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/aje/kwi255</pubid><pubid idtype="pmpid" link="fulltext">16120710</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Statistics and causal inference (with discussion)</p></title><aug><au><snm>Holland</snm><fnm>PW</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>1986</pubdate><volume>81</volume><fpage>945</fpage><lpage>970</lpage><xrefbib><pubid idtype="doi">10.2307/2289064</pubid></xrefbib></bibl><bibl id="B39"><title><p>A graphical approach to the identification and estimation of causal parameters in mortality studies with sustained exposure periods</p></title><aug><au><snm>Robins</snm><fnm>J</fnm></au></aug><source>J Chronic Dis</source><pubdate>1987</pubdate><volume>40</volume><issue>Suppl 2</issue><fpage>139S</fpage><lpage>161S</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0021-9681(87)80018-8</pubid><pubid idtype="pmpid">3667861</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>The role of model selection in causal inference from nonexperimental data</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Am J Epidemiol</source><pubdate>1986</pubdate><volume>123</volume><fpage>392</fpage><lpage>402</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">3946386</pubid></xrefbib></bibl><bibl id="B41"><title><p>Causal diagrams for epidemiologic research</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Pearl</snm><fnm>J</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Epidemiology</source><pubdate>1999</pubdate><volume>10</volume><fpage>37</fpage><lpage>48</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/00001648-199901000-00008</pubid><pubid idtype="pmpid" link="fulltext">9888278</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology</p></title><aug><au><snm>Hernan</snm><fnm>MA</fnm></au><au><snm>Hernandez-Diaz</snm><fnm>S</fnm></au><au><snm>Werler</snm><fnm>MM</fnm></au><au><snm>Mitchell</snm><fnm>AA</fnm></au></aug><source>Am J Epidemiol</source><pubdate>2002</pubdate><volume>155</volume><fpage>176</fpage><lpage>184</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/aje/155.2.176</pubid><pubid idtype="pmpid" link="fulltext">11790682</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>A structural approach to selection bias</p></title><aug><au><snm>Hernan</snm><fnm>MA</fnm></au><au><snm>Hernandez-Diaz</snm><fnm>S</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Epidemiology</source><pubdate>2004</pubdate><volume>15</volume><fpage>615</fpage><lpage>625</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/01.ede.0000135174.63482.43</pubid><pubid idtype="pmpid" link="fulltext">15308962</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Confounding and misclassification</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Am J Epidemiol</source><pubdate>1985</pubdate><volume>122</volume><fpage>495</fpage><lpage>506</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">4025298</pubid></xrefbib></bibl><bibl id="B45"><title><p>Process analysis: Estimating mediation in treatment evaluations</p></title><aug><au><snm>Judd</snm><fnm>CM</fnm></au><au><snm>Kenny</snm><fnm>DA</fnm></au></aug><source>Evaluation Review</source><pubdate>1981</pubdate><volume>5</volume><fpage>602</fpage><lpage>619</lpage><xrefbib><pubid idtype="doi">10.1177/0193841X8100500502</pubid></xrefbib></bibl><bibl id="B46"><title><p>Identifiability and exchangeability for direct and indirect effects</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Epidemiology</source><pubdate>1992</pubdate><volume>3</volume><fpage>143</fpage><lpage>155</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/00001648-199203000-00013</pubid><pubid idtype="pmpid">1576220</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Graphs, causality, and structural equation models</p></title><aug><au><snm>Pearl</snm><fnm>J</fnm></au></aug><source>Sociological Methods Research</source><pubdate>1998</pubdate><volume>27</volume><fpage>226</fpage><lpage>284</lpage><xrefbib><pubid idtype="doi">10.1177/0049124198027002004</pubid></xrefbib></bibl><bibl id="B48"><title><p>Fallibility in estimating direct effects</p></title><aug><au><snm>Cole</snm><fnm>SR</fnm></au><au><snm>Hernan</snm><fnm>MA</fnm></au></aug><source>Int J Epidemiol</source><pubdate>2002</pubdate><volume>31</volume><fpage>163</fpage><lpage>165</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/ije/31.1.163</pubid><pubid idtype="pmpid" link="fulltext">11914314</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Causal Diagrams</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Pearl</snm><fnm>J</fnm></au></aug><source>Encyclopedia of Epidemiology</source><publisher>Thousand Oaks, CA: Sage Publications</publisher><editor>Boslaugh S</editor><pubdate>2007</pubdate><fpage>149</fpage><lpage>156</lpage></bibl><bibl id="B50"><title><p>Causal diagrams</p></title><aug><au><snm>Glymour</snm><fnm>MM</fnm></au><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Modern Epidemiology</source><publisher>Philadelphia, PA: Lippincott Williams &amp; Wilkins</publisher><editor>Rothman KJ, Greenland S, Lash TL</editor><pubdate>2008</pubdate></bibl><bibl id="B51"><title><p>Causal diagrams for empirical research</p></title><aug><au><snm>Pearl</snm><fnm>J</fnm></au></aug><source>Biometrika</source><pubdate>1995</pubdate><volume>82</volume><fpage>669</fpage><lpage>710</lpage><xrefbib><pubid idtype="doi">10.1093/biomet/82.4.669</pubid></xrefbib></bibl><bibl id="B52"><title><p>Quantifying biases in causal models: classical confounding vs collider-stratification bias</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Epidemiology</source><pubdate>2003</pubdate><volume>14</volume><fpage>300</fpage><lpage>306</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/00001648-200305000-00009</pubid><pubid idtype="pmpid" link="fulltext">12859030</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Invited commentary: variable selection versus shrinkage in the control of multiple confounders</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Am J Epidemiol</source><pubdate>2008</pubdate><volume>167</volume><fpage>523</fpage><lpage>529</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/aje/kwm355</pubid><pubid idtype="pmpid" link="fulltext">18227100</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Comment: Graphical models, causality, and intervention</p></title><aug><au><snm>Pearl</snm><fnm>J</fnm></au></aug><source>Stat Sci</source><pubdate>1993</pubdate><volume>8</volume><fpage>266</fpage><lpage>269</lpage><xrefbib><pubid idtype="doi">10.1214/ss/1177010894</pubid></xrefbib></bibl><bibl id="B55"><title><p>The foundations of confounding in epidemiology</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Morgenstern</snm><fnm>H</fnm></au></aug><source>Comp Math Appl</source><pubdate>1987</pubdate><volume>14</volume><fpage>869</fpage><lpage>916</lpage><xrefbib><pubid idtype="doi">10.1016/0898-1221(87)90236-7</pubid></xrefbib></bibl><bibl id="B56"><aug><au><snm>Ostle</snm><fnm>B</fnm></au></aug><source>Statistics in Research</source><publisher>Ames, Iowa: Iowa State University Press</publisher><edition>2</edition><pubdate>1963</pubdate></bibl><bibl id="B57"><title><p>The University Group Diabetes Program. A further statistical analysis of the mortality findings</p></title><aug><au><snm>Cornfield</snm><fnm>J</fnm></au></aug><source>JAMA</source><pubdate>1971</pubdate><volume>217</volume><fpage>1676</fpage><lpage>1687</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1001/jama.217.12.1676</pubid><pubid idtype="pmpid">4937386</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Recent methodological contributions to clinical trials</p></title><aug><au><snm>Cornfield</snm><fnm>J</fnm></au></aug><source>Am J Epidemiol</source><pubdate>1976</pubdate><volume>104</volume><fpage>408</fpage><lpage>421</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">788503</pubid></xrefbib></bibl><bibl id="B59"><title><p>Testing for baseline balance in clinical trials</p></title><aug><au><snm>Senn</snm><fnm>S</fnm></au></aug><source>Stat Med</source><pubdate>1994</pubdate><volume>13</volume><fpage>1715</fpage><lpage>1726</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/sim.4780131703</pubid><pubid idtype="pmpid">7997705</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>Estimating exposure effects by modelling the expectation of exposure conditional on confounders</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Mark</snm><fnm>SD</fnm></au><au><snm>Newey</snm><fnm>WK</fnm></au></aug><source>Biometrics</source><pubdate>1992</pubdate><volume>48</volume><fpage>479</fpage><lpage>495</lpage><xrefbib><pubidlist><pubid idtype="doi">10.2307/2532304</pubid><pubid idtype="pmpid">1637973</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><aug><au><snm>Fisher</snm><fnm>RA</fnm></au></aug><source>The Design of Experiments</source><publisher>Edinburgh, Scotland: Oliver and Boyd</publisher><pubdate>1935</pubdate></bibl><bibl id="B62"><title><p>Confidence intervals for causal parameters</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Stat Med</source><pubdate>1988</pubdate><volume>7</volume><fpage>773</fpage><lpage>785</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/sim.4780070707</pubid><pubid idtype="pmpid">3406605</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><aug><au><snm>de Finetti</snm><fnm>B</fnm></au></aug><source>The Theory of Probability</source><publisher>New York: John Wiley &amp; Sons</publisher><pubdate>1974</pubdate><volume>I</volume></bibl><bibl id="B64"><title><p>Problems due to small samples and sparse data in conditional logistic regression analysis</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au><au><snm>Schwartzbaum</snm><fnm>JA</fnm></au><au><snm>Finkle</snm><fnm>WD</fnm></au></aug><source>Am J Epidemiol</source><pubdate>2000</pubdate><volume>151</volume><fpage>531</fpage><lpage>539</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10707923</pubid></xrefbib></bibl><bibl id="B65"><title><p>False models and post-data model construction</p></title><aug><au><snm>Leamer</snm><fnm>EE</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>1974</pubdate><volume>69</volume><fpage>122</fpage><lpage>131</lpage><xrefbib><pubid idtype="doi">10.2307/2285510</pubid></xrefbib></bibl><bibl id="B66"><title><p>When should epidemiologic regressions use random coefficients?</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Biometrics</source><pubdate>2000</pubdate><volume>56</volume><fpage>915</fpage><lpage>921</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.0006-341X.2000.00915.x</pubid><pubid idtype="pmpid">10985237</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>The performance of random coefficient regression in accounting for residual confounding</p></title><aug><au><snm>Gustafson</snm><fnm>P</fnm></au><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Biometrics</source><pubdate>2006</pubdate><volume>62</volume><fpage>760</fpage><lpage>768</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1541-0420.2005.00510.x</pubid><pubid idtype="pmpid" link="fulltext">16984318</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Principles of multilevel modelling</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Int J Epidemiol</source><pubdate>2000</pubdate><volume>29</volume><fpage>158</fpage><lpage>167</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/ije/29.1.158</pubid><pubid idtype="pmpid" link="fulltext">10750618</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>Multilevel modeling and model averaging</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Scand J Work Environ Health</source><pubdate>1999</pubdate><volume>25</volume><issue>Suppl 4</issue><fpage>43</fpage><lpage>48</lpage><xrefbib><pubid idtype="pmpid">10628440</pubid></xrefbib></bibl><bibl id="B70"><title><p>Multiple-bias modelling for analysis of observational data</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>J R Stat Soc Series A</source><pubdate>2005</pubdate><volume>168</volume><fpage>267</fpage><lpage>306</lpage><xrefbib><pubid idtype="doi">10.1111/j.1467-985X.2004.00349.x</pubid></xrefbib></bibl><bibl id="B71"><title><p>Bayesian perspectives for epidemiologic research. III. Bias analysis via missing-data methods</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Int J Epidemiol</source><pubdate>2009</pubdate><inpress/></bibl><bibl id="B72"><title><p>Bayesian sensitivity analysis for unmeasured confounding in observational studies</p></title><aug><au><snm>McCandless</snm><fnm>LC</fnm></au><au><snm>Gustafson</snm><fnm>P</fnm></au><au><snm>Levy</snm><fnm>A</fnm></au></aug><source>Stat Med</source><pubdate>2007</pubdate><volume>26</volume><fpage>2331</fpage><lpage>2347</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/sim.2711</pubid><pubid idtype="pmpid" link="fulltext">16998821</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Relaxation penalties and priors for plausible modeling of nonidentified bias sources</p></title><aug><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>Statistical Science</source><pubdate>2010</pubdate><inpress/></bibl><bibl id="B74"><title><p>G-estimation of the effect of prophylaxis therapy for Pneumocystis carinii pneumonia on the survival of AIDS patients</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Blevins</snm><fnm>D</fnm></au><au><snm>Ritter</snm><fnm>G</fnm></au><au><snm>Wulfsohn</snm><fnm>M</fnm></au></aug><source>Epidemiology</source><pubdate>1992</pubdate><volume>3</volume><fpage>319</fpage><lpage>336</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/00001648-199207000-00007</pubid><pubid idtype="pmpid">1637895</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Marginal structural models and causal inference in epidemiology</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Hernan</snm><fnm>MA</fnm></au><au><snm>Brumback</snm><fnm>B</fnm></au></aug><source>Epidemiology</source><pubdate>2000</pubdate><volume>11</volume><fpage>550</fpage><lpage>560</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/00001648-200009000-00011</pubid><pubid idtype="pmpid" link="fulltext">10955408</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Correcting for non-compliance in randomized trials using rank preserving structural failure time models</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Tsiatis</snm><fnm>AA</fnm></au></aug><source>Commun Stat</source><pubdate>1991</pubdate><volume>20</volume><fpage>2609</fpage><lpage>2631</lpage><xrefbib><pubid idtype="doi">10.1080/03610929108830654</pubid></xrefbib></bibl><bibl id="B77"><title><p>Adjusting for differential rates of prophylaxis therapy for PCP in high versus low dose AZT treatment arms in an AIDS randomized trial</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Greenland</snm><fnm>S</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>1994</pubdate><volume>89</volume><fpage>737</fpage><lpage>749</lpage><xrefbib><pubid idtype="doi">10.2307/2290899</pubid></xrefbib></bibl><bibl id="B78"><title><p>Estimation of direct causal effects</p></title><aug><au><snm>Petersen</snm><fnm>ML</fnm></au><au><snm>Sinisi</snm><fnm>SE</fnm></au><au><snm>Laan</snm><mnm>van der</mnm><fnm>MJ</fnm></au></aug><source>Epidemiology</source><pubdate>2006</pubdate><volume>17</volume><fpage>276</fpage><lpage>284</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/01.ede.0000208475.99429.2d</pubid><pubid idtype="pmpid" link="fulltext">16617276</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>Marginal structural models for the estimation of direct and indirect effects</p></title><aug><au><snm>VanderWeele</snm><fnm>TJ</fnm></au></aug><source>Epidemiology</source><pubdate>2009</pubdate><volume>20</volume><fpage>18</fpage><lpage>26</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/EDE.0b013e31818f69ce</pubid><pubid idtype="pmpid" link="fulltext">19234398</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Rotnitzky</snm><fnm>A</fnm></au><au><snm>Scharfstein</snm><fnm>DO</fnm></au></aug><source>Statistical models in epidemiology</source><publisher>New York City: Springer-Verlag</publisher><editor>Halloran ME, Berry DA</editor><pubdate>1999</pubdate><fpage>1</fpage><lpage>92</lpage></bibl><bibl id="B81"><title><p>Sensitivity analyses for unmeasured confounding assuming a marginal structural model for repeated measures</p></title><aug><au><snm>Brumback</snm><fnm>BA</fnm></au><au><snm>Hernan</snm><fnm>MA</fnm></au><au><snm>Haneuse</snm><fnm>SJ</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Stat Med</source><pubdate>2004</pubdate><volume>23</volume><fpage>749</fpage><lpage>767</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/sim.1657</pubid><pubid idtype="pmpid" link="fulltext">14981673</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>Causal directed acyclic graphs and the direction of unmeasured confounding bias</p></title><aug><au><snm>VanderWeele</snm><fnm>TJ</fnm></au><au><snm>Hernan</snm><fnm>MA</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Epidemiology</source><pubdate>2008</pubdate><volume>19</volume><fpage>720</fpage><lpage>728</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/EDE.0b013e3181810e29</pubid><pubid idtype="pmpid" link="fulltext">18633331</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Ritov</snm><fnm>Y</fnm></au></aug><source>Stat Med</source><pubdate>1997</pubdate><volume>16</volume><fpage>285</fpage><lpage>319</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1097-0258(19970215)16:3&lt;285::AID-SIM535&gt;3.0.CO;2-#</pubid><pubid idtype="pmpid" link="fulltext">9004398</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>Conditioning, likelihood, and coherence: A review of some foundational concepts</p></title><aug><au><snm>Robins</snm><fnm>JM</fnm></au><au><snm>Wasserman</snm><fnm>L</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>2000</pubdate><volume>95</volume><fpage>1340</fpage><lpage>1346</lpage><xrefbib><pubid idtype="doi">10.2307/2669784</pubid></xrefbib></bibl><bibl id="B85"><aug><au><snm>Laan</snm><mnm>van der</mnm><fnm>M</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Unified methods for censored longitudinal data and causality</source><publisher>New York City: Springer</publisher><pubdate>2003</pubdate></bibl><bibl id="B86"><title><p>Doubly robust estimation in missing data and causal inference models</p></title><aug><au><snm>Bang</snm><fnm>H</fnm></au><au><snm>Robins</snm><fnm>JM</fnm></au></aug><source>Biometrics</source><pubdate>2005</pubdate><volume>61</volume><fpage>962</fpage><lpage>973</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1541-0420.2005.00377.x</pubid><pubid idtype="pmpid" link="fulltext">16401269</pubid></pubidlist></xrefbib></bibl><bibl id="B87"><title><p>Comment: Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data</p></title><aug><au><snm>Ridgeway</snm><fnm>G</fnm></au><au><snm>McCaffrey</snm><fnm>D</fnm></au></aug><source>Stat Sci</source><pubdate>2007</pubdate><volume>22</volume><fpage>540</fpage><lpage>543</lpage><xrefbib><pubid idtype="doi">10.1214/07-STS227C</pubid></xrefbib></bibl></refgrp>
</bm>
</art>