Soc 505: Causal Inference in the Social Sciences
Sociology 505: Causal Inference in the Social Sciences
Instructor: Glenn Shafer gshafer@soil
Although it is the goal of most statistical investigation, causal
inference has traditionally been ignored by statistical theory.
Fortunately, there is now intense activity in a number of fields, including
sociology, psychology, econometrics, philosophy, and artificial
intelligence, aimed at correcting this situation. This course will sample
broadly from the burgeoning literature on causality in these fields.
Topics will include the problem of selectivity, the use of concomitants,
the causal interpretation of independence, the role of randomized
experiments, and path analysis.
Each student will be expected to lead the discussion of at least
one paper, but the instructor will lecture on basic topics and on
relatively technical papers.
Week 1. Introduction
How can causal statements be justified in the social sciences? How
far can regression and other statistical methods go towards justifying
causal claims, and how valid is the run-of-the mill use of these methods in
sociology? These readings provide a good starting point for addressing
these general questions. Marini and Singer survey the philosophical
literature on causality and discuss its application to the social sciences.
Freedman and his discussants look at the state of the art in sociology.
Lieberson's excellent book looks at the shortcomings of statistical
methodology in sociology and suggests some alternatives.
- Freedman, David A. (1991). "Statistical Models and Shoe Leather." With
discussion by Richard A. Berk, Hubert M. Blalock, Jr., William M. Mason.
Pp. 291-358 in Sociological Methodology 1991.
- Lieberson, Stanley (1985). Making It Count: The Improvement of Social
Research and Theory. University of California Press.
- Marini, M.M., and B. Singer (1988). "Causality in the Social Sciences."
Pp. 347-409 in Sociological Methodology 1988.
Week 2. The Fallacy of Observational Control
"Correlation cannot prove causation." "Observational studies
cannot substitute for experiments." "Statistical adjustment can do more
harm than good when we are trying to understand causal relations." These
strictures on the use of statistical evidence are more often repeated than
understood. This week of reading takes a closer look, beginning with a
close look at the more technical chapters of Lieberson's book.
- Lieberson, Stanley (1985). Making It Count: The Improvement of Social
Research and Theory. University of California Press. Chapters 2 & 6.
- Shafer, Glenn (1995). The Art of Causal Conjecture. MIT Press. Chapters
1 & 14.
- Wainer, Howard (1991). "Adjusting for Differential Base Rates: Lord's
Paradox Again." Psychological Bulletin. 109 147-151.
Week 3. Strategies for Studying Selectivity
This week's readings provide useful case studies of how social
scientists and epidemiologists struggle with the problem of selectivity.
Goldman analyzes flaws in arguments used to assess whether marriage or high
socio-economic status are causes of the greater longevity with which they
are associated. Robins is concerned with the effect of long-term
occupational hazards, the study of which is often bedeviled by the "healthy
survivor effect": workers with greater exposure may be healthier because
less robust individuals were not able to stay on the job long enough to
acquire high levels of exposure.
- Goldman, Noreen (1994). "Social factors and health: The
causation-selection issue revisited." Proceedings of the National Academy
of Sciences USA 91, 1251-1255.
- Robins, James (1987). "A Graphical Approach to the Identification and
Estimation of Causal Parameters in Mortality Studies with Sustained
Exposure Periods." Journal of Chronic Diseases 40, Supplement 2 139S-161S.
Week 4. The Most Famous Example: Smoking and Lung Cancer
The example of smoking and lung cancer is cited so often in
discussions of causal inference from statistical evidence that anyone who
wants to participate in such discussions needs to know something about the
evidence against smoking and how it has been used. The 1959 paper by
Cornfield still provides the best overview of the evidence. Additional
perspective is provided by papers by Cook and Stolley, which examine the
role the famous statistician R. A. Fisher played in the controversy.
- Cook, R. Dennis (1980). "Smoking and Lung Cancer." R.A. Fisher: An
Appreciation, S.E. Fienberg and D.V. Hinkley, eds. Springer-Verlag Lecture
Notes in Statistics, No. 1, pp. 182-191.
- Cornfield, J., et al. (1959). "Smoking and Lung Cancer: Recent Evidence
and a Discussion of Some Questions." Journal of the National Cancer
Institute 22, 173-203.
- Stolley, Paul D. (1991). "When Genius Errs: R.A. Fisher and the Lung
Cancer Controversy." American Journal of Epidemiology 133, 416-425.
Week 5. Path Analysis
The graphical saliency of its causal interpretation makes path
analysis a dependable producer of debate on causality. This week's
readings begins with a debate led by David Freedman, which focuses both on
the practical use of path analysis models and their technical meaning. The
article by Kang and Seneta clarifies what is asserted by these models.
- Freedman, D.A. (1987). "As Others See Us: A Case Study in Path Analysis."
(With comments by Keith Hope, Christopher H. Achen, P.M. Bentler, Norman
- Cliff, John Fox, Samuel Karlin, Bengt O. Muthen, David Rogosa, Thomas J.
Rothenberg, Eugene Seneta, Herman Wold.) Journal of Educational Statistics
12 101-223.
- Kang, K.M., and Seneta, E. (1980). "Path analysis: An exposition." In
P.R. Krishnaiah (Ed.), Developments in Statistics, Vol. 3, 217-246.
Week 6. The Causal Interpretation of Conditional Independence
Causal models, including the simultaneous-equations models used in
econometrics and the path-analytic models used in sociology, are often
explained in terms of conditional independence or partial uncorrelatedness.
Conditional independence, it seems, indicates some absence of causal
connection. A closer look shows that an independence relation or a path
analysis model usually has more than one causal interpretation.
Consequently, causal claims for path analysis and other models usually need
to be made more specific before they can be subjected to real tests.
- Shafer, Glenn (1995). The Art of Causal Conjecture. MIT Press. Chapters 5-10.
- Sobel, Michael (1990). "Effect Analysis and Causation in Linear Structural
Equation Models." Psychometrika 55 495-515.
Week 7. The Rubin-Holland Interpretation of Causality
Don Rubin's interpretation of causality in terms of manipulation,
real or hypothetical, has been increasingly influential in the social
sciences. Here we look at Rubin's original essay and Paul Holland's widely
read summary of the approach.
- Holland, P.W. (1986). "Statistics and Causal Inference (with discussion)."
Journal of the American Statistical Association 81, 945-970.
- Rubin, Donald B. (1991). "Practical Implications of Modes of Statistical
Inference for Causal Effects and the Critical Role of the Assignment
Mechanism." Biometrics 47, 1213-1234.
Week 8. Structural Equation Models and Latent Variables
What is the meaning of latent variables in causal models? As this
week's readings will show, this question can be answered using either the
instructor's probability-tree picture or Rubin's counterfactual picture.
- Bollen, Kenneth A. (1989). Structural Equations with Latent Variables.
Wiley, 1989.
- Shafer, Glenn (1995). The Art of Causal Conjecture. MIT Press. Chapter 15.
- Sobel, Michael (1993). "Causal Inference in Statistical Models with Latent
Variables." Analysis of Latent Variables in Developmental Research, A. von
Eye and C.C. Clogg (eds).
Week 9. Conjecturing Causal Relations from Large Data Sets
Structural equation models are usually formulated a priori.
Statistical evidence may be used to choose among a small number of models,
but the emphasis is on testing and estimation. A more ambitious approach
has been formulated recently in artificial intelligence. In this approach,
models are inferred from data, using algorithms that search through a large
number of models in order to match the conditional independence or
uncorrelatedness relations observed in the data.
- Pearl, Judea, and Thomas Verma (1993). "A Theory of Inferred Causation."
Principles of Knowledge Representation and Reasoning: Proceeding of the
Second International Conference, J.A. Allen, R. Fikes, and E. Sandewall,
eds. Morgan Kaufmann, San Mateo, CA, pp. 441-452.
- Spirtes, Peter, Clark Glymour, and Richard Scheines (1993). Causation,
Prediction, and Search. Springer-Verlag Lecture Notes in Statistics, No.
81.
Week 10. Philosophical Accounts of Causal Explanation
Although there are many philosophical accounts of causality, those
in the Reichenbach tradition are closest to the concerns of social
sciences. Humphreys and Salmon, with their emphasis on causal explanation,
are especially relevant.
- Salmon, Wesley C. (1984). Scientific Explanation and the Causal Structure
of the World. Princeton University Press.
- Humphreys, Paul (1989). The Chances of Explanation. Princeton University
Press.
Week 11. Psychological Accounts of Causality
Within psychology, there are at least three distinct traditions
that have investigated causality in quite distinct ways. Developmental
psychologists have been concerned with how children develop judgments of
causality, behavioral psychologists with how adults judge contingency, and
social psychologists with how people make attributions. The following
articles represent a thin sampling from the developmental and social
psychology traditions.
- Hastie, Reid (1984). "Causes and Effects of Causal Attribution." Journal
of Personality and Social Psychology 46 44-56.
- Hilton, Denis J. (1990). "Conversational Processes and Causal
Explanation." Psychological Bulletin 107 65-81.
- Shultz, Thomas R., et al. (1986). "Selection of Causal Rules." Child
Development 57 143-152.
Week 12. Conclusion
Having taken our own tour of the literature on causal inference, we
now review what we have learned using surveys by two authorities, a
statistician and a sociologist.
- Cox, D.R. (1992). "Causality: Some Statistical Aspects." Journal of the
Royal Statistical Society, Series A 155 291-301.
- Sobel, Michael E. (1993). "Causal Inference in the Social and Behavioral
Sciences." In A Handbook for Statistical Modeling in the Social and
Behavioral Sciences, C.G. Clogg and M.E. Sobel, eds. Plenum Press.
sociolog@princeton.edu Jan '95