NETS 7976 - Causal Inference with Network Interference - Fall 2024
Tuesdays: 2:30 – 4:30pm
September 5 – December 10, 2024
177 Huntington, room 1008
Summary
This class is an introduction to the measurement of causality in complex networks. It builds on the fundamental foundations of causal inference, drawing on techniques in statistics and econometrics and applying them to a network setting. These techniques are essential in today's scientific landscape, where the study of large systems is best modeled as complex networks. Furthermore, rigorous methods to establish causality in networks have critical implications for the design of interventions in complex systems. Students will leave the class with a foundational understanding of causal inference in networks and a survey of the state of the art in the field. Particular emphasis will be placed on epistemic uncertainty about the structure of interference and the sensitivity of existing techniques to these choices. Students completing the course should feel confident approaching the forefront of research in the field and applying these techniques to their own work.
This class is designed in collaboration with Erik Weis, a PhD student in Network Science; it draws from a previous Network Science Special Topics course taught in 2020 by Prof. Donghee Jo, NETS 7983 -- Causal Analysis: Emphasis on Network Settings.
PDF of this syllabus: here.
Coursework, Class Structure, Grading
The course adopts a flipped classroom concept: Students are exposed to the relevant material through readings and outside lectures/assignments. The in-class hours are dedicated to recapitulation of the most important points (via student-prepared presentations), clarification of questions, and open discussion. The content of the course is organized as follows. Weeks 1-5 are dedicated to an in-depth review of the fundamentals of causal inference. Weeks 6-10 are dedicated to a recapitulation of the ideas in the first phase but in the context of networks. Weeks 11-14 cover special topics and recent research, as well as completion of a project. The final class will be presentations of final projects. Evaluation is as follows:
Active participation in weekly meetings and completion of weekly deliverables (70%): Students produce a deliverable for each class (e.g. a presentation, code, or writeup), designed to demonstrate honest effort and engagement with the material. Students should also actively participate in discussions by posing relevant questions, attempting to answer other students' questions, and connecting the material to real-world problems, e.g. from their own field of research.
Final project (30%): The final project is to produce a full research proposal for a causal study in which network spillover effects play a central role. Students should draw on the toolbox of methods discussed throughout the course to design a methodology suited to answer the question. Students should clearly specify the assumptions made by their chosen method and justify their use in the particular application. This justification should include an extensive literature review. Finally, they should approach the inference problem pragmatically by considering the feasibility of any additional data collection or experimentation required by the proposed method. Students will give a presentation on the last day of class and submit a final report.
Learning Objectives and Outcomes
By the end of this course, students should have a deep familiarity with causal inference techniques in both network and non-network settings, including parametric and nonparametric techniques. From this, students will be able to articulate the assumptions in a wide variety of causal inference techniques and identify when these assumptions are appropriate for the problem at hand. Finally, students will be able to approach the forefront of causal inference on networks, with particular emphasis on the intersection of causal discovery and estimating unknown interference. Students should leave the course with the necessary knowledge to apply causal inference to their own network datasets and problems. The course is designed to survey the leading edge of the causal inference field, while being:
Current: While the courses begin with foundational concepts in causal inference, the final portion of the course will provide exposure to the latest research on causal inference in networks.
Practical: Coding assignments lead students to develop hands-on experience with causal inference techniques by using existing software to reproduce existing results.
Actionable: The final project allows students to apply their foundational understanding by tailoring the various techniques at their disposal to fit a particular problem.
Materials
There are no required materials to purchase for this course, but we rely on a range of published work on networks and causation, as well as basic textbooks on causal inference:
Scott Cunningham. Causal Inference: The Mixtape. Yale University Press, 2021.
Aleksander Molak. Causal Inference and Discovery in Python: Unlock the secrets of modern causal machine learning with DoWhy, EconML, PyTorch and more. Packt Publishing Ltd, 2023.
Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024.
Judea Pearl. Causality. 2nd ed. Cambridge University Press, 2009. isbn: 978-0-521-89560-6.
Guido W. Imbens and Donald B. Rubin. Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Cambridge University Press, 2015. isbn: 978-0-521-88588-1.
Instructor
Brennan Klein is an associate research scientist at the Network Science Institute, with a joint affiliation at the Institute for Experiential AI. He is the director of the Complexity & Society Lab. His research spans two broad topics: 1) Information, emergence, and inference in complex systems — developing tools and theory for characterizing dynamics, structure, and scale in networks, and 2) Public health and public safety — creating and analyzing large scale datasets that reveal inequalities in the United States, from epidemics to mass incarceration. Dr. Klein received a PhD in Network Science in 2020 from Northeastern University and got his BA in Cognitive Science & Psychology from Swarthmore College in 2014. Website: http://brennanklein.com/.
Week 1: Sep 5
Review: Probabilistic Modeling & Representations of Causality
Topics:
Chapter 1 - Causal Inference Mixtape: Introduction
Chapter 2 - Causal Inference Mixtape: Probability and Regression Review
Chapter 3 - Causal Inference Mixtape: Directed Acyclic Graphs
Chapter 4 - Causal Inference Mixtape: Potential Outcomes Causal Model
Supplementary Materials:
Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024 (Chp. 1, 2, 3, 9)
NETS 7983 Slides - Week 1: Course Introduction and Basics of Causal Analysis
NETS 7938 Slides - Week 2: Applications: Network-level RCTs, DAGs, & do-calculus
Week 2: Sep 10
Matching Methods
Topics:
Chapter 5 - Causal Inference Mixtape: Matching and Subclassification
Chapter 6 - Causal Inference Mixtape: Regression Discontinuity
Supplementary Materials:
Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024 (Chp. 4, 15)
NETS 7983 Slides - Week 3: DAGs Part II & Mediation Analysis
Week 3: Sep 17
Instrumental Variables and Panel Data
Topics:
Chapter 7 - Causal Inference Mixtape: Instrumental Variables
Chapter 8 - Causal Inference Mixtape: Panel Data
Supplementary Materials:
NETS 7983 Slides - Week 4: Mediation Analysis Part II & Instrumental Variables
Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024 (Chp. 16)
Week 4: Sep 24
Difference-in-Differences
Topics:
Chapter 9 - Causal Inference Mixtape: Difference-in-Differences
Supplementary Materials:
“Mixtape Sessions” from the Supplementary Materials of the Causal Inference Mixtape, including slides and code for running various DiD calculations
Week 5: Oct 1
Synthetic Controls
Topics:
Chapter 10 - Causal Inference Mixtape: Synthetic Control
Supplementary Materials:
Alberto Abadie, Alexis Diamond, and Jens Hainmueller. (2010). Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program. Journal of the American Statistical Association 105.490, pp. 493–505. doi: 10.1198/jasa.2009.ap08746.
NETS 7983 Slides - Week 5: Regression Discontinuity, Fixed Effects, & Bootstrap
Week 6: Oct 8
RCT Designs in Network Settings
Topics:
Peter M. Aronow (2012). A General Method for Detecting Interference Between Units in Randomized Experiments. Sociological Methods & Research 41.1 , pp. 3–16. doi: 10.1177/0049124112437535.
Peter M. Aronow and Cyrus Samii. (2017). Estimating Average Causal Effects under General Interference, with Application to a Social Network Experiment. The Annals of Applied Statistics 11.4, pp. 1912–1947. doi: 10.1214/16-AOAS1005.
Supplementary Materials:
NETS 7983 Slides - Week 6: Bootstrap Part II & Imbens/Rubin Framework
Week 7: Oct 15
Statistical Tests for Causality Under Interference
Topics:
Susan Athey, Dean Eckles, and Guido W. Imbens. (2018). Exact P-Values for Network Interference. Journal of the American Statistical Association 113.521, pp. 230–240. doi: 10.1080/01621459.2016.1241178.
Guillaume W. Basse, Avi Feller, and Panos Toulis. (2019). Randomization Tests of Causal Effects under Interference. Biometrika 106.2, pp. 487–494. doi: 10.1093/biomet/asy072.
Supplementary Materials:
NETS 7983 Slides - Week 7: Imbens/Rubin Framework Part II & Exact-p for Network Analysis
NETS 7983 Slides - Week 8: Exact-p for Network Analysis Part II: Recent Progress
Week 8: Oct 22
Estimator Adjustments Due to Interference
Topics:
Alex Chin. (2019). Regression Adjustments for Estimating the Global Treatment Effect in Experiments with Interference. Journal of Causal Inference 7.2. doi: 10.1515/jci-2018-0026.
Mayleen Cortez-Rodriguez, Matthew Eichhorn, and Christina Lee Yu (2024). Combining Rollout Designs and Clustering for Causal Inference under Low-order Interference. arXiv. doi: 10.48550/arXiv.2405.05119.
Supplementary Materials:
Russell Lyons (2011). The Spread of Evidence-Poor Medicine via Flawed Social-Network Analysis. Statistics, Politics, and Policy 2.1 (2011). doi: 10.2202/2151-7509.1024.
Week 9: Oct 29
Graph Cluster Randomization: Designing Better Experiments Under Interference
Topics:
Johan Ugander, Brian Karrer, Lars Backstrom, and Jon Kleinberg. (2013). Graph Cluster Randomization: Network Exposure to Multiple Universes. Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ’13. New York, NY, USA: Association for Computing Machinery, 2013, pp. 329–337. doi: 10.1145/2487575.2487695.
Supplementary Materials:
Johan Ugander and Hao Yin. (2023). Randomized Graph Cluster Randomization. Journal of Causal Inference 11.1. issn: 2193-3685. doi: 10.1515/jci-2022-0014.
Week 10: Nov 5
Causality in Network Settings: A Debate
Topics:
Nicholas A. Christakis and James H. Fowler. (2007). The Spread of Obesity in a Large Social Network over 32 Years. New England Journal of Medicine 357.4, pp. 370–379. doi: 10.1056/NEJMsa066082.
Ethan Cohen-Cole and Jason M. Fletcher. (2008). Is Obesity Contagious? Social Networks vs. Environmental Factors in the Obesity Epidemic. Journal of Health Economics 27.5, pp. 1382–1387. doi: 10.1016/j.jhealeco.2008.04.005.
James H. Fowler and Nicholas A. Christakis. (2008). Estimating Peer Effects on Health in Social Networks: A Response to Cohen-Cole and Fletcher; Trogdon, Nonnemaker, Pais. Journal of Health Economics 27.5, pp. 1400–1405. doi: 10.1016/j.jhealeco.2008.07.001.
Ethan Cohen-Cole and Jason M. Fletcher. (2008). Estimating Peer Effects in Health Outcomes: Replies and Corrections to Fowler and Christakis. SSRN Scholarly Paper. Rochester, NY. doi: 10.2139/ssrn.1262249.
Supplementary Materials:
Russell Lyons. (2011). The Spread of Evidence-Poor Medicine via Flawed Social-Network Analysis. Statistics, Politics, and Policy 2.1. doi: 10.2202/2151-7509.1024.
Week 11: Nov 12
Special Topics Pt. I: Causal Discovery
Topics:
Alessio Zanga and Fabio Stella. (2023). A Survey on Causal Discovery: Theory and Practice. arXiv. doi: 10.48550/arXiv.2305.10032.
Supplementary Materials:
Peter Martey Addo, Christelle Manibialoa, and Florent McIsaac. (2021). Exploring Nonlinearity on the CO2 Emissions, Economic Production and Energy Use Nexus: A Causal Discovery Approach. Energy Reports 7, pp. 6196–6204. doi: 10.1016/j.egyr.2021.09.026.
Justin J. Anker, Erich Kummerfeld, Alexander Rix, Scott J. Burwell, and Matt G. Kushner. (2019). Causal Network Modeling of the Determinants of Drinking Behavior in Comorbid Alcohol Use and Anxiety Disorder. Alcoholism, Clinical and Experimental Research 43.1, pp. 91–97. doi: 10.1111/acer.13914.
Week 12: Nov 19
Special Topics Pt. II: Spatial Causal Interference
Topics:
Brian J. Reich, Shu Yang, Yawen Guan, Andrew B. Giffin, Matthew J. Miller, and Ana Rappold. (2021). A Review of Spatial Causal Inference Methods for Environmental and Epidemiological Applications. International Statistical Review 89.3 (2021), pp. 605–634. issn: 1751-5823. doi: 10.1111/insr.12452.
Supplementary Materials:
Ye Wang, Cyrus Samii, Haoge Chang, and P. M. Aronow (2024). Design-Based Inference for Spatial Experiments under Unknown Interference. arXiv. doi: 10.48550/arXiv.2010.13599.
Serina Chang, Damir Vrabac, Jure Leskovec, and Johan Ugander. (2023). Estimating Geographic Spillover Effects of COVID-19 Policies from Large-Scale Mobility Networks. Proceedings of the AAAI Conference on Artificial Intelligence 37.12, pp. 14161–14169. issn: 2374-3468. doi: 10.1609/aaai.v37i12.26657.
Week 13: Nov 26
Thanksgiving - No Class
Week 14: Dec 3
Special Topics Pt. III: Unknown Network Interference
Topics:
Christina Lee Yu, Edoardo M. Airoldi, Christian Borgs, and Jennifer T. Chayes. (2022). Estimating the Total Treatment Effect in Randomized Experiments with Unknown Network Structure. Proceedings of the National Academy of Sciences 119.44, e2208975119. doi: 10.1073/pnas.2208975119.
Supplementary Materials:
Guillaume W. Basse and Edoardo M. Airoldi. (2018). Limitations of Design based Causal Inference and A/B Testing under Arbitrary and Network Interference. Sociological Methodology 48.1, pp. 136–151. issn: 0081-1750. doi: 10.1177/0081175018782569.
Week 15: Dec 10
Final Presentations
Students will present their summative assessment for the course.