NETS 7976 - Causal Inference with Network Interference - Fall 2024

Tuesdays: 2:30 – 4:30pm
September 5 – December 10, 2024
177 Huntington, room 1008

Summary

This class is an introduction to the measurement of causality in complex networks. It builds on the fundamental foundations of causal inference, drawing on techniques in statistics and econometrics and applying them to a network setting. These techniques are essential in today's scientific landscape, where the study of large systems is best modeled as complex networks. Furthermore, rigorous methods to establish causality in networks have critical implications for the design of interventions in complex systems. Students will leave the class with a foundational understanding of causal inference in networks and a survey of the state of the art in the field. Particular emphasis will be placed on epistemic uncertainty about the structure of interference and the sensitivity of existing techniques to these choices. Students completing the course should feel confident approaching the forefront of research in the field and applying these techniques to their own work.

This class is designed in collaboration with Erik Weis, a PhD student in Network Science; it draws from a previous Network Science Special Topics course taught in 2020 by Prof. Donghee Jo, NETS 7983 -- Causal Analysis: Emphasis on Network Settings.

PDF of this syllabus: here.

Coursework, Class Structure, Grading

The course adopts a flipped classroom concept: Students are exposed to the relevant material through readings and outside lectures/assignments. The in-class hours are dedicated to recapitulation of the most important points (via student-prepared presentations), clarification of questions, and open discussion. The content of the course is organized as follows. Weeks 1-5 are dedicated to an in-depth review of the fundamentals of causal inference. Weeks 6-10 are dedicated to a recapitulation of the ideas in the first phase but in the context of networks. Weeks 11-14 cover special topics and recent research, as well as completion of a project. The final class will be presentations of final projects. Evaluation is as follows:

  • Active participation in weekly meetings and completion of weekly deliverables (70%): Students produce a deliverable for each class (e.g. a presentation, code, or writeup), designed to demonstrate honest effort and engagement with the material. Students should also actively participate in discussions by posing relevant questions, attempting to answer other students' questions, and connecting the material to real-world problems, e.g. from their own field of research.

  • Final project (30%): The final project is to produce a full research proposal for a causal study in which network spillover effects play a central role. Students should draw on the toolbox of methods discussed throughout the course to design a methodology suited to answer the question. Students should clearly specify the assumptions made by their chosen method and justify their use in the particular application. This justification should include an extensive literature review. Finally, they should approach the inference problem pragmatically by considering the feasibility of any additional data collection or experimentation required by the proposed method. Students will give a presentation on the last day of class and submit a final report.

Learning Objectives and Outcomes

By the end of this course, students should have a deep familiarity with causal inference techniques in both network and non-network settings, including parametric and nonparametric techniques. From this, students will be able to articulate the assumptions in a wide variety of causal inference techniques and identify when these assumptions are appropriate for the problem at hand. Finally, students will be able to approach the forefront of causal inference on networks, with particular emphasis on the intersection of causal discovery and estimating unknown interference. Students should leave the course with the necessary knowledge to apply causal inference to their own network datasets and problems. The course is designed to survey the leading edge of the causal inference field, while being:

  • Current: While the courses begin with foundational concepts in causal inference, the final portion of the course will provide exposure to the latest research on causal inference in networks.

  • Practical: Coding assignments lead students to develop hands-on experience with causal inference techniques by using existing software to reproduce existing results.

  • Actionable: The final project allows students to apply their foundational understanding by tailoring the various techniques at their disposal to fit a particular problem.

Materials

There are no required materials to purchase for this course, but we rely on a range of published work on networks and causation, as well as basic textbooks on causal inference:

Instructor

Brennan Klein is an associate research scientist at the Network Science Institute, with a joint affiliation at the Institute for Experiential AI. He is the director of the Complexity & Society Lab. His research spans two broad topics: 1) Information, emergence, and inference in complex systems — developing tools and theory for characterizing dynamics, structure, and scale in networks, and 2) Public health and public safety — creating and analyzing large scale datasets that reveal inequalities in the United States, from epidemics to mass incarceration. Dr. Klein received a PhD in Network Science in 2020 from Northeastern University and got his BA in Cognitive Science & Psychology from Swarthmore College in 2014. Website: http://brennanklein.com/.

Week 1: Sep 5

Review: Probabilistic Modeling & Representations of Causality

Topics:

Supplementary Materials:

  • Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024 (Chp. 1, 2, 3, 9)

  • NETS 7983 Slides - Week 1: Course Introduction and Basics of Causal Analysis

  • NETS 7938 Slides - Week 2: Applications: Network-level RCTs, DAGs, & do-calculus


Week 2: Sep 10

Matching Methods

Topics:

Supplementary Materials:

  • Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024 (Chp. 4, 15)

  • NETS 7983 Slides - Week 3: DAGs Part II & Mediation Analysis


Week 3: Sep 17

Instrumental Variables and Panel Data

Topics:

Supplementary Materials:

  • NETS 7983 Slides - Week 4: Mediation Analysis Part II & Instrumental Variables

  • Miguel A. Hernan and James M. Robins. Causal Inference: What If. CRC Press, 2024 (Chp. 16)


Week 4: Sep 24

Difference-in-Differences

Topics:

Supplementary Materials:


Week 5: Oct 1

Synthetic Controls

Topics:

Supplementary Materials:

  • Alberto Abadie, Alexis Diamond, and Jens Hainmueller. (2010). Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program. Journal of the American Statistical Association 105.490, pp. 493–505. doi: 10.1198/jasa.2009.ap08746.

  • NETS 7983 Slides - Week 5: Regression Discontinuity, Fixed Effects, & Bootstrap


Week 6: Oct 8

RCT Designs in Network Settings

Topics:

  • Peter M. Aronow (2012). A General Method for Detecting Interference Between Units in Randomized Experiments. Sociological Methods & Research 41.1 , pp. 3–16. doi: 10.1177/0049124112437535.

  • Peter M. Aronow and Cyrus Samii. (2017). Estimating Average Causal Effects under General Interference, with Application to a Social Network Experiment. The Annals of Applied Statistics 11.4, pp. 1912–1947. doi: 10.1214/16-AOAS1005.

Supplementary Materials:

  • NETS 7983 Slides - Week 6: Bootstrap Part II & Imbens/Rubin Framework


Week 7: Oct 15

Statistical Tests for Causality Under Interference

Topics:

  • Susan Athey, Dean Eckles, and Guido W. Imbens. (2018). Exact P-Values for Network Interference. Journal of the American Statistical Association 113.521, pp. 230–240. doi: 10.1080/01621459.2016.1241178.

  • Guillaume W. Basse, Avi Feller, and Panos Toulis. (2019). Randomization Tests of Causal Effects under Interference. Biometrika 106.2, pp. 487–494. doi: 10.1093/biomet/asy072.

Supplementary Materials:

  • NETS 7983 Slides - Week 7: Imbens/Rubin Framework Part II & Exact-p for Network Analysis

  • NETS 7983 Slides - Week 8: Exact-p for Network Analysis Part II: Recent Progress


Week 8: Oct 22

Estimator Adjustments Due to Interference

Topics:

  • Alex Chin. (2019). Regression Adjustments for Estimating the Global Treatment Effect in Experiments with Interference. Journal of Causal Inference 7.2. doi: 10.1515/jci-2018-0026.

  • Mayleen Cortez-Rodriguez, Matthew Eichhorn, and Christina Lee Yu (2024). Combining Rollout Designs and Clustering for Causal Inference under Low-order Interference. arXiv. doi: 10.48550/arXiv.2405.05119.

Supplementary Materials:

  • Russell Lyons (2011). The Spread of Evidence-Poor Medicine via Flawed Social-Network Analysis. Statistics, Politics, and Policy 2.1 (2011). doi: 10.2202/2151-7509.1024.


Week 9: Oct 29

Graph Cluster Randomization: Designing Better Experiments Under Interference

Topics:

  • Johan Ugander, Brian Karrer, Lars Backstrom, and Jon Kleinberg. (2013). Graph Cluster Randomization: Network Exposure to Multiple Universes. Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ’13. New York, NY, USA: Association for Computing Machinery, 2013, pp. 329–337. doi: 10.1145/2487575.2487695.

Supplementary Materials:

  • Johan Ugander and Hao Yin. (2023). Randomized Graph Cluster Randomization. Journal of Causal Inference 11.1. issn: 2193-3685. doi: 10.1515/jci-2022-0014.


Week 10: Nov 5

Causality in Network Settings: A Debate

Topics:

  • Nicholas A. Christakis and James H. Fowler. (2007). The Spread of Obesity in a Large Social Network over 32 Years. New England Journal of Medicine 357.4, pp. 370–379. doi: 10.1056/NEJMsa066082.

  • Ethan Cohen-Cole and Jason M. Fletcher. (2008). Is Obesity Contagious? Social Networks vs. Environmental Factors in the Obesity Epidemic. Journal of Health Economics 27.5, pp. 1382–1387. doi: 10.1016/j.jhealeco.2008.04.005.

  • James H. Fowler and Nicholas A. Christakis. (2008). Estimating Peer Effects on Health in Social Networks: A Response to Cohen-Cole and Fletcher; Trogdon, Nonnemaker, Pais. Journal of Health Economics 27.5, pp. 1400–1405. doi: 10.1016/j.jhealeco.2008.07.001.

  • Ethan Cohen-Cole and Jason M. Fletcher. (2008). Estimating Peer Effects in Health Outcomes: Replies and Corrections to Fowler and Christakis. SSRN Scholarly Paper. Rochester, NY. doi: 10.2139/ssrn.1262249.

Supplementary Materials:

  • Russell Lyons. (2011). The Spread of Evidence-Poor Medicine via Flawed Social-Network Analysis. Statistics, Politics, and Policy 2.1. doi: 10.2202/2151-7509.1024.


Week 11: Nov 12

Special Topics Pt. I: Causal Discovery

Topics:

  • Alessio Zanga and Fabio Stella. (2023). A Survey on Causal Discovery: Theory and Practice. arXiv. doi: 10.48550/arXiv.2305.10032.

Supplementary Materials:

  • Peter Martey Addo, Christelle Manibialoa, and Florent McIsaac. (2021). Exploring Nonlinearity on the CO2 Emissions, Economic Production and Energy Use Nexus: A Causal Discovery Approach. Energy Reports 7, pp. 6196–6204. doi: 10.1016/j.egyr.2021.09.026.

  • Justin J. Anker, Erich Kummerfeld, Alexander Rix, Scott J. Burwell, and Matt G. Kushner. (2019). Causal Network Modeling of the Determinants of Drinking Behavior in Comorbid Alcohol Use and Anxiety Disorder. Alcoholism, Clinical and Experimental Research 43.1, pp. 91–97. doi: 10.1111/acer.13914.


Week 12: Nov 19

Special Topics Pt. II: Spatial Causal Interference

Topics:

  • Brian J. Reich, Shu Yang, Yawen Guan, Andrew B. Giffin, Matthew J. Miller, and Ana Rappold. (2021). A Review of Spatial Causal Inference Methods for Environmental and Epidemiological Applications. International Statistical Review 89.3 (2021), pp. 605–634. issn: 1751-5823. doi: 10.1111/insr.12452.

Supplementary Materials:

  • Ye Wang, Cyrus Samii, Haoge Chang, and P. M. Aronow (2024). Design-Based Inference for Spatial Experiments under Unknown Interference. arXiv. doi: 10.48550/arXiv.2010.13599.

  • Serina Chang, Damir Vrabac, Jure Leskovec, and Johan Ugander. (2023). Estimating Geographic Spillover Effects of COVID-19 Policies from Large-Scale Mobility Networks. Proceedings of the AAAI Conference on Artificial Intelligence 37.12, pp. 14161–14169. issn: 2374-3468. doi: 10.1609/aaai.v37i12.26657.


Week 13: Nov 26

Thanksgiving - No Class


Week 14: Dec 3

Special Topics Pt. III: Unknown Network Interference

Topics:

  • Christina Lee Yu, Edoardo M. Airoldi, Christian Borgs, and Jennifer T. Chayes. (2022). Estimating the Total Treatment Effect in Randomized Experiments with Unknown Network Structure. Proceedings of the National Academy of Sciences 119.44, e2208975119. doi: 10.1073/pnas.2208975119.

Supplementary Materials:

  • Guillaume W. Basse and Edoardo M. Airoldi. (2018). Limitations of Design based Causal Inference and A/B Testing under Arbitrary and Network Interference. Sociological Methodology 48.1, pp. 136–151. issn: 0081-1750. doi: 10.1177/0081175018782569.


Week 15: Dec 10

Final Presentations

Students will present their summative assessment for the course.