This course introduces experimental and quasi-experimental methods for causal inference that are widely used in a broad array of domains such as marketing and information systems. The focus is on delivering a breadth of substantive topics and methodological considerations that emerge in utilizing identification-oriented methods.
Throughout the course, we will discuss topics related to methods such as randomized controlled trials (RCT), difference-in-differences (DiD), matching methods such as propensity score matching (PSM) and coarsened exact matching (CEM), and more advanced topics such as regression discontinuity designs (RDD), double-debiased machine learning (DML), synthetic control methods (SCM), and synthetic difference-in-differences.
Students will review relevant research papers on each topic and actively engage in presentations and discussions about the nature of causation and alternative means of inferring causal relationships. Students will also carry out a collaborative group project where they design an experiment and associated plan of analysis to draw business insights.
Jaewon Yoo · Assistant Professor
Institute of Service Science, College of Technology Management
National Tsing Hua University
Kai-Chieh (Justin) Kao
Alumnus, ISS · TSMC
justin.kao [at] iss.nthu.edu.tw
Ting-Wen (Keri) Liu
M.S. Student, ISS, CADI Lab
keri.liu [at] iss.nthu.edu.tw
This course uses three platforms: this website, MS Teams, and Canvas:
| Purpose | Platform |
|---|---|
| Syllabus, schedule, readings, slides | This website |
| Announcements & Q&A | Teams → General channel |
| Shared papers & resources | Teams → Files |
| Office hours & consultations | Teams → DM + Google Calendar |
| Homework submission | Canvas |
| Grades | Canvas |
All course communication goes through MS Teams. Homework submissions and grades are on Canvas. For private matters, email or DM the instructor.
I update the ECI GitHub Page regularly throughout the semester. Items below have been recently revised and need your attention.
Design and analysis of randomized field experiments.
Causal inference with longitudinal data and staggered adoption.
Propensity Score (PSM) and Coarsened Exact Matching (CEM).
Sharp and Fuzzy regression discontinuity designs.
Double-debiased ML (DML) and Heterogeneous Treatment Effects (HTE) with Meta-learners.
Graphical models for identification and exploring causal mechanisms.
Please refer to the Syllabus PDF for the most up-to-date schedule and full reading lists. Click the purple badges to view lecture slides. Readings should be completed before class. Items marked Due must be submitted via Canvas before that week’s class begins (Thu 14:20).
Week 1 (Feb 26): Introduction and Potential Outcomes Slides
Week 2 (Mar 5): Randomization Inference Slides
Week 3 (Mar 12): Inference for the Average Treatment Effect Slides
Week 4 (Mar 19): Linear Regression and Randomized Experiments Slides
Week 5 (Mar 26): Individual/Group Project Meetings
Week 6 (Apr 2): School Holiday (no class)
Week 7 (Apr 9): Observational Studies I Slides
Week 8 (Apr 16): DAGs and Covariate Selection Slides
Week 9 (Apr 23): Instrumental Variables I (Noncompliance and IV) Slides
Week 10 (Apr 30): Instrumental Variables II (TSLS) Slides
Week 11 (May 7): Panel Data, Fixed Effects, and Difference-in-Differences Slides
Week 12 (May 14): Matching and Weighting Estimators Slides Slides (b)
Week 13 (May 21): Regression Discontinuity Designs Slides
Week 14 (May 28): Regression Discontinuity Designs Cont. Slides
Week 15 (Jun 4): Advanced Topics TENTATIVE
Week 16 (Jun 11): Final Group Project Presentations
Detailed notes in article format, covering material discussed in class and beyond.
Required (please have these ready before the semester begins):
Optional (useful for additional coverage):
We use R (with RStudio) as the primary computing environment. Key packages include:
tidyverse, fixest, did, rdrobust, MatchIt, WeightIt, grf, dagitty, ggdag, modelsummary
Students comfortable with Stata or Python may use those for their research projects, but in-class demonstrations and code examples will be in R.
| Component | Weight | Description |
|---|---|---|
| Research Project | 50% | Original empirical research paper (max 20 pages). Includes proposal, progress report, presentation, and final paper. |
| Homework | 30% | |
| Problem Sets | 15% | Conceptual questions, analytic problems, simulations, and data analysis. |
| The Effect Assignments | 15% | Exercises from The Effect by Huntington-Klein. |
| Participation (incl. presentations) | 10% | Active engagement in discussions + paper introduction presentations. |
| One-Page Summaries | 5% | Weekly reading summaries, graded complete/incomplete. |
| Attendance | 5% | Expected at every session; each absence costs ~1% of final grade. |
In lieu of midterms and a final exam, students write a short paper applying or extending the causal inference methods learned in class. The paper should be no longer than 20 double-spaced pages and focus on research design, data, methodology, results, and analysis.
To give you a clear sense of expectations and grading criteria, here is a sample final report with instructor evaluation from a previous offering:
| Milestone | Due | Deliverable |
|---|---|---|
| Find a collaborator | Week 2 | Form a team or obtain permission for individual project |
| Project proposal | Week 5 | Half-page description of proposed project & feasible research plan |
| Progress report | Week 11 | 5-page memo with preliminary results, tables, figures, and analysis |
| Final project report | Week 15 | Submit final version of the paper |
| Final presentation | Week 16 | In-class group presentation |
Final presentations are accompanied by structured peer feedback. After each presentation, every student completes a feedback form covering four areas: Research Question, Identification Strategy, Threats & Limitations, and a Constructive Suggestion. Each area includes space for written comments and a 1–5 rating.
→ Open Printable Feedback Form
Feedback forms are shared with presenters to support their final revisions. The quality and thoughtfulness of your feedback (not the scores you give) contributes to your Participation grade. Presenter grades are determined by the instructor independently.
You will have two types of homework:
You are encouraged to work in groups, but you must always write your own solutions including your own computer code. It is hugely beneficial to attempt the problems on your own before working in groups.
Late policy: Late submissions are penalized 1 percentage point of the assignment’s weight per day. For example, an assignment worth 7% of the course grade turned in 3 days late has a maximum attainable score of 4%.
Choose 5 papers from the assigned reading list and write a one-page summary addressing the following:
You may also discuss: one-sentence conclusion, institutional background, conceptual framework, or relevant literature. Submit as PDF via Teams.
Students will present papers from the following five topic groups. Each group contains five readings: a textbook chapter plus published applications from top journals. Presentation assignments will be finalized before the first presentation week.
Note: If you have a top-journal paper that uses a method covered in one of these modules (e.g., a DiD paper for the DiD module) and you would prefer to present it instead, please contact the instructor before presentations begin to request a substitution.
1. Field Experiments (Weeks 4–5)
2. Instrumental Variables (Weeks 8–9)
3. Difference-in-Differences (Week 11)
4. Matching Methods (Week 12)
5. Regression Discontinuity Design (Weeks 13–14)
Attendance: All students are expected to attend every class. Please bring your own hard copy of the course materials distributed before class. If you must miss a class, inform the instructor or TA in advance via email or phone. You are still responsible for the materials covered. Attendance counts toward your participation score (5%); each absence costs approximately 1% of the final grade.
Participation (10%): This includes both active engagement in class discussions and the quality of your paper introduction presentations. Stay active and engaged. Effective discussions require that everyone comes prepared. Be ready to share your opinions and thoughts.
Late Policy: Late submissions are penalized 1 percentage point of the assignment’s weight per day. For example, an assignment worth 7% of the course grade turned in 3 days late has a maximum attainable score of 4%.
Academic Honesty: All work submitted must be the student’s own. Violations will result in a zero for the first offense; subsequent violations result in a failing grade for the course. Submissions will be checked via Turnitin.
AI Use Policy: Students are permitted to use AI tools (e.g., ChatGPT, Claude) responsibly as editorial assistants and sounding boards to refine, stress-test, or get feedback on their own work, not to generate drafts in their place. The final submission must predominantly reflect the student’s own understanding and reasoning.
A few risks to avoid and practices to adopt when using AI for scholarly writing:
If you use AI tools, you must:
For deeper discussion of these trade-offs, see Paul Goldsmith-Pinkham’s (Yale) Writing and Thinking with AI Assistance. For practical writing principles and discipline-specific norms AI tools often miss (active voice, concrete examples, citation integrity, anti-AI-pattern hygiene), see this open-source GitHub repository by Lu Han, which distills writing guidance from leading social scientists, including Nobel laureates Claudia Goldin and Michael Kremer alongside John Cochrane, Deirdre McCloskey, and Jesse Shapiro. The takeaway I find most compelling: academic writing is not an art but a learnable craft, a set of best practices that students can study and apply directly.
Accommodation: Students with disabilities or special needs should contact the instructor during the first week of class to arrange appropriate accommodations.