Add estimands knowledgebase page for core methods #683
ErikRingen wants to merge 4 commits into pymc-labs:main
Conversation
Documents the causal estimands, computation approaches, and key assumptions for DiD, ITS, Synthetic Control, and Regression Discontinuity. Clarifies the distinction between parametric interaction models (ATT via coefficients) and pre-post counterfactual models (time-varying unit-specific impacts). Addresses pymc-labs#681
Codecov Report ✅ All modified and coverable lines are covered by tests.

@@            Coverage Diff             @@
##             main     #683      +/-   ##
==========================================
+ Coverage   93.37%   94.35%    +0.98%
==========================================
  Files          39       44        +5
  Lines        5930     7517     +1587
  Branches      386      456       +70
==========================================
+ Hits         5537     7093     +1556
- Misses        255      262        +7
- Partials      138      162       +24
Nice. Over the weekend I had the realisation that in most (all?) cases we are basing our estimands upon coefficient estimates. But we might be better off using a g-computation type approach. That might make things more robust: for example, coefficient-based estimates will presumably break as soon as you add link functions (which we don't yet have, but should probably add some examples of in the docs), or if an effect is moderated by an interaction term with the treatment variable. Though in that case I guess the estimand needs to be the CATE, and the experimenter will most likely be aware of it. Anyway, this is not a fleshed-out proposal (one may follow in a separate issue), but early thoughts on using g-computation as the default method of estimating the estimand?
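To make the idea concrete, here is a minimal numpy sketch of g-computation (illustrative only, not CausalPy's API): with a treatment-by-covariate interaction, no single coefficient equals the average effect, but predicting every unit under treatment and under control and averaging the difference recovers it.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500
x = rng.normal(size=n)
t = rng.integers(0, 2, size=n).astype(float)
# Outcome with a treatment-by-covariate interaction, so no single
# coefficient equals the average treatment effect.
y = 1.0 + 0.5 * x + 1.0 * t + 0.8 * t * x + rng.normal(scale=0.1, size=n)

# Fit a linear model that includes the interaction term.
X = np.column_stack([np.ones(n), x, t, t * x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)

# G-computation: predict every unit under t=1 and t=0, then average the difference.
X_treated = np.column_stack([np.ones(n), x, np.ones(n), x])
X_control = np.column_stack([np.ones(n), x, np.zeros(n), np.zeros(n)])
ate = float(np.mean(X_treated @ beta - X_control @ beta))
```

Note that `ate` here equals `beta[2] + beta[3] * np.mean(x)`, i.e. the treatment coefficient alone would miss the moderation term.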
Early thought is that it would be a great idea, at least when we are using PyMC models as the backend/estimator and it is straightforward to do g-computation.
PR Summary (Cursor Bugbot, commit b8bd273): Low Risk. Updates the Knowledge Base.
drbenvincent left a comment
I think this will be an excellent addition. My own personal request would be to not hard-wrap lines.
I got a review from Claude - which picked up a few things.
I'm not 100% sure of its point 2.3. And I don't care if it thinks g-computation might surprise Epi readers 🤣 We could maybe add a short description in the glossary, but don't feel you have to do that in this PR. We are likely going to increase the mention of g-computation in the repo in the near future anyway.
Overall Assessment
This is a valuable addition to CausalPy's knowledgebase. The Lundberg et al. (2021) framework is well-chosen as an organizing principle, and the per-method sections provide a consistent structure (research question → estimand → assumptions → estimator → interpretation). The writing is clear and accessible.
However, the document contains several factual errors about how CausalPy actually computes treatment effects, particularly for Regression Discontinuity. These must be corrected before merging. Below, findings are organized by severity.
1. Errors (Must Fix)
1.1 RD estimator is mislabeled as "Coefficient-based"
Lines 190–193 and Quick Reference table (line 228)
The document states:
> Estimator: Coefficient-based. The treatment effect is estimated as the discontinuity in predicted outcomes at the cutoff, typically using local polynomials around the running variable threshold.
This is self-contradictory and factually wrong. The actual implementation in regression_discontinuity.py (lines 175–203) computes the treatment effect by:
- Creating two prediction points: `threshold - epsilon` (untreated) and `threshold + epsilon` (treated).
- Predicting outcomes at both points using the fitted model.
- Taking the difference: `pred[above] - pred[below]`.
This is prediction-based (closer to G-computation), not coefficient extraction. The effect is stored in `self.discontinuity_at_threshold`, not extracted from a coefficient vector.
Suggested fix: Change "Coefficient-based" to "Prediction-based" (or "G-computation") and update the Quick Reference table accordingly.
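The computation described above can be sketched as a standalone function (names and the toy model are illustrative, not CausalPy's actual API):

```python
import numpy as np

def discontinuity_at_threshold(predict, threshold, epsilon=1e-4):
    # Predict just below the cutoff (untreated) and just above it (treated),
    # then take the difference: a prediction-based effect, not a coefficient.
    pred_below = predict(threshold - epsilon, treated=0)
    pred_above = predict(threshold + epsilon, treated=1)
    return pred_above - pred_below

# Toy "fitted model": y = 1 + 2*x + 3*treated, so the true discontinuity is 3.
def predict(x, treated):
    return 1.0 + 2.0 * x + 3.0 * treated

effect = discontinuity_at_threshold(predict, threshold=0.5)
```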
1.2 RD does NOT use "local polynomials"
Line 191
CausalPy does not implement local polynomial regression. The implementation uses standard regression formulas via `patsy.dmatrices()` (e.g., `y ~ 1 + x + treated + x:treated`). There is an optional `bandwidth` parameter that filters the data to a window around the cutoff, but the model fitted within that window is still a standard parametric regression, not a local polynomial estimator (as used by e.g. `rdrobust`).
Suggested fix: Remove the phrase "typically using local polynomials" and replace with something like: "using a parametric regression model evaluated on both sides of the cutoff. An optional bandwidth parameter restricts the data to a window around the threshold."
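A minimal sketch of the suggested description, assuming a plain least-squares backend (the function name and toy data are illustrative, not CausalPy code):

```python
import numpy as np

def fit_rd_regression(x, y, treated, threshold, bandwidth=None):
    # Optional bandwidth: restrict the data to a window around the cutoff.
    if bandwidth is not None:
        keep = np.abs(x - threshold) <= bandwidth
        x, y, treated = x[keep], y[keep], treated[keep]
    # Standard parametric regression, mirroring y ~ 1 + x + treated + x:treated.
    X = np.column_stack([np.ones_like(x), x, treated, x * treated])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta

x = np.linspace(0.0, 1.0, 201)
threshold = 0.5
treated = (x >= threshold).astype(float)
y = 1.0 + 2.0 * x + 3.0 * treated + 0.5 * x * treated  # noiseless toy data

beta = fit_rd_regression(x, y, treated, threshold, bandwidth=0.2)
```

With noiseless data the fit recovers the generating coefficients exactly, whether or not the bandwidth window is applied.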
1.3 RD estimand terminology conflicts with IV
Line 175
The document calls the RD estimand:
> Local average treatment effect at the cutoff
In the causal inference literature, "LATE" (Local Average Treatment Effect) is specifically associated with Instrumental Variable designs and refers to the complier average causal effect. CausalPy's own glossary defines LATE as:
"Also known as the complier average causal effect (CACE), is the effect of a treatment for subjects who comply with the experimental treatment assigned to their sample group. It is the quantity we're estimating in IV designs."
Using the same term for RD creates confusion. The standard RD terminology is "local treatment effect at the cutoff" or simply "the sharp RD estimand".
Suggested fix: Change "Local average treatment effect at the cutoff" to "Local treatment effect at the cutoff" (dropping "average" removes the LATE association) or use a more explicit label like "Sharp RD treatment effect at the cutoff."
2. Inaccuracies / Omissions (Should Fix)
2.1 IPW weighting schemes are incomplete
Lines 208–209
The document lists three `weighting_scheme` options:

- `"raw"`, `"robust"` → ATE
- `"overlap"` → overlap population estimand
But the actual implementation (inverse_propensity_weighting.py, lines 42–44) supports four options:

- `"raw"` → ATE
- `"robust"` → ATE
- `"overlap"` → overlap population (ATO)
- `"doubly robust"` / `"doubly_robust"` → omitted from the document
Suggested fix: Add "doubly_robust" to the list and clarify which estimand it targets.
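For reference, the textbook weight formulas behind these schemes can be sketched as follows (standard IPW formulas from the literature; CausalPy's implementation may differ in detail, and "doubly robust" is omitted here because it combines an outcome model with the weights):

```python
import numpy as np

def ipw_weights(t, p, scheme="raw"):
    """Textbook inverse-propensity weights. t = treatment indicator,
    p = estimated propensity score. Illustrative sketch only."""
    t = np.asarray(t, dtype=float)
    p = np.asarray(p, dtype=float)
    if scheme in ("raw", "robust"):
        # ATE weights: 1/p for treated units, 1/(1-p) for controls.
        return t / p + (1.0 - t) / (1.0 - p)
    if scheme == "overlap":
        # ATO weights: emphasize units with propensity near 0.5.
        return t * (1.0 - p) + (1.0 - t) * p
    raise ValueError(f"unknown weighting scheme: {scheme!r}")

w_ate = ipw_weights([1, 0], [0.25, 0.25], scheme="raw")
w_ato = ipw_weights([1, 0], [0.25, 0.25], scheme="overlap")
```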
2.2 Quick Reference table is inconsistent with implementation
Line 228
The table lists RD's computation as "Coefficient-based". Per finding 1.1, this should be "Prediction-based" or "G-computation (local)".
| Method | Empirical Estimand | Computation |
|---|---|---|
| Regression Discontinuity | Local treatment effect at cutoff | Prediction-based |
2.3 SC "Convex hull coverage" assumption may not apply to all CausalPy models
Line 152
The "convex hull coverage" assumption comes from the original Abadie et al. synthetic control framework, which requires non-negative weights summing to one. CausalPy's Bayesian SC implementation does not necessarily enforce these constraints — the Bayesian model can produce weights outside [0, 1] or that don't sum to one.
Suggested fix: Qualify this assumption, e.g., "In the standard (non-Bayesian) formulation, the treated unit must lie within the convex hull of control units. CausalPy's Bayesian implementation relaxes this constraint."
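To make the distinction concrete, the constraint in question is simply that the weights form a convex combination; a Dirichlet prior on the weights would enforce it by construction, whereas an unconstrained prior would not. A small illustrative check (not CausalPy code):

```python
import numpy as np

def is_convex_combination(weights, atol=1e-8):
    # The classic Abadie-style constraint: non-negative weights summing to
    # one, i.e. the synthetic unit lies in the convex hull of the controls.
    w = np.asarray(weights, dtype=float)
    return bool(np.all(w >= -atol) and np.isclose(w.sum(), 1.0, atol=1e-6))

ok = is_convex_combination([0.2, 0.3, 0.5])    # non-negative, sums to one
bad = is_convex_combination([-0.1, 0.6, 0.5])  # negative weight: outside the hull
```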
3. Conceptual / Pedagogical Suggestions (Nice to Have)
3.1 DiD "No interference" assumption is SUTVA, not DiD-specific
Line 83
"No interference between units" (SUTVA) is a general assumption required by all causal methods covered in the document, not unique to DiD. Listing it only under DiD implies it's specific to that method. Consider either:
- Stating SUTVA once at the top as a universal assumption, or
- Listing it under all methods for consistency.
3.2 The ASCII pipeline diagram could be improved
Lines 12–14
Theoretical Estimand --> Empirical Estimand --> Estimator
This could be improved with a proper Mermaid diagram or a styled admonition. The plain ASCII is functional but feels out of place in otherwise well-formatted documentation. For example:
:::{mermaid}
graph LR
A["Theoretical Estimand<br/>(research question)"] --> B["Empirical Estimand<br/>(ATT, ATE, LATE, ...)"]
B --> C["Estimator<br/>(model + computation)"]
:::
3.3 Cross-reference style inconsistency
Lines 86, 118, 156, 187
Links to quasi_dags use raw Markdown link syntax:
`[Difference in Differences section of quasi_dags](quasi_dags.ipynb#difference-in-differences)`

while the rest of the document uses MyST cross-reference syntax (`{doc}`, `{term}`, `{cite:t}`). The auto-generated section anchors from notebook headers are fragile: renaming a header silently breaks the link. Consider using `{ref}` with explicit labels if possible, or at minimum note this fragility.
3.4 ITS and SC could mention effect_summary() more prominently
The ITS section (line 129) mentions `effect_summary()` but SC does not, despite both supporting it. Since `effect_summary()` is the primary programmatic way users access results, it would be helpful to mention it for SC as well.
3.5 IV and ANCOVA are referenced but have limited CausalPy support
Lines 206–216, 230
The "Context-Dependence" note references IV, and the closing paragraph mentions IV, IPW, and ANCOVA. However:
- IV: `plot()` and `summary()` both raise `NotImplementedError`.
- IPW: Bayesian-only (`supports_ols = False`).
- ANCOVA (`PrePostNEGD`): Bayesian-only.
Users reading this doc might expect full feature parity. A brief note about maturity levels would help set expectations.
3.6 Minor: "G-computation" terminology
The use of "G-computation" for ITS and SC is technically defensible (predict-under-intervention), but in the epidemiology literature, G-computation (Robins, 1986) refers specifically to standardization over the joint distribution of confounders. Readers from that background may find the term surprising. A brief parenthetical — e.g., "G-computation (counterfactual prediction)" — on first use would help.
4. Summary Table
| # | Issue | Severity | Lines |
|---|---|---|---|
| 1.1 | RD estimator is NOT "Coefficient-based" | Error | 190–193, 228 |
| 1.2 | RD does NOT use "local polynomials" | Error | 191 |
| 1.3 | RD "LATE" terminology conflicts with IV/glossary | Error | 175 |
| 2.1 | IPW missing `"doubly_robust"` weighting scheme | Omission | 208–209 |
| 2.2 | Quick Reference table wrong for RD | Inaccuracy | 228 |
| 2.3 | SC convex hull assumption may not apply to Bayesian SC | Inaccuracy | 152 |
| 3.1 | SUTVA listed only under DiD | Pedagogical | 83 |
| 3.2 | ASCII diagram could be a Mermaid diagram | Style | 12–14 |
| 3.3 | Cross-reference style inconsistency | Style | 86, 118, 156, 187 |
| 3.4 | SC section doesn't mention `effect_summary()` | Omission | ~165 |
| 3.5 | IV/ANCOVA maturity not noted | Clarity | 206–216, 230 |
| 3.6 | "G-computation" may confuse epi readers | Clarity | 121, 159 |
5. What Works Well
- The Lundberg et al. framework is an excellent organizing choice and provides a principled way for users to think about estimands.
- Consistent per-method structure (question → estimand → assumptions → estimator → interpretation) makes the document scannable and easy to reference.
- Explicit identification assumptions per method is very valuable — users often skip this step.
- Interpretation notes calling out common pitfalls (e.g., DiD's coefficient vs. counterfactual plot, ITS temporal variation, RD locality) are practical and helpful.
- The Quick Reference table is a nice summary (once corrected).
- Glossary term linking via `{term}` is done correctly throughout.
- The Lundberg citation is properly included in `references.bib` and cited via `{cite:t}`.
- Fix RD estimator: change "Coefficient-based" to "Prediction-based" (implementation uses prediction at threshold +/- epsilon, not coefficient extraction)
- Remove incorrect "local polynomials" claim for RD; describe parametric regression with optional bandwidth
- Fix RD estimand: "Local treatment effect at the cutoff" (drop "average" to avoid LATE/IV confusion)
- Add "doubly_robust" to IPW weighting scheme list
- Qualify SC convex hull assumption with Dirichlet prior detail
- Move SUTVA to shared assumption note (not DiD-specific)
- Add effect_summary() mention to SC interpretation note
- Add IV/ANCOVA implementation maturity caveat
- Unwrap hard-wrapped lines per reviewer request
- Remove LATE from empirical estimand examples (IV-specific)

Co-authored-by: Cursor <cursoragent@cursor.com>
@drbenvincent I think all relevant feedback is addressed in b8bd273
Documents the causal estimands, computation approaches, and key assumptions for DiD, ITS, Synthetic Control, and Regression Discontinuity. Clarifies the distinction between parametric interaction models (ATT via coefficients) and pre-post counterfactual models (time-varying unit-specific impacts).
Towards #681
📚 Documentation preview 📚: https://causalpy--683.org.readthedocs.build/en/683/