**PR Summary:** Introduce `maketables` plug-in dunder methods on `BaseExperiment` (coef table, stats, depvar, vcov) with Bayesian/OLS handling, plus a demo notebook.
*Written by Cursor Bugbot for commit c5bb622.*
```python
widths = sorted_samples[interval_size:] - sorted_samples[:n_intervals]
min_idx = np.argmin(widths)
ci_lower = float(sorted_samples[min_idx])
ci_upper = float(sorted_samples[min_idx + interval_size])
```
**HDI calculation has off-by-one error in interval bounds**

The HDI (Highest Density Interval) calculation in `_maketables_coef_table_bayesian` has an off-by-one error. For a 94% interval containing `ceil(0.94 * n)` samples starting at index `i`, the interval spans indices `i` to `i + interval_size - 1`. However, the code computes `widths = sorted_samples[interval_size:] - sorted_samples[:n_intervals]`, which calculates `sorted_samples[i + interval_size] - sorted_samples[i]` instead of `sorted_samples[i + interval_size - 1] - sorted_samples[i]`. Similarly, `ci_upper` uses `sorted_samples[min_idx + interval_size]` when it needs `sorted_samples[min_idx + interval_size - 1]`. This results in intervals containing one extra sample and potentially selecting a suboptimal HDI.
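For illustration, a minimal sketch of the corrected indexing as a standalone function (the name `hdi_94` and the function wrapper are hypothetical; the PR implements this logic inline in `_maketables_coef_table_bayesian`):

```python
import numpy as np

def hdi_94(samples: np.ndarray) -> tuple[float, float]:
    """Highest-density interval containing ~94% of the samples."""
    sorted_samples = np.sort(np.asarray(samples))
    n = sorted_samples.size
    interval_size = int(np.ceil(0.94 * n))
    # An interval of `interval_size` samples starting at index i ends at
    # index i + interval_size - 1, so there are n - interval_size + 1
    # candidate intervals and both offsets need the `- 1` correction.
    n_intervals = n - interval_size + 1
    widths = sorted_samples[interval_size - 1:] - sorted_samples[:n_intervals]
    min_idx = int(np.argmin(widths))
    ci_lower = float(sorted_samples[min_idx])
    ci_upper = float(sorted_samples[min_idx + interval_size - 1])
    return ci_lower, ci_upper
```

A side benefit: with `n_intervals = n - interval_size + 1`, the `widths` array has at least one element even when `interval_size == n`, which also mitigates the small-sample crash reported below.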
```python
}
# Add R² if available
if hasattr(self, "score") and self.score is not None:
    stat_map["r2"] = self.score
```
**R² statistic returns Series instead of scalar value**

The `__maketables_stat__` method assigns `self.score` directly to `stat_map["r2"]` without handling the case where `self.score` is a pandas Series (which occurs for PyMC models). As shown in the notebook output, this causes the R² row in maketables to display the entire Series representation (`unit_0_r2 0.836121  unit_0_r2_std 0.012656  dtype: float64`) instead of a clean scalar value. Other code in the codebase (e.g., `_bayesian_plot`) properly extracts `self.score["unit_0_r2"]` when `self.score` is a Series.
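A minimal sketch of a fix, reusing the Series key `"unit_0_r2"` that `_bayesian_plot` already extracts (the helper name `extract_r2` is hypothetical):

```python
import pandas as pd

def extract_r2(score) -> float:
    """Return a scalar R², whether `score` is a float (OLS models)
    or a pandas Series keyed like "unit_0_r2" (PyMC models)."""
    if isinstance(score, pd.Series):
        return float(score["unit_0_r2"])
    return float(score)

# Inside __maketables_stat__, the assignment would then become:
# stat_map["r2"] = extract_r2(self.score)
```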
```python
interval_size = int(np.ceil(0.94 * n))
n_intervals = n - interval_size
widths = sorted_samples[interval_size:] - sorted_samples[:n_intervals]
min_idx = np.argmin(widths)
```
**HDI calculation crashes with small MCMC sample sizes**

The HDI calculation in `_maketables_coef_table_bayesian` crashes with `ValueError: attempt to get argmin of an empty sequence` when the number of MCMC samples is 16 or fewer. When `n <= 16`, `interval_size = ceil(0.94 * n)` equals `n`, making `n_intervals = 0`, which results in an empty `widths` array. Calling `np.argmin` on an empty array raises a `ValueError`. While unlikely in normal usage where MCMC runs with hundreds of samples, this could occur in quick testing scenarios with minimal sampling parameters.
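If an explicit guard is preferred over relying solely on the indexing correction above, one option (again a sketch with a hypothetical function name) is to fall back to the full sample range when there is no room to trim:

```python
import numpy as np

def hdi_94_guarded(samples: np.ndarray) -> tuple[float, float]:
    """94% HDI with a guard for very small MCMC sample counts."""
    sorted_samples = np.sort(np.asarray(samples))
    n = sorted_samples.size
    interval_size = int(np.ceil(0.94 * n))
    if interval_size >= n:
        # ceil(0.94 * n) == n for n <= 16: nothing can be trimmed,
        # so the tightest 94% interval is the full sample range.
        return float(sorted_samples[0]), float(sorted_samples[-1])
    n_intervals = n - interval_size + 1
    widths = sorted_samples[interval_size - 1:] - sorted_samples[:n_intervals]
    min_idx = int(np.argmin(widths))
    return (
        float(sorted_samples[min_idx]),
        float(sorted_samples[min_idx + interval_size - 1]),
    )
```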
**Codecov Report** ❌ Patch coverage is low: 41 of the 50 new lines are not covered by tests, and overall coverage drops from 93.27% to 92.60% (-0.67%).

Additional details and impacted files:

```
@@            Coverage Diff             @@
##             main     #600      +/-   ##
==========================================
- Coverage   93.27%   92.60%   -0.67%
==========================================
  Files          37       37
  Lines        5632     5682      +50
  Branches      367      373       +6
==========================================
+ Hits         5253     5262       +9
- Misses        248      289      +41
  Partials      131      131
```

☔ View full report in Codecov by Sentry.
Hi @drbenvincent,

Attached is a rather drafty PR (put together with lots of support from Claude) that enables `maketables` support for CausalPy via the new plug-in solution @dsliwka implemented. An example notebook is attached as well.

For the RDD class, for example, you get something like this:

Would you generally be interested in merging this PR?
Where I'd need your help: which coefficients should be reported by default? Which statistics might users be interested in beyond the defaults? Is my choice of 94% probability intervals a good one? Etc.
Note that this PR is more of a proof of concept and might need some cleaning up and unit tests. If this makes it into CausalPy, we should likely also implement tests over there to ensure we never break CausalPy flows.
Best, Alex
📚 Documentation preview 📚: https://causalpy--600.org.readthedocs.build/en/600/