python_comparison.rst

Python Comparison

This guide compares diff-diff with other Python packages for DiD analysis, helping users understand the landscape and choose the right tool.

Overview

Feature	diff-diff	pyfixest	differences	CausalPy	linearmodels
Basic DiD	✅	✅	✅	✅	✅ (manual)
TWFE	✅	✅	❌	❌	✅
Callaway-Sant'Anna	✅	❌	✅	❌	❌
Sun-Abraham	✅	✅	❌	❌	❌
Synthetic DiD	✅	❌	❌	✅ (SC only)	❌
Honest DiD	✅	❌	❌	❌	❌
Wild bootstrap	✅	✅	❌	❌	❌

Package Summaries

pyfixest

pyfixest is inspired by R's fixest package and excels at fast fixed effects estimation.

Strengths:

Very fast for high-dimensional fixed effects (JIT-compiled)
Sun-Abraham event study estimator
Gardner's did2s two-stage estimator
Local projections (Dube et al. 2023)
Publication-ready tables

Gaps:

No Callaway-Sant'Anna estimator
No sensitivity analysis (Honest DiD)
No synthetic DiD

differences

differences provides a pure Callaway-Sant'Anna implementation.

Strengths:

Faithful implementation of Callaway & Sant'Anna (2021)
Multi-valued treatment support
Triple difference estimation
Handles unbalanced panels

Gaps:

No sensitivity analysis
No synthetic DiD
Limited inference options (no wild bootstrap)

CausalPy

CausalPy takes a Bayesian approach using PyMC for uncertainty quantification.

Strengths:

Bayesian credible intervals
Synthetic control methods
Nice built-in visualizations
Interrupted time series

Gaps:

Limited to simple 2x2 DiD designs
No staggered treatment support
No event study methods

linearmodels

linearmodels provides panel data econometrics, serving as a building block for DiD.

Strengths:

Solid PanelOLS implementation
Two-way fixed effects
Various standard error options
statsmodels-compatible

Gaps:

No DiD-specific methods (manual implementation required)
No staggered treatment handling
No diagnostics or sensitivity analysis

Why diff-diff?

diff-diff fills critical gaps in the Python DiD ecosystem:

Only Python library with Honest DiD sensitivity analysis

Reviewers increasingly expect sensitivity analysis showing robustness to parallel trends violations. diff-diff is the only Python package implementing Rambachan & Roth (2023).
Unified toolkit

Combines Callaway-Sant'Anna, Synthetic DiD, TWFE, and Honest DiD in one package with a consistent API.
Built-in diagnostics

Placebo tests, parallel trends tests, Goodman-Bacon decomposition, and sensitivity analysis out of the box.
sklearn-style API

Familiar fit() interface with rich result objects.

Feature Comparison Table

Feature	diff-diff	pyfixest	differences	CausalPy	linearmodels
Basic 2x2 DiD	✅	✅	✅	✅	✅
Two-way fixed effects	✅	✅	❌	❌	✅
Staggered DiD (CS)	✅	❌	✅	❌	❌
Sun-Abraham estimator	✅	✅	❌	❌	❌
Gardner's did2s	✅	✅	❌	❌	❌
Local projections	❌	✅	❌	❌	❌
Synthetic DiD	✅	❌	❌	❌	❌
Synthetic control	❌	❌	❌	✅	❌
Covariate adjustment	✅	✅	✅	Limited	✅
Doubly robust	✅	❌	✅	❌	❌
IPW	✅	❌	✅	❌	❌
Group-time effects	✅	❌	✅	❌	❌
Event study aggregation	✅	✅	✅	❌	❌
Honest DiD (ΔRM)	✅	❌	❌	❌	❌
Honest DiD (ΔSD)	✅	❌	❌	❌	❌
Sensitivity analysis	✅	❌	❌	❌	❌
Wild bootstrap	✅	✅	❌	❌	❌
Cluster-robust SE	✅	✅	✅	❌	✅
Placebo tests	✅	❌	❌	❌	❌
Parallel trends tests	✅	❌	❌	❌	❌
Bacon decomposition	✅	❌	❌	❌	❌
Event study plots	✅	✅	✅	✅	❌
Triple Difference (DDD)	✅	❌	✅	❌	❌
TROP	✅	❌	❌	❌	❌
Stacked DiD	✅	❌	❌	❌	❌
Bacon Decomposition	✅	❌	❌	❌	❌
Continuous DiD	✅	❌	❌	❌	❌
Efficient DiD	✅	❌	❌	❌	❌
Built-in datasets	✅	❌	❌	❌	❌
Bayesian inference	❌	❌	❌	✅	❌

Code Comparison

Basic DiD

# diff-diff
from diff_diff import DifferenceInDifferences

did = DifferenceInDifferences()
results = did.fit(data, outcome='y', treatment='treated', time='post')
print(results.summary())

# pyfixest
import pyfixest as pf

fit = pf.feols("y ~ treated:post | unit + time", data=data)
fit.summary()

# linearmodels
from linearmodels.panel import PanelOLS

mod = PanelOLS.from_formula(
    'y ~ treated:post + EntityEffects + TimeEffects',
    data=data.set_index(['unit', 'time'])
)
results = mod.fit(cov_type='clustered', cluster_entity=True)

Staggered DiD (Callaway-Sant'Anna)

# diff-diff
from diff_diff import CallawaySantAnna

cs = CallawaySantAnna(estimation_method='dr')
results = cs.fit(
    data,
    outcome='y',
    unit='unit',
    time='time',
    first_treat='first_treat',
    covariates=['x1', 'x2'],
    aggregate='event_study'
)
event_study = results.event_study_effects

# differences
from differences import ATTgt

att = ATTgt(
    data=data,
    outcome='y',
    cohort='first_treat',
    time='time',
    strata='unit'
)
att.fit()
att.aggregate('event')

Sensitivity Analysis

# diff-diff (only Python option)
from diff_diff import HonestDiD, plot_sensitivity

honest = HonestDiD(method='relative_magnitude', M=1.0)
results = honest.fit(event_study_results)

# Sensitivity over M grid
sensitivity = honest.sensitivity_analysis(
    event_study_results,
    M_grid=[0, 0.5, 1.0, 1.5, 2.0]
)
plot_sensitivity(sensitivity)

# No equivalent in other Python packages

When to Use Each Package

Use diff-diff when:

You need sensitivity analysis (Honest DiD)
You want Callaway-Sant'Anna with doubly robust estimation
You need built-in diagnostics (placebo tests, parallel trends)
You prefer a unified sklearn-style API

Use pyfixest when:

Speed is critical (high-dimensional fixed effects)
You need Sun-Abraham or did2s estimators
You want R's fixest-style syntax
You need local projections

Use differences when:

You need multi-valued treatment effects
You need triple difference estimation
You want the most faithful CS implementation

Use CausalPy when:

You want Bayesian uncertainty quantification
You're doing synthetic control (not synthetic DiD)
You prefer PyMC-based modeling

Use linearmodels when:

You're doing general panel econometrics beyond DiD
You need statsmodels compatibility
You're implementing DiD manually with full control

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python Comparison

Overview

Package Summaries

pyfixest

differences

CausalPy

linearmodels

Why diff-diff?

Feature Comparison Table

Code Comparison

Basic DiD

Staggered DiD (Callaway-Sant'Anna)

Sensitivity Analysis

When to Use Each Package

FilesExpand file tree

python_comparison.rst

Latest commit

History

python_comparison.rst

File metadata and controls

Python Comparison

Overview

Package Summaries

pyfixest

differences

CausalPy

linearmodels

Why diff-diff?

Feature Comparison Table

Code Comparison

Basic DiD

Staggered DiD (Callaway-Sant'Anna)

Sensitivity Analysis

When to Use Each Package