Redactable

Deterministic-first PII/PHI de-identification you can prove.
Benchmarked recall · versioned jurisdiction policies · reversible tokenization · reproducible audit trail.

pip install redactable · Apache-2.0 · runs anywhere your data already lives

De-identification is a proof problem, not a model problem. A team that ships data across a trust boundary — to a vendor, a cloud LLM, or a shared training corpus — cannot adopt a redactor they can't measure, version, or audit. Redactable makes accuracy a number you can put in CI.

Why another redaction tool?

Most PII tools are a pile of regexes or a thin wrapper around an LLM, and neither tells you how much it misses. Redactable is built around three convictions that the alternatives do not share:

Deterministic-first. Structured identifiers (credit cards, IBANs, SSNs) are caught by regex + checksum — a Luhn or MOD-97 check is provably correct, reproducible, and auditable. A probabilistic LLM can't beat a checksum and only adds non-determinism and hallucination risk to the one category where a miss is a breach.
The model is a commodity; the eval is the product. Contextual PII (names, locations) is handled by an encoder NER (GLiNER) — non-autoregressive, so it can't hallucinate or regurgitate — and every engine (ours, Presidio, a cloud API) is scored on the same labeled corpus. If recall drops, your build fails.
Provable, not promised. Versioned jurisdiction policy packs (HIPAA Safe Harbor's 18 identifiers, GDPR special categories), reversible tokenization so de-identified data stays joinable, and a reproducible audit manifest of exactly what was redacted, by which policy version, by which engine version, when.
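
To make the "regex + checksum" claim concrete, here is a minimal sketch of the two checksums named above. It is not Redactable's own implementation: the function names `luhn_valid` and `iban_valid` are hypothetical, and the IBAN check covers only the generic format and the MOD-97 step, not per-country lengths.

```python
import re


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    if len(digits) < 2:
        return False
    total = 0
    # Walk right to left, doubling every second digit.
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9  # same as summing the two digits of the product
        total += d
    return total % 10 == 0


def iban_valid(iban: str) -> bool:
    """Return True if `iban` passes the generic MOD-97 check."""
    s = re.sub(r"\s+", "", iban).upper()
    if not re.fullmatch(r"[A-Z]{2}[0-9]{2}[A-Z0-9]{11,30}", s):
        return False
    # Move country code and check digits to the end, then map A..Z to 10..35.
    rearranged = s[4:] + s[:4]
    numeric = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(numeric) % 97 == 1


print(luhn_valid("4111 1111 1111 1111"))          # True
print(luhn_valid("4111 1111 1111 1112"))          # False
print(iban_valid("GB82 WEST 1234 5698 7654 32"))  # True
```

Both functions are pure and model-free, so the same input always gives the same answer. That is the property the deterministic-first argument relies on.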

Quick start

pip install redactable

# De-identify a file and write a reproducible audit manifest alongside it
redactable redact notes.txt --policy hipaa-safe-harbor --out notes.redacted.txt --audit notes.audit.json

# Add contextual entities (names, locations) with the optional encoder NER
pip install "redactable[ner]"
redactable redact notes.txt --policy hipaa-safe-harbor --ner --out notes.redacted.txt

# Prove recall against a labeled corpus — exits non-zero on regression (drop this in CI)
redactable eval --corpus corpus/seed.jsonl --policy pii-structured --gate

from redactable import Redactor

# The deterministic core (no model, no download) catches structured identifiers:
r = Redactor.from_policy("hipaa-safe-harbor")
out = r.redact("Email jane@acme.io or call (212) 555-0188; card 4111 1111 1111 1111.")
print(out.text)
# Email [EMAIL] or call [PHONE]; card [CREDIT_CARD].

# Names/locations need the optional encoder NER — add it explicitly:
#   from redactable.detectors.ner import GlinerDetector
#   r = Redactor.from_policy("hipaa-safe-harbor",
#                            detectors=[*r.detectors, GlinerDetector()])

Use the reusable GitHub Action to gate PRs:

# .github/workflows/pii-gate.yml
- uses: redactable/redactable@v0
  with:
    corpus: corpus/seed.jsonl
    policy: pii-structured   # deterministic types only — passes with no model

Names & places, in any runtime

Structured PII is caught by math everywhere. Contextual PII (names, places, orgs) has no checksum, so it needs a model — and the engine swaps in whichever model fits the runtime, behind one Detector interface. GLiNER (a small CPU encoder) is the default contextual engine; Gemma is the last-resort fallback for when you have a GPU and want max recall.

# Default: GLiNER — an encoder NER (auditable, CPU, fast, can't hallucinate)
pip install "redactable[ner]"
redactable redact notes.txt --policy hipaa-safe-harbor --ner

# Last resort (max recall, needs a GPU/server): Gemma via any OpenAI-compatible endpoint,
# e.g. Ollama — `ollama run gemma3` — text never leaves your machine
redactable redact notes.txt --policy hipaa-safe-harbor --llm --llm-model gemma3

# In the browser: Gemma-4 on WebGPU (see web/) — the GPU "last resort" tier, runs in the tab

Same policies, same eval harness, same audit trail — only the contextual Detector changes. A missing/unreachable model degrades gracefully: the deterministic core still runs.
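
The "one Detector interface" idea and the graceful-degradation behavior can be sketched as follows. This is an illustration under assumptions, not the library's actual code: `Span`, `Detector`, `EmailDetector`, `UnavailableModelDetector`, and `detect_all` are hypothetical names, and the email regex is deliberately simple.

```python
from __future__ import annotations

import re
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class Span:
    start: int
    end: int
    label: str


class Detector(Protocol):
    """Anything with a detect() method can be plugged in."""

    def detect(self, text: str) -> list[Span]: ...


class EmailDetector:
    """Deterministic detector (simplified regex, for illustration only)."""

    _pattern = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

    def detect(self, text: str) -> list[Span]:
        return [Span(m.start(), m.end(), "EMAIL")
                for m in self._pattern.finditer(text)]


class UnavailableModelDetector:
    """Stands in for a contextual model whose backend cannot be reached."""

    def detect(self, text: str) -> list[Span]:
        raise ConnectionError("model endpoint unreachable")


def detect_all(text: str, deterministic: list[Detector],
               contextual: list[Detector]) -> tuple[list[Span], list[str]]:
    """Run every detector; a failing contextual model is skipped, not fatal."""
    spans: list[Span] = []
    for detector in deterministic:
        spans.extend(detector.detect(text))
    skipped: list[str] = []
    for detector in contextual:
        try:
            spans.extend(detector.detect(text))
        except (ImportError, ConnectionError, TimeoutError) as exc:
            skipped.append(f"{type(detector).__name__}: {exc}")
    return sorted(spans, key=lambda s: s.start), skipped


spans, skipped = detect_all("Email jane@acme.io today",
                            [EmailDetector()], [UnavailableModelDetector()])
print(spans)    # the EMAIL span is still found
print(skipped)  # the unreachable model is reported, not raised
```

Returning the skipped detectors alongside the spans lets a caller record the degradation, for example in an audit manifest, instead of hiding it.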

Drop it into your coding agent

Scrub PII before it reaches the model — via the familiar mechanisms:

# MCP server — one-line add for any MCP-aware agent (Claude Code, Cursor, …)
pip install "redactable[mcp]"
claude mcp add redactable -- redactable-mcp

# Or fully automatic for ANY agent — a local scrub-proxy on the wire (one env var):
redactable serve
export ANTHROPIC_BASE_URL=http://localhost:8080      # Claude Code; or --openai-api-base for others

The MCP server exposes scrub / restore / detect tools; the proxy tokenizes PII out of every request (with a meta-prompt that keeps placeholders verbatim) and restores it in the reply — so the model provider never sees real PII while the agent stays coherent. Also ships a Claude Code pre-send hook (hooks/redactable-userpromptsubmit.py) and a git pre-commit hook. Full guide: docs/INTEGRATIONS.md.
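
The scrub-then-restore round trip described above can be sketched in a few lines. This is a simplified illustration, not the proxy's real code: the `scrub` and `restore` functions here are hypothetical, only emails are handled, and the keymap is a plain in-memory dict.

```python
from __future__ import annotations

import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PLACEHOLDER = re.compile(r"\[[A-Z_]+_\d+\]")


def scrub(text: str, keymap: dict[str, str]) -> str:
    """Replace each email with a stable [EMAIL_n] token, recorded in keymap."""
    reverse = {value: token for token, value in keymap.items()}

    def replace(match: re.Match) -> str:
        value = match.group(0)
        if value not in reverse:
            token = f"[EMAIL_{len(reverse) + 1}]"
            reverse[value] = token
            keymap[token] = value
        return reverse[value]  # same value always maps to the same token

    return EMAIL.sub(replace, text)


def restore(text: str, keymap: dict[str, str]) -> str:
    """Swap known placeholders back; unknown ones are left untouched."""
    return PLACEHOLDER.sub(lambda m: keymap.get(m.group(0), m.group(0)), text)


keymap: dict[str, str] = {}
request = scrub("Mail jane@acme.io, cc bob@acme.io, then jane@acme.io again.", keymap)
print(request)  # Mail [EMAIL_1], cc [EMAIL_2], then [EMAIL_1] again.

reply = "Reply to [EMAIL_2] first."  # what a model might send back
print(restore(reply, keymap))        # Reply to bob@acme.io first.
```

The keymap never leaves the local process in this sketch, so only the placeholders cross the trust boundary.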

Running inside a coding agent? The scrub-proxy works with any harness, because every harness ends in the same HTTPS call to the model. An agent can set this up itself in two steps (redactable serve, then point its base URL at it) — copy-paste runbook with a self-verification test: docs/AGENT-SETUP.md.

What's in the box (v0.1)

Deterministic detectors — email, phone, US SSN, credit card (Luhn), IBAN (MOD-97), IPv4/IPv6, US routing (ABA), URL, and more. Confidence is 1.0/0.0, not a guess.
Encoder-NER detector (optional pip install redactable[ner]) — PERSON / LOCATION / ORG via GLiNER. The deterministic core has zero heavy dependencies; the [ner] extra pulls PyTorch + transformers transitively (a large, CPU-capable download), so it stays opt-in.
Eval harness — per-entity precision / recall / F1 over a labeled corpus, with a configurable regression gate for CI.
Reversible tokenization — consistent [TYPE_n] placeholders, joinable across a document, re-identifiable under a local keymap.
Policy packs — declarative, versioned YAML. Ships hipaa-safe-harbor (the 18 identifiers) and pii-structured (deterministic types only, passes with no model).
CLI + Python library + reusable GitHub Action.
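
As a rough picture of what a per-entity eval with a regression gate computes, here is a small sketch. It assumes exact span matching and a hypothetical in-memory format (one set of `(start, end, label)` tuples per document); the real harness, its corpus format, and its matching rules may differ.

```python
from collections import defaultdict


def score(gold, predicted):
    """Per-label precision, recall, and F1 using exact span matching."""
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0})
    for gold_spans, pred_spans in zip(gold, predicted):
        for span in pred_spans:
            counts[span[2]]["tp" if span in gold_spans else "fp"] += 1
        for span in gold_spans - pred_spans:
            counts[span[2]]["fn"] += 1  # labeled PII the engine missed
    report = {}
    for label, c in counts.items():
        p = c["tp"] / (c["tp"] + c["fp"]) if c["tp"] + c["fp"] else 0.0
        r = c["tp"] / (c["tp"] + c["fn"]) if c["tp"] + c["fn"] else 0.0
        f1 = 2 * p * r / (p + r) if p + r else 0.0
        report[label] = {"precision": p, "recall": r, "f1": f1}
    return report


def gate(report, min_recall):
    """Return the labels whose recall is below the configured floor."""
    missing = {"recall": 0.0}
    return sorted(label for label, floor in min_recall.items()
                  if report.get(label, missing)["recall"] < floor)


gold = [{(0, 5, "EMAIL"), (10, 20, "PHONE")}, {(3, 9, "PHONE")}]
pred = [{(0, 5, "EMAIL"), (10, 20, "PHONE")}, set()]
report = score(gold, pred)
print(report["PHONE"]["recall"])                   # 0.5
print(gate(report, {"EMAIL": 0.99, "PHONE": 0.9}))  # ['PHONE']
```

In CI, a non-empty list from `gate` would translate into a non-zero exit code, which is what fails the build.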

Validated on real data

Measured against the independent, third-party ai4privacy/pii-masking-200k dataset (1,500 English examples), not our own fixtures. These figures are for the deterministic engine only; recall is the headline de-identification metric:

Entity	EMAIL	URL	IBAN	IP_ADDRESS	PHONE	US_SSN	CREDIT_CARD
Recall	1.000	1.000	1.000	0.996	0.609	0.560	0.181*

Precision is 100% on this sample: every span flagged is real PII. *CREDIT_CARD recall is bounded by the dataset: only 18% of its synthetic "cards" are Luhn-valid, and the checksum gate correctly rejects the rest, so on Luhn-valid numbers recall is close to perfect. PHONE (US-centric regex) and the contextual types (names/places, handled by the NER/Gemma tier) are the honest gaps. Full methodology, interpretation, and the two bugs this benchmark caught: benchmarks/.

What this is not

Redactable does not claim legal compliance, and it never presents an automatic redaction of high-stakes PHI as settled fact. It is assisted de-identification with measured recall: a high-recall, flag-and-prove tool whose output you can audit. Compliance is a process; this gives you the evidence for it.

Project status

Early (0.1.0, alpha). The deterministic core + eval harness are the foundation; the roadmap (hosted pipeline, maintained policy packs, signed attestation, warehouse/log connectors, an optional browser build) is tracked in docs/.

License

Apache-2.0. See LICENSE.
