Skip to content

jamon8888/anonymize

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

221 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stella anonymize

anonymize

Monorepo for the Stella anonymization stack.

It contains the runtime package, the published data package, and the browser/WASM entrypoint used by downstream apps.

Packages

Package Purpose
@stll/anonymize Native runtime for multi-layer PII detection and anonymization
@stll/anonymize-data Published deny-list dictionaries and trigger/config data
@stll/anonymize-wasm Browser/WASM build of the runtime
@stll/anonymize-cli Command-line anonymization (anonymize binary)

Install

bun add @stll/anonymize
# Optional runtime data bundle
bun add @stll/anonymize-data
# Browser / Vite usage
bun add @stll/anonymize-wasm

Or anonymize from the terminal without installing:

echo "Contact Jan Novák at jan.novak@example.com" | bunx @stll/anonymize-cli
# Contact [PERSON_1] at [EMAIL_ADDRESS_1]

What it does

  • Regex-based detection for common identifiers, dates, and legal entities
  • Trigger phrases and deny-list matching for language-aware anonymization
  • NER, coreference handling, and confidence boosting
  • Native, browser, and Vite-compatible entrypoints

Development

bun install --frozen-lockfile
bun run lint
bun run typecheck
bun run test

Git hooks (opt-in)

Lefthook config lives at lefthook.yml and is not auto-installed. To enable local hooks (format on pre-commit, typecheck + format check on pre-push):

bun run hooks:install
# bun run hooks:uninstall to remove

Release hygiene

  • Pinned GitHub Actions workflows validate lint, typecheck, tests, and package tarballs before release.
  • The data package tarball is checked to make sure every exported dictionary path is present.
  • Release publishing is gated behind manual workflow dispatch and provenance-enabled npm publish steps.

Repository layout

About

Text anonymization pipeline for Stella legal data workflows

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • TypeScript 60.7%
  • Rust 35.3%
  • JavaScript 3.9%
  • Shell 0.1%