⭐ Star AlbumentationsX on GitHub — 307+ stars and counting!

Star on GitHub
microsoft

presidio

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

7,593stars
Forks
1,001
Open issues
71
Watchers
7,593
Size
319.0 MB
PythonMIT License
anonymizationdata-anonymizationdata-maskingdata-obfuscationdata-privacydata-redactionde-identificationguardrailsimage-redactornamed-entity-recognitionnlppersonally-identifiable-informationphipiipii-detectionprivacypythonsensitive-dataspacytransformers
Created: May 4, 2018
Updated: Apr 14, 2026
Last push: Apr 14, 2026