presidio
microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
7,593stars
Forks
1,001
Open issues
71
Watchers
7,593
Size
319.0 MB
PythonMIT License
anonymizationdata-anonymizationdata-maskingdata-obfuscationdata-privacydata-redactionde-identificationguardrailsimage-redactornamed-entity-recognitionnlppersonally-identifiable-informationphipiipii-detectionprivacypythonsensitive-dataspacytransformers
Created: May 4, 2018
Updated: Apr 14, 2026
Last push: Apr 14, 2026