presidio
microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
8,359 stars
Forks
1,080
Open issues
70
Watchers
8,359
Size
318.5 MB
Python
MIT License
anonymization, data-anonymization, data-masking, data-obfuscation, data-privacy, data-redaction, de-identification, guardrails, image-redactor, named-entity-recognition, nlp, personally-identifiable-information, phi, pii, pii-detection, privacy, python, sensitive-data, spacy, transformers
Created: May 4, 2018
Updated: May 29, 2026
Last push: May 26, 2026
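
The description above mentions pattern matching as one way to detect and redact PII in text. The sketch below illustrates that general idea using only the Python standard library. It is not Presidio's API: the names `PATTERNS`, `Finding`, `detect`, and `redact` are hypothetical, and the two regular expressions are deliberately simplistic assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass

# Hypothetical, deliberately simple patterns (assumptions for illustration only);
# production detectors need far more robust rules plus context and validation.
PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE_NUMBER": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


@dataclass(frozen=True)
class Finding:
    """A detected span: entity label plus character offsets into the text."""
    entity_type: str
    start: int
    end: int


def detect(text: str) -> list[Finding]:
    """Run every pattern over the text and return findings ordered by position."""
    findings = [
        Finding(name, match.start(), match.end())
        for name, pattern in PATTERNS.items()
        for match in pattern.finditer(text)
    ]
    return sorted(findings, key=lambda f: f.start)


def redact(text: str, findings: list[Finding]) -> str:
    """Replace each finding with a <ENTITY_TYPE> placeholder.

    Spans are processed right to left so earlier offsets remain valid.
    Overlapping findings are not handled in this sketch.
    """
    for f in sorted(findings, key=lambda f: f.start, reverse=True):
        text = text[:f.start] + f"<{f.entity_type}>" + text[f.end:]
    return text


sample = "Contact Jane at jane.doe@example.com or 555-123-4567."
redacted = redact(sample, detect(sample))
print(redacted)
```

Separating detection from redaction keeps the two steps independently replaceable, so a different masking strategy can be swapped in without touching the detectors. Note that the name "Jane" passes through untouched here: a pure pattern-matching approach cannot recognize free-form names, which is where NLP-based recognition comes in.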