Skip to content

Latest commit

 

History

History
57 lines (44 loc) · 1.32 KB

README.md

File metadata and controls

57 lines (44 loc) · 1.32 KB

awesome-pii

Awesome-PII is a tool collection related to detecting, extracting, and removing PII from data.

Regex

The regexes.yaml schema may change. I may change the schema later for future versions.

Regexes are tested with ruby 2.5.9 on Rubular.

Regexes are downcase sensitive. (meaning downcase text before using them)

Regexes are incomplete. Pull Requests welcome.

Categories

  • Phone
    • US
  • Email
  • Socials
    • Instagram
    • TikTok
    • Snapchat
    • LinkedIn
    • WhatsApp
    • Telegram
    • Twitter
    • Facebook
  • ID card
    • Social Security Number
  • Internet
    • IP Address
    • Domain Name
  • Personal
    • Date of Birth (US)
    • Race
    • Religion

Roadmap

  • Regexes

    • Implement regex groups to extract parts of regex
    • (ongoing) add support for obsfuncation (e.g. "s.c." means snapchat)
    • regexes with PII
    • remove PII from text (with ChatGPT)
  • Images

    • Image PII (with OCR)
    • Image PII removal (with OCR and Stable Diffusion)

[ ] Multi-language support

Special Thanks