Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding a French Date Recognizer #1431

Open
cpetresc opened this issue Aug 6, 2024 · 1 comment
Open

Adding a French Date Recognizer #1431

cpetresc opened this issue Aug 6, 2024 · 1 comment

Comments

@cpetresc
Copy link

cpetresc commented Aug 6, 2024

Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
Dear Presidio Team at Microsoft,

When I try to use your library, Presidio, on French text using the SpaCy model, I am unable to detect French dates. For example, '3 janvier 2001', 'janvier 2001', or '3 janvier' are not recognized.

Describe the solution you'd like
A clear and concise description of what you want to happen.
I would like to have the capability to detect French dates with a dedicated French date recognizer

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
I have written a class called FrDateRecognizer that can detect French dates using regex. You can find this class in the file attached to this issue.
fr_date_recognizer.zip

Additional context
Add any other context or screenshots about the feature request here.

@omri374
Copy link
Contributor

omri374 commented Aug 7, 2024

Hi, have you tried using a French NER model from either spaCy or Huggingface? They usually have good support for dates and there's no need for rule based logic.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants