Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature: add tokenizer for parsing tags for booru-series sites #34

Open
Tracked by #246
MaikoTan opened this issue Mar 22, 2023 · 0 comments
Open
Tracked by #246

Feature: add tokenizer for parsing tags for booru-series sites #34

MaikoTan opened this issue Mar 22, 2023 · 0 comments
Assignees
Labels
Milestone

Comments

@MaikoTan
Copy link
Member

MaikoTan commented Mar 22, 2023

Sources

Danbooru, Gelbooru, Konachan.com (NSFW), Konachan.net (SFW), Lolibooru, Safebooru, Yande.re

Describe the problem related to the feature request

It is usual to make typos when inputting tags, need a tokenizer to correct tags and then feed to booru-series sites.

This is also solve the part of the problem of #33

Describe the solution you'd like

Add a tokenizer

Describe alternatives you've considered

Calculate levenshtein distance to find the closest tags.

Additional context

No response

@MaikoTan MaikoTan added this to the v1.0.2 milestone Mar 22, 2023
@MaikoTan MaikoTan modified the milestones: v1.0.2, v1.0.3 Mar 27, 2023
@MaikoTan MaikoTan removed this from the v1.0.3 milestone May 3, 2023
@MaikoTan MaikoTan added this to the v1.2.0 milestone Nov 30, 2023
@MaikoTan MaikoTan modified the milestones: v1.2.0, v1.3.0 Mar 21, 2024
@MaikoTan MaikoTan added the 🆘 help wanted Extra attention is needed label Mar 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants