Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NL] Experiment: add de and het to skip_list #2608

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

TheFes
Copy link
Contributor

@TheFes TheFes commented Nov 22, 2024

Adds de and het to skip list, which reduces the total possible sentence options from roughly 17 billion to roughly 380 million.

Will potentially give issues if articles are used in the name, eg lamp in de hoek (light in the corner)
Not stripping skip_words from names would prevent this issue.

More improvements could be made by adding words like mijn or m'n (my).

@synesthesiam
Copy link
Contributor

The problem is we don't know which text is part of the name before matching starts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants