feat: extract wordchars from lunr-languages #150

dhdaines · 2024-07-04T18:13:18Z

See #149 (doesn't fix the whole thing)

dhdaines · 2024-07-04T18:19:30Z

Note also that you could also just add {r'\w'} to all_word_characters in the same way as you do for the default pipeline.

dhdaines · 2024-07-04T19:46:28Z

In actual fact we should add \w to them, because otherwise they will remove numbers at the end of search terms, which is almost certainly not what you want for a lot of applications! But... this is not bug-compatible with lunr-languages, so it might just need a documented workaround.

dhdaines · 2024-07-06T16:12:22Z

You may not really want to do this, it seems the trimmers in lunr-languages are full of weird junk: MihaiValentin/lunr-languages#66

dhdaines · 2024-07-06T18:42:51Z

Hmm. It turns out, actually, that lunr-languages code is generated programmatically as well. So it doesn't make a lot of sense to parse it to create these. I'm closing this PR and will come up with a better way to do this.

dhdaines added 2 commits July 4, 2024 14:10

feat: extract wordchars from lunr-languages

c1be11c

docs: update docstring

138ace3

dhdaines closed this Jul 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: extract wordchars from lunr-languages #150

feat: extract wordchars from lunr-languages #150

dhdaines commented Jul 4, 2024

dhdaines commented Jul 4, 2024

dhdaines commented Jul 4, 2024 •

edited

Loading

dhdaines commented Jul 6, 2024

dhdaines commented Jul 6, 2024

feat: extract wordchars from lunr-languages #150

feat: extract wordchars from lunr-languages #150

Conversation

dhdaines commented Jul 4, 2024

dhdaines commented Jul 4, 2024

dhdaines commented Jul 4, 2024 • edited Loading

dhdaines commented Jul 6, 2024

dhdaines commented Jul 6, 2024

dhdaines commented Jul 4, 2024 •

edited

Loading