authors: catastrophic backtracking in regex #26
Comments
This is the article that causes the issue; it should be reharvested once this is fixed: arXiv:1704.00841
Works around inspirehep/refextract#26 by interrupting the runaway refextract process. Signed-off-by: Samuele Kaplun <[email protected]>
@tsgit are you by chance going to work on this issue in the near future? For the time being we have a workaround, but the approach you outlined in chat sounded way better than a workaround.
Times out the `refextract` task after 300 seconds to work around inspirehep/refextract#26, which would otherwise block a Celery worker indefinitely. Signed-off-by: Samuele Kaplun <[email protected]>
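For readers who want to see the idea behind this workaround in isolation, here is a minimal standard-library sketch of bounding a runaway job with a timeout. It is not the Celery-based change from the commit above: `run_with_timeout` is a hypothetical helper name, and running the work in a child Python process is an assumption made only to keep the example self-contained.

```python
import subprocess
import sys


def run_with_timeout(code, seconds):
    """Run a Python snippet in a child process.

    Returns True if the child finished in time, False if it was killed
    because the timeout expired. (Hypothetical helper, for illustration.)
    """
    try:
        # subprocess.run kills the child and raises TimeoutExpired
        # once `seconds` have elapsed without the child exiting.
        subprocess.run([sys.executable, "-c", code], timeout=seconds, check=True)
        return True
    except subprocess.TimeoutExpired:
        return False


if __name__ == "__main__":
    # A quick job finishes; a job that would run too long is interrupted.
    print(run_with_timeout("print('done')", 30))
    print(run_with_timeout("import time; time.sleep(30)", 0.5))
```

The same pattern with a 300 second limit would stop a hung extraction from holding its worker indefinitely, at the cost of losing that one record until the regex itself is fixed.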
@kaplun yes, very high on my todo list. Unfortunately it got pushed back by AAHEP, vacation, surgery and some other business. By next week!
How to reproduce:
This hangs `refextract` for days, at least. The reason appears to be catastrophic backtracking in this regex:
refextract/refextract/authors/regexs.py, lines 491 to 494 in 27588da
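To illustrate what catastrophic backtracking looks like in general, here is a small sketch. The patterns `NESTED` and `FLAT` are hypothetical stand-ins, not the regex from `regexs.py`, and `time_match` is a helper invented for this example. A nested quantifier lets the engine split a run of characters in exponentially many ways before it can report a failed match, so each extra input character roughly doubles the run time.

```python
import re
import time

# Hypothetical pattern, NOT the one from refextract/authors/regexs.py.
# The nested quantifier (a+)+ is the classic trigger for backtracking blow-up.
NESTED = re.compile(r"^(a+)+$")
# Pattern accepting the same strings without the nested quantifier.
FLAT = re.compile(r"^a+$")


def time_match(pattern, text):
    """Return (matched, seconds) for one match attempt."""
    start = time.perf_counter()
    matched = pattern.match(text) is not None
    return matched, time.perf_counter() - start


if __name__ == "__main__":
    for n in (14, 16, 18, 20):
        # The trailing "b" forces the match to fail, so the nested pattern
        # has to try every way of splitting the run of "a" characters.
        text = "a" * n + "b"
        _, slow = time_match(NESTED, text)
        _, fast = time_match(FLAT, text)
        print(f"n={n:2d} nested={slow:.4f}s flat={fast:.6f}s")
```

Both patterns agree on every input; only the time to reject a near-miss differs. A reference string that almost matches an author pattern could plausibly drive a similar blow-up, which would be consistent with the hang described here.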