Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document crawl removal from playback #7

Open
edsu opened this issue May 23, 2022 · 1 comment
Open

Document crawl removal from playback #7

edsu opened this issue May 23, 2022 · 1 comment
Labels
blocked prereqs for this ticket aren't done yet web archiving 2022 web archiving work cycle

Comments

@edsu
Copy link
Contributor

edsu commented May 23, 2022

We occasionally need to block particular crawls from playback, so that they don't show up as links to view in SearchWorks. This is usually because the crawl was "not good" or lacked some resources for a quality playback. There aren't controls for this in Argo, but it would be good to document how to do this manually in DevOpsDocs.

@edsu edsu added the web archiving 2022 web archiving work cycle label May 23, 2022
@ndushay ndushay added the blocked prereqs for this ticket aren't done yet label Jun 3, 2022
@lwrubel
Copy link
Contributor

lwrubel commented Jun 15, 2022

This requires an enhancement to pywb, blocking a URL via access controls for a specified time period. The goal is to prevent the problem capture from displaying or being included in TimeMap API responses.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
blocked prereqs for this ticket aren't done yet web archiving 2022 web archiving work cycle
Projects
None yet
Development

No branches or pull requests

3 participants