Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Overwrite date metadata 1970s -> #72

Open
ninpnin opened this issue Jan 16, 2025 · 0 comments
Open

Overwrite date metadata 1970s -> #72

ninpnin opened this issue Jan 16, 2025 · 0 comments

Comments

@ninpnin
Copy link
Contributor

ninpnin commented Jan 16, 2025

Riksdagen has (presumably) accurate date information in the JSON files from the 1970s onwards. We should overwrite the scraped dates from 1970 onwards. While I am relatively certain this will improve the quality of the date metadata, doing the sample check properly will ensure that.

Sidenote: Between 1970 and 1990, most of the records only cover one day, but these records are concatenated into a longer document, which is the one that has been scanned. As a result, each record PDF includes the table of contents for multiple records in the beginning, making annotation slightly more difficult.

@ninpnin ninpnin mentioned this issue Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant