Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DigiPres Publications Index v2.0 #5

Open
anjackson opened this issue Mar 7, 2024 · 1 comment
Open

DigiPres Publications Index v2.0 #5

anjackson opened this issue Mar 7, 2024 · 1 comment

Comments

@anjackson
Copy link
Collaborator

anjackson commented Mar 7, 2024

Leading on from #2

Proposed features

  • Zenodo/Zotero/Google Sheet as faceted sources.

Ideas

From Micky:

Are there some parts where we do need community editing workflows to manage some aggregation data? Like the iPRES conference metadata? Are there tools for supporting analysis and visualisation? See digipres/registries-of-practice-project#16

@anjackson
Copy link
Collaborator Author

That paper at iPRES, applying https://maartengr.github.io/BERTopic/ to a different corpus of digital preservation papers, seemed to mirror what I'd found with spacy. You don't get much that makes sense when you've only got metadata to work with. I suspect this is generally true that domains with terms of art and difficult to integrate with generic language tools, at least without a decently large corpus. Perhaps this needs the full-text to be in place?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: To Do
Development

No branches or pull requests

1 participant