Workflow B: SemMedDB "Novelty" constraint #313

Open
karafecho opened this issue Sep 30, 2021 · 1 comment
@karafecho

This issue follows up on Andrew Su's suggestion to consider ways to leverage SemMedDB's "Novelty" constraint by setting Novelty=0, either in the query itself or via another approach.

@andrewsu

Just some quick notes about how we can check whether this is likely to have much/any impact before we actually write any additional code.

I believe the issue to be addressed can be seen in the ARAX response for Query B.1. The observation is that many non-specific nodes (e.g., "cytokine", "MicroRNAs", "agonists") are appearing in the results. The hypothesis is that filtering on the "novelty" node property in SemMedDB (eliminating nodes where novelty = 0) would enrich for results that SMEs would care about.
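To make the hypothesis concrete, here is a minimal sketch of the proposed filter in Python. It assumes each result node has already been loaded as a dict carrying a `novelty` flag; the field names, the helper `drop_non_novel`, and the sample values are all hypothetical and only illustrate the idea.

```python
from typing import Any, Dict, List

Node = Dict[str, Any]


def drop_non_novel(nodes: List[Node], novelty_key: str = "novelty") -> List[Node]:
    """Return the nodes whose novelty flag is not 0.

    Nodes that carry no novelty value at all are kept, so that missing
    annotations do not silently remove results.
    """
    return [node for node in nodes if node.get(novelty_key) != 0]


# Invented placeholder records, NOT real SemMedDB values.
example_nodes = [
    {"name": "generic_term_A", "novelty": 0},
    {"name": "specific_term_B", "novelty": 1},
    {"name": "unscored_term_C"},
]

kept = drop_non_novel(example_nodes)
kept_names = [node["name"] for node in kept]
print(kept_names)
```

Keeping unscored nodes is a design choice for the sketch; dropping them instead would be an equally defensible option to compare during spot checking.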

The Service Provider has created a new API for SemMedDB that includes the novelty score at https://biothings.ncats.io/semmeddb. So let's check a few of the B.1 results to see what their novelty scores are:

I hope I've illustrated how the SemMedDB novelty score can currently be accessed. Perhaps an SME or analyst working on workflow B could do more spot checking on the nodes they would consider to be "too generic"?
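For anyone who wants to script the spot checking, the sketch below builds a query URL for one node name without sending any request. It assumes the API follows the usual BioThings `/query` convention with `q`, `fields`, and `size` parameters, and that the novelty value is exposed under fields named like `object.novelty`; the endpoint path, the field names, and the helper `build_novelty_query_url` are assumptions to verify against the API's own documentation.

```python
from urllib.parse import urlencode

# Base URL taken from the comment above; the "/query" path is an assumption.
BASE_URL = "https://biothings.ncats.io/semmeddb"


def build_novelty_query_url(node_name: str, role: str = "object", size: int = 5) -> str:
    """Build a URL that looks up records for one node name.

    `role` selects whether the name is matched as the subject or the object.
    The field names "<role>.name" and "<role>.novelty" are hypothetical.
    """
    if role not in ("subject", "object"):
        raise ValueError("role must be 'subject' or 'object'")
    params = {
        "q": f'{role}.name:"{node_name}"',
        "fields": f"{role}.name,{role}.novelty",
        "size": size,
    }
    return f"{BASE_URL}/query?{urlencode(params)}"


url = build_novelty_query_url("cytokine")
print(url)
```

Opening the printed URL in a browser, or fetching it with any HTTP client, should be enough to see whether the response contains a novelty value for the chosen node.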
