-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lookup dictionary for pretrained embedding #3
Comments
Hello Victor, I have been looking into this work recently, I think that CUI mapping files / scripts to convert can be found in the repository for embeddings: https://github.com/clinicalml/embeddings/tree/master/eval Cheers |
These are UMLS concept unique identifier(CUI) Examples from https://arxiv.org/pdf/1804.01486.pdf
UMLS CUIs can be browsed on https://uts.nlm.nih.gov/metathesaurus.html |
Came across this post while looking for information on the meaning of the columns in the If we were to load this csv file into a database, what kind of schema should we create? (Or does it even make sense to load this into a database in the first place?) I have read the |
v1,...,v500 are the 500 dimensional vector embedding for the CUIs. Quoting the paper from Section 4.1:
Loading cui2vec: As a pre-requisite, you should read about word embeddings e.g. word2vec. |
Hi Andrew,
Do you have a lookup dictionary for the pretrained embeddings? I saw in the embedding file, the "medical concepts" are in format of "CXXXX", not sure if they are ICD codes, procedure codes or something else.
Thanks!
The text was updated successfully, but these errors were encountered: