You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
in our project we are looking at patients with subarachnoid hemorrhage (SAH) diagnosis; such patients often undergo neurological assessment which includes grading scores such as Hunt and Hess, WFNS, or (Modified) Fisher scale. Such scores are often gathered during admission and reported in the discharge summary. Extracting these scores from free-text notes can be useful for downstream applications.
It seems that the description of these scores in the notes is masked. For example in MIMIC-III, the TEXT field of the entry with HADM_ID=167857 and CATEGORY="Discharge summary" in NOTEEVENTS.csv.gz has the Hess part of Hunt and Hess masked. Further, the subsequent score name in the same entry is completely masked, making it impossible to recover.
In MIMIC-IV, a similar phenomena can be observed, albeit slightly different. For the text field in entry in mimic-iv-note/2.2/note/discharge.csv.gz with note_id=13317644-DS-20, both Hunt and Hess are masked whereas Fisher is not masked.
I wonder if the context-specific rules can be added to the deidentification algorithm, similarly as suggested in #1507 ?
Thanks a lot for your efforts of maintaining and further developing the MIMIC database, it is a great resource!
Best,
Tilman
The text was updated successfully, but these errors were encountered:
Prerequisites
Description
Hi,
in our project we are looking at patients with subarachnoid hemorrhage (SAH) diagnosis; such patients often undergo neurological assessment which includes grading scores such as Hunt and Hess, WFNS, or (Modified) Fisher scale. Such scores are often gathered during admission and reported in the discharge summary. Extracting these scores from free-text notes can be useful for downstream applications.
It seems that the description of these scores in the notes is masked. For example in MIMIC-III, the
TEXT
field of the entry withHADM_ID=167857
andCATEGORY="Discharge summary"
inNOTEEVENTS.csv.gz
has the Hess part of Hunt and Hess masked. Further, the subsequent score name in the same entry is completely masked, making it impossible to recover.In MIMIC-IV, a similar phenomena can be observed, albeit slightly different. For the
text
field in entry inmimic-iv-note/2.2/note/discharge.csv.gz
withnote_id=13317644-DS-20
, both Hunt and Hess are masked whereas Fisher is not masked.I wonder if the context-specific rules can be added to the deidentification algorithm, similarly as suggested in #1507 ?
Thanks a lot for your efforts of maintaining and further developing the MIMIC database, it is a great resource!
Best,
Tilman
The text was updated successfully, but these errors were encountered: