ADF template updates to align with Healthcare data solution in MS Fabric #645
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR includes changes in
Copy DICOM Metadata Changes to ADLS Gen2 in Delta Format
ADF template to align with Healthcare data solution in MS Fabric.partitionName
andlastModifiedTimestamp
. Example,studyDate
changed tostudydate
.metadata
object and exposed as columns in the table. The below dicom tags were additionally exposed to the table as columns:patientbirthdate, accessionnumber, referringphysicianname, modalitiesinstudy, performedprocedurestepstartdate, manufacturermodelname, studytime, timezoneoffsetfromutc, numberofstudyrelatedseries, numberofstudyrelatedinstances, seriesnumber, seriesdescription, numberofseriesrelatedinstances, bodypartexamined, laterality, seriesdate, seriestime, instancenumber, documenttitle
.metadata
- the metadata object we get from changefeed apimetadata_string
- the stringified version of the metadata objectcreated_date
- currentTimestamp()metadata
column coming from AHDS changefeed api is projected differently than in original implementation as mentioned below:Current projection:
Proposed projection:
studyDate
is being formatted to a date type(yyyy-mm-dd
format). Where as in the proposed changes, there are two columnsstudydate
,studydate_formatted
which contains the raw data (inyyyymmdd
format) coming from themetadata
and formatted column value (inyyyy-mm-dd
format) respectively.instance
table written. With the proposed changesinstance
columns are in lowercase. So, changes are made to project the columns as expected (camel case) byseries
dataflow.study
table.