Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tracking intermediate stages in data flow #10

Open
3 tasks
ByroneCole-SageBionetworks opened this issue Jan 11, 2023 · 0 comments
Open
3 tasks

Tracking intermediate stages in data flow #10

ByroneCole-SageBionetworks opened this issue Jan 11, 2023 · 0 comments
Assignees
Labels
Sage Sage Bionetworks task

Comments

@ByroneCole-SageBionetworks
Copy link

ByroneCole-SageBionetworks commented Jan 11, 2023

  • for each dataset to be submitted in INCLUDE, a project manager completes a data plan form providing:

  • dataset contributor

  • target dataset submission date

  • dataset type: selected among the dataset types available in the INCLUDE data model (e.g. biosample data, demographics

  • clinical data, demographics family history, bulk RNA-seq data, etc.)

  • target release date

  • governance access controls & PHI checklist

  • for each dataset described in the data plan, in addition to indicating target submission date and target release date information, the DFA dashboard tracks the dataset status

  • has been submitted (true/false)

  • metadata have been validated (true/false)

  • data have been QC-ed (true/false)

  • dataset is ready for release (true/false)

  • dataset is released (true/false)

  • the DFA tracks distribution of datasets across data types, contributors and release state, so that project managers and program officers can alert relevant parties for bottlenecks, etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Sage Sage Bionetworks task
Projects
No open projects
Status: No status
Development

No branches or pull requests

3 participants