Add per-sample flag to ignore sample for background database computation #1682

Open
xiamaz opened this issue Jun 6, 2024 · 3 comments
Labels
dataflow-v2 Requires new data flow for implementation enhancement New feature or request

Comments

@xiamaz
Collaborator

xiamaz commented Jun 6, 2024

Is your feature request related to a problem? Please describe.
Currently it is only possible to ignore entire projects for background database computation. While this allows ignoring data in test projects, there are use cases where samples need to be duplicated across multiple cases (for example, analysis of a second index in the same family). The repeated samples should not be included in the background database computation, whereas new samples should be.

Describe the solution you'd like
During case creation, a flag should exist that can be used to exclude individual samples from background database computation.
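As a rough illustration of the requested behaviour, here is a minimal sketch, not tied to the project's actual data model. The `Sample` class, the `exclude_from_background` field and the `background_samples` helper are all hypothetical names made up for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """Hypothetical sample record; field names are illustrative only."""

    name: str
    case: str
    # The proposed per-sample flag (hypothetical name).
    exclude_from_background: bool = False


def background_samples(samples):
    """Return only the samples that should contribute to the background database."""
    return [s for s in samples if not s.exclude_from_background]


samples = [
    Sample("index-1", case="family-A-case-1"),
    Sample("father", case="family-A-case-1"),
    # Second case in the same family: the father is a repeated sample,
    # so it is flagged, while the new index sample is still included.
    Sample("index-2", case="family-A-case-2"),
    Sample("father", case="family-A-case-2", exclude_from_background=True),
]

included = background_samples(samples)
print([(s.name, s.case) for s in included])
```

With a flag like this, the second case would contribute only its new index sample, and the repeated father sample would be counted once.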

Describe alternatives you've considered
Cases with existing samples could be created in a separate project that is excluded from background database computation. However, this would also exclude the new samples in those cases from the background database computation.

Additional context
Further in the future, relatedness information in the form of a fingerprint should be made available to allow for more generalizable identification of duplicated samples.

@xiamaz xiamaz added the enhancement New feature or request label Jun 6, 2024
@xiamaz xiamaz moved this to In review in Release Planning Jun 6, 2024
@xiamaz
Collaborator Author

xiamaz commented Jun 6, 2024

This ticket is required to implement the changes necessary to fulfill the processing needs of the model initiative.

@xiamaz
Collaborator Author

xiamaz commented Jun 13, 2024

Can only be solved in the new data flows application.

@xiamaz
Collaborator Author

xiamaz commented Jun 13, 2024

Can be solved via case manifest attributes in the imported phenopacket format.
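A minimal sketch of how such an attribute might be read at import time, assuming a JSON manifest. The structure and key names below (`samples`, `attributes`, `exclude_from_background`) are hypothetical placeholders and do not reflect the actual phenopacket schema or the final manifest layout.

```python
import json

# Hypothetical manifest fragment; the keys are illustrative assumptions,
# not fields defined by the phenopacket schema.
manifest_json = """
{
  "case": "family-A-case-2",
  "samples": [
    {"id": "index-2", "attributes": {}},
    {"id": "father", "attributes": {"exclude_from_background": true}}
  ]
}
"""


def excluded_sample_ids(manifest):
    """Collect IDs of samples whose attributes request exclusion."""
    return [
        sample["id"]
        for sample in manifest["samples"]
        if sample.get("attributes", {}).get("exclude_from_background", False)
    ]


manifest = json.loads(manifest_json)
excluded = excluded_sample_ids(manifest)
print(excluded)
```

Treating a missing attribute as `False` would keep existing manifests valid, so samples would be included in the background database unless explicitly flagged.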

@xiamaz xiamaz moved this from In review to Backlog in Release Planning Jun 13, 2024
@xiamaz xiamaz added the dataflow-v2 Requires new data flow for implementation label Jun 13, 2024