-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
File naming conventions #31
Comments
Can we enforce this at all? What if the data generated is 10k files from some third party software? There's a discussion to be had to as to where we fall on this spectrum:
|
Yes this is true we can't enforce it at all, maybe it is best left up to the user and we can provide a recommendation. Yes this is another key point, how 'far down' the tree we manage and how much we leave to the user. The current preferred choice folder structure is nice for this as we can just provide 'behav', 'ephys' at the data type level (i.e.one level below the session level) and leave it at that (rather than ses-001/ephys/behav/camera etc there was before). For now, we could leave things agnoistic from below the data-type level?
In future either provide options for additional structure if it is very widely used, or alternatively support for creating / searching with custom strings. Related, is everyone happy with a single folder for histology at the subject level? I think this makes sense |
I agree. Maybe some docs on good practice, and we could even print out a recommended filename string.
I think for now this is the best idea.
Yep, as the tool is adopted, we could provide support for a limited set of acquisition setups. Bonsai etc.
Agree. In future we could potentially provide support for |
I agree with Adam. My personal preference would be including sub/ses in the filename, just how BIDS does it, but it will be a headache to enforce for everything. Even BIDS started by supporting a few acquisition types (e.g. T1w, BOLD EPI) before expanding to others. Context on requirements vs recommendationsWhen BIDS validates datasets (see bids-validator), it differentiates between For now I propose the following:
ConclusionThe standard itself should be versioned, improved upon, and expanded by trial and error. Let's start small with minimum requirement and see where we go from there |
Additionally, we can offer some (non-enforced) guidelines on how to store metadata. E.g. use .csv or .tsv for tables/dataframes and .json file for key-value pairs. If a specific metadata file, or table pertains to a specific subject/session/acquisition, it's name should reflect that. |
Also @JoeZiminski , nitpicking some things I noticed in your example directory trees above:
|
That's really nice, how do you think it is best to manage the documentation for the standard vs. datashuttle implementation? Shall we have a single (versioned) help page which introduces BIDS , makes reccomendations? and use the current ephys BEP for the formal standard? cheers for those points I will open / amend isues |
You are right in the sense that The standard itself (let's tentatively call its BIDS-SWC) is probably best hosted as a separate repo, which will be solely documentation, similar to bids-specification. In the future we might also implement a tool like bids-validator to check whether a dataset is BIDS-SWC compliant. We should of course monitor (and contribute to) BEP029 and BEP032, and strive to converge with them over time. That said, the in-house needs already go beyond these BEPs. Those were my first thoughts on this, so fully open to counter-points. |
I think thats a good approach, completely agree |
In terms of docs, we could have multiple repos containing docs, or directories containing docs. These could all be rendered with Sphinx, and hosted using github pages. Something like :
and other tools e.g. github.com/neuroinformatics-unit/behaviour-pipeline/docs -> neuroinformatics-unit.github.io/behaviour-pipeline As adoption increases, the repos and docs could be moved to either SWC, or their own organisation. |
There has been some discussion here on the best file naming convention. Should the sub/ses info be at the file level?
e.g.
My preference is for the longer version, even though it is ugly it is completely unamgibious and protects against some possible worst-case-scenario bugs (that should absolutely never happen, but still e.g. copying a session data to the wrong session / subject). I think this is also neuroimaging BIDS preferred but might be wrong.
The problem is filenames will be created by users, so will be harder to enfore. We could at least provide some functionality to copy to clipboard the correct prefix based on the cwd or something.
The text was updated successfully, but these errors were encountered: