
dataset-metadata #15

Open. Wants to merge 4 commits into master.

satra commented Feb 14, 2020

This is simply a starting point for discussion


tgbugs commented Feb 14, 2020

identifier: REQUIRED
repository: REQUIRED
url: REQUIRED
publications:
Contributor

The generalized version of this is related identifiers, of which publications are a subset; see also the relation types in table 4 from the previous comment.

Contributor Author

this is going to be a constant tussle, and we should decide whether we want a specific node, or the more general node.
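
To make the trade-off concrete, here is a sketch of what the more general node could look like. The `relatedIdentifiers` key and its field names are hypothetical, the identifier values are placeholders, and the relation values assume a DataCite-style relation vocabulary rather than anything decided in this PR.

```yaml
# Hypothetical generalization of `publications`; key names are illustrative only.
relatedIdentifiers:
  - identifier: "https://doi.org/10.xxxx/placeholder"   # placeholder, not a real DOI
    identifierType: DOI
    relation: IsDescribedBy    # a paper that describes this dataset
  - identifier: "PMC0000000"                            # placeholder PMCID
    identifierType: PMCID
    relation: IsCitedBy        # a paper that cites this dataset
```

Under this shape, `publications` would simply be the subset of entries whose relation points at a paper, at the cost of a less specific node name.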

email: Recommended # filled from orcid
name: Recommended # filled from orcid
affiliations: optional # filled from orcid
sponsors:
Contributor

A grant number, or funding field seems to be missing here?

Collaborator

Definitely need a field for award number, but it should be optional. For example, at the Allen Institute we also acknowledge the Allen Institute for Brain Science for support, but there are no specific award IDs.
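
A possible shape for this, as a sketch only: `awardNumber` is a hypothetical field name, and the second sponsor and its award value are placeholders.

```yaml
sponsors:
  # Institutional support with no specific award id: the field is simply omitted.
  - name: Allen Institute for Brain Science
    url: https://alleninstitute.org
  # Grant-funded support: the optional award number is filled in.
  - name: Example Funding Agency        # placeholder
    awardNumber: "XX-000000"            # hypothetical optional field, placeholder value
```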

name: REQUIRED
url: RECOMMENDED
license:
- url # REQUIRED
Contributor

Potential controlled source? https://spdx.org/licenses/

Collaborator

Ooh, nice source of license controlled vocabulary.

  1. Should we log an identifier for the license here, AND the url?
  2. Custom licenses: Right now in BICCN the policy is that all data generated by BICCN should be CC-BY-4.0. HOWEVER, it's possible that at some point we ingest data that has a different license attached for some reason. If that license is a custom license and not included in that list, what do we use this url for?
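
One way both questions could be handled, sketched under assumptions: the `identifier` key is hypothetical (the diff only has `url`), and the custom entry uses placeholder values. SPDX reserves the `LicenseRef-` prefix for licenses that are not on its list, so a custom license could carry such an identifier while `url` points at the full license text.

```yaml
license:
  # Standard case: SPDX short identifier plus the canonical license URL.
  - identifier: CC-BY-4.0
    url: https://creativecommons.org/licenses/by/4.0/
  # Custom case: not on the SPDX list, so the identifier is user-defined.
  - identifier: LicenseRef-example-custom      # placeholder
    url: https://example.org/custom-license    # placeholder; where the license text lives
```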

description: REQUIRED
contributors: # required for author and contact
- orcid: # REQUIRED
roles: # REQUIRED from https://casrai.org/credit/ + maintainer, contact
Collaborator

Thanks for the link to casrai.org/credit - looks useful! Would you just want to use the role alone, or include optional additional text, much as is provided in an acknowledgement in a paper?
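
If optional text were allowed, each role could become a small mapping instead of a bare string. This is only a sketch: the `role` and `note` keys are hypothetical, and the ORCID shown is ORCID's own documented example iD, not a real contributor.

```yaml
contributors:
  - orcid: "0000-0002-1825-0097"     # ORCID's documented example iD
    roles:
      - role: Data curation           # CRediT term
        note: Organized and annotated the imaging sessions   # hypothetical optional free text
      - role: contact                 # one of the extra roles proposed in this PR
```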

@@ -0,0 +1,85 @@
identifier: REQUIRED ## Post upload (or during dandi organize)
name: REQUIRED
Collaborator

Prefer to call this "title" to encourage people to use a formal title, but doesn't matter so long as people provide appropriate info.

identifier: REQUIRED ## Post upload (or during dandi organize)
name: REQUIRED
description: REQUIRED
contributors: # required for author and contact
Collaborator

Add "recommended for other major contributors"

name: REQUIRED
description: REQUIRED
contributors: # required for author and contact
- orcid: # REQUIRED
Collaborator

One comment - PIs are likely to have an ORCID, but other contributors, such as the lead RA who generated the data, may not. What if that lead RA has left the lab and can't be convinced to create an ORCID?
ORCIDs seem like a good choice, but I can see cases where we won't have them, because they have to be created by the individual - unless someone knows of a way around that?

Contributor Author

for the moment we are going with orcid, but will allow people to add name/email/affiliation.
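
A sketch of how the two cases might sit side by side, assuming the `name`, `email`, and `affiliations` fields from the diff are entered by hand when there is no ORCID to fill them from. All values are placeholders; the ORCID is ORCID's documented example iD.

```yaml
contributors:
  # Contributor with an ORCID: name, email, and affiliations can be filled from it.
  - orcid: "0000-0002-1825-0097"
    roles: [Conceptualization, contact]
  # Contributor without an ORCID: details are entered manually.
  - name: Example Researcher            # placeholder
    email: researcher@example.org       # placeholder
    affiliations: [Example Lab]         # placeholder
    roles: [Investigation]
```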

BICCN-policies/dataset-metadata.yaml (outdated)
releaseDate: REQUIRED
associatedData:
- name: REQUIRED
identifier: REQUIRED
Collaborator

some data may have both an identifier and a DOI
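
For example, an entry could carry both side by side. This is a sketch with placeholder values, and treating `doi` as an optional sibling of `identifier` is an assumption, not something the diff states.

```yaml
associatedData:
  - name: Example raw dataset                # placeholder
    identifier: "repo-internal-0001"         # placeholder repository-local identifier
    doi: "10.xxxx/placeholder"               # placeholder; assumed optional alongside identifier
    repository: Example Archive              # placeholder
    url: https://example.org/datasets/0001   # placeholder
```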

publications:
- url: REQUIRED # doi preferred
identifiers: RECOMMENDED # PMCID
relation: RECOMMENDED
Collaborator

Is there a controlled vocab for this?

## Post publication (i.e. added by DANDI)
version: REQUIRED
releaseDate: REQUIRED
associatedData:
Collaborator

Associated data is set up here to refer to the raw data. The associated data has a repository and url.
During the collection of dataset metadata from BICCN labs, several labs provided me with an alternate data access URL like brainome.org. Where should we model this alternate/related data access URL?

relation: RECOMMENDED
doi: REQUIRED
url: REQUIRED
repository: REQUIRED
Collaborator

Why is there a second repository listed? Did I misinterpret the first "Associated Data" section?

url: RECOMMENDED
license:
- url # REQUIRED
keywords: [key1, key2,]
Collaborator

Related to keywords - we would want to capture the modality and relevant technique(s) for BICCN purposes.
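
One option, sketched with hypothetical field names and placeholder values, is to keep free-form `keywords` and add dedicated fields so modality and technique can be queried separately:

```yaml
keywords: [key1, key2]
modality: example-modality              # hypothetical field, placeholder value
techniques:                             # hypothetical field
  - example-technique-1                 # placeholder
  - example-technique-2                 # placeholder
```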

satra added 2 commits March 3, 2020 15:15
@almightyyakob just a ping to the subfield.
satra mentioned this pull request Mar 20, 2020

satra commented Mar 20, 2020

i'll take another pass through this soon and align as much as possible with the DATS object.

3 participants