You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
One suggestion: the proposal seems to address running code more in length than it addresses its relation to data. It may be useful to promote the idea of coupling of code and data more closely, e.g. via git-annex or git-lfs tools, that permit researchers to maintain versioning of both software and data in the same place, even though the data is located on some remote storage service due to its size.
For services like Zenodo, this would open an easy possibility to archive not only software, but also (reasonably sized) datasets at the time of the release, for example.
The text was updated successfully, but these errors were encountered:
@tiborsimko That's indeed an important issue, but difficult to deal with in our proposal, for two reasons: (1) Executability and linking with data are nearly orthogonal issues and (2) Depending on the size and nature of data, very different technical solutions are required.
What we could do is to mention the issue in some kind of outlook - something we'd look at in phase II.
@tiborsimko do you know if at CERN the EOS people have looked into having EOS as a git-lfs backend? (for non-CERNies, EOS is CERN's multi-PB storage solution)
Nice proposal! Many things in the pitch are exactly what we try to achieve within the context of the CERN Open Data service and the CERN Analysis Preservation pilot.
One suggestion: the proposal seems to address running code more in length than it addresses its relation to data. It may be useful to promote the idea of coupling of code and data more closely, e.g. via git-annex or git-lfs tools, that permit researchers to maintain versioning of both software and data in the same place, even though the data is located on some remote storage service due to its size.
For services like Zenodo, this would open an easy possibility to archive not only software, but also (reasonably sized) datasets at the time of the release, for example.
The text was updated successfully, but these errors were encountered: