Skip to content

Latest commit

 

History

History
20 lines (17 loc) · 887 Bytes

README.md

File metadata and controls

20 lines (17 loc) · 887 Bytes

Data Histories

This repository attempts to reconstruct various bioinformatics data histories using crowdsourced data sets, to create new version controlled data sets that can be examined over time.

Why?

When working with older published data sets, the lack of historical data access makes it extremely difficult to reproduce the analysis, verify data, or update results to modern annotations. Because of this, it is nearly impossible to estimate how rigorous or reproducble a dataset is!

Tools

This repository contains folders with the scripts, code, and other descriptive README documents that are used to populate the repositories at http://github.com/joiningdatasets/

License

The scripts and other files in this repository are made available under the CC0 / Public Domain license, similar to many of the data sets we use that have been created by the federal government.