This repository attempts to reconstruct various bioinformatics data histories using crowdsourced data sets, to create new version controlled data sets that can be examined over time.
When working with older published data sets, the lack of historical data access makes it extremely difficult to reproduce the analysis, verify data, or update results to modern annotations. Because of this, it is nearly impossible to estimate how rigorous or reproducble a dataset is!
This repository contains folders with the scripts, code, and other descriptive README documents that are used to populate the repositories at http://github.com/joiningdatasets/
The scripts and other files in this repository are made available under the CC0 / Public Domain license, similar to many of the data sets we use that have been created by the federal government.