Data Histories

This repository attempts to reconstruct various bioinformatics data histories using crowdsourced data sets, to create new version controlled data sets that can be examined over time.

Why?

When working with older published data sets, the lack of historical data access makes it extremely difficult to reproduce the analysis, verify data, or update results to modern annotations. Because of this, it is nearly impossible to estimate how rigorous or reproducble a dataset is!

Tools

This repository contains folders with the scripts, code, and other descriptive README documents that are used to populate the repositories at http://github.com/joiningdatasets/

License

The scripts and other files in this repository are made available under the CC0 / Public Domain license, similar to many of the data sets we use that have been created by the federal government.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
gene_info		gene_info
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Histories

Why?

Tools

License

About

Releases

Packages

Languages

License

joiningdata/data_histories

Folders and files

Latest commit

History

Repository files navigation

Data Histories

Why?

Tools

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages