A boilerplate for reproducible and transparent science with close resemblances to the philosophy of Cookiecutter Data Science: A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. Includes integration with mybinder.
Install cookiecutter
command line: pip install cookiecutter
To start a new science project:
cookiecutter gh:leukgen/cookiecutter-reproducible-science
.
├── AUTHORS.md
├── LICENSE
├── README.md
├── bin <- Your compiled model code can be stored here (not tracked by git)
├── binder <- Files to setup mybinder integration (optional)
│ ├── apt.txt <- apt install debian packages
│ ├── install.R <- Install R packages
│ ├── postBuild <- Additional setup such as adding jupyterlab extensions
│ ├── requirements.txt <- Python requirements
│ └── runtime.txt <- Specifiy R version or change to python2 with `python-2.7`
├── config <- Configuration files, e.g., for doxygen or for your model if needed
├── data
│ ├── external <- Data from third party sources.
│ ├── interim <- Intermediate data that has been transformed.
│ ├── processed <- The final, canonical data sets for modeling.
│ └── raw <- The original, immutable data dump.
├── docs <- Documentation, e.g., doxygen or scientific papers (not tracked by git)
├── notebooks <- Ipython or R notebooks
├── reports <- For a manuscript source, e.g., LaTeX, Markdown, etc., or any project reports
│ └── figures <- Figures for the manuscript or reports
└── src <- Source code for this project
├── data <- scripts and programs to process data
├── external <- Any external source code, e.g., pull other git projects, or external libraries
├── models <- Source code for your own model
├── tools <- Any helper scripts go here
└── visualization <- Scripts for visualisation of your results, e.g., matplotlib, ggplot2 related.
This project is licensed under the terms of the BSD License