OHBM Hackathon 2021 TrainTrack Session - Reproducible Workflows

Welcome to the code and content repository of the hands-on session "I’d like to reproduce your results…" and other tales in Reproducible Workflows, part of the TrainTrack of the OHBM BrainHack 2021.

Presenters:

Stephan Heunis and Şeyma Bayrak

Abstract

Almost all researchers have data and analysis scripts that generate results in the form of figures. Yet, few other researchers can use these exact data and scripts to generate the same figures, or to reproduce all results of the study. In this session, we’ll take you on a journey of building reproducible workflows that help alleviate the anxiety associated with receiving that dreaded email 'I’d like to reproduce your results...'

We’ll start with helping others run your code on their machines, and end up with a fully reproducible workflow running in the cloud, with several pit stops in between.

Scenario

As a researcher working in neuroimage analysis, you (the person following this hands-on session) have recently published a paper using cortical thickness data from MICA-MNI.

Your paper described an analysis pipeline to compare the thickness in various brain regions for a group of 259 participants, and your results section contains several figures including a visualization of statistical test values.

You receive an email from a colleague asking if you can send them the necessary code, data and instructions to reproduce these results.

Goals

By the end of this session, you should be able to do the following STEPS:

Set up a requirements.txt file that specifies package requirements
Specify and set up a virtual environment to install requirements
Share code, installation, and running instructions via GitHub
Transform your code into a Jupyter notebook
Set up your code repository to run in the cloud with Binder
Understand how containers can play a role in this context
Understand the benefits of data management with DataLad

Slides

The session follows these slides step by step.

Computational environment

Some parts of the session will be run in a Binder-based computational environment in the cloud.

One environment demonstrates the use of requirements.txt as the configuration file for Binder. Access it here:

Another environment demonstrates the use of environment.yml as the configuration file for Binder. Access it here:

The latter environment builds from the conda-env branch.
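Binder builds its environment from configuration files found in the repository, such as the requirements.txt and environment.yml mentioned above. As a rough sketch, the hypothetical helper below lists which of a few such files exist in a directory. The name `binder_config_files` is illustrative and is not part of the repository or of Binder, and the file list is an assumption limited to four names that repo2docker (the tool behind Binder) recognizes.

```shell
# Hypothetical helper: print the Binder configuration files found in a
# directory (defaults to the current directory).
binder_config_files() {
    for f in environment.yml requirements.txt apt.txt postBuild; do
        # Print the name only if a regular file with that name exists.
        if [ -f "${1:-.}/$f" ]; then
            echo "$f"
        fi
    done
}
```

Running `binder_config_files .` from the root of a clone shows which files Binder would have available on the branch you have checked out.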

Local installation

If you'd like to install the full repository (code, data, configuration files, presentation slides, etc.) on your machine, please follow these instructions:

1. Install required system-level apps

These are listed in apt.txt
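On an apt-based system such as Debian or Ubuntu, these packages could be installed in one go once you have a local copy of apt.txt (for example after cloning the repository). The sketch below assumes apt.txt holds one package name per line; `list_apt_packages` is a hypothetical helper name, not something the repository provides.

```shell
# Hypothetical helper: print the package names in an apt.txt-style file,
# skipping blank lines and lines that start with '#'.
list_apt_packages() {
    grep -Ev '^[[:space:]]*(#|$)' "${1:-apt.txt}"
}

# Example (needs sudo and an apt-based system, so it is left commented out):
# list_apt_packages apt.txt | xargs sudo apt-get install -y
```

Printing the list first lets you review what will be installed before handing it to `apt-get`.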

2. Clone the repository

git clone https://github.com/ohbm/handson-2021-reproducible-workflows.git

3. Create a virtual environment

With virtualenv:

pip install virtualenv
virtualenv --python=python3.6 mypythonenv
source mypythonenv/bin/activate

With conda:

# First install miniconda: https://docs.conda.io/en/latest/miniconda.html
conda create -n mypythonenv python=3.6
conda activate mypythonenv
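Before installing anything, it can help to confirm that the environment is really active. This is a minimal sketch, assuming the virtualenv activate script has set `VIRTUAL_ENV` or that conda has set `CONDA_DEFAULT_ENV`; `active_env_name` is a hypothetical helper name.

```shell
# Hypothetical helper: print the name of the active virtualenv or conda
# environment, or "none" if neither variable is set.
active_env_name() {
    if [ -n "${VIRTUAL_ENV:-}" ]; then
        # virtualenv stores the full path; keep only the directory name.
        basename "$VIRTUAL_ENV"
    elif [ -n "${CONDA_DEFAULT_ENV:-}" ]; then
        echo "$CONDA_DEFAULT_ENV"
    else
        echo "none"
    fi
}
```

After either activation command above, `active_env_name` should print `mypythonenv`.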

4. Install dependencies

With pip and requirements.txt (main branch):

pip install -r requirements.txt
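Pinned versions make it easier for someone else to rebuild the same environment later. As a small sketch, assuming requirements.txt holds one requirement per line, the hypothetical helper `unpinned_requirements` prints any entries that lack an exact `==` version pin. It is a plain text check and does not understand every requirement syntax pip accepts.

```shell
# Hypothetical helper: print requirement lines that are not pinned with '=='.
# Blank lines and comment lines are ignored.
unpinned_requirements() {
    grep -Ev '^[[:space:]]*(#|$)' "${1:-requirements.txt}" | grep -v '==' || true
}
```

An empty result suggests every listed package is pinned to an exact version.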

With conda and environment.yml (conda-env branch):

conda env create -f environment.yml

Additional dependencies can only be installed as follows:

git clone https://github.com/MICA-MNI/BrainStat.git
cd BrainStat
python3 setup.py build
python3 setup.py install --user
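After the build and install steps, a quick import check can confirm that the package is visible to the interpreter. This sketch assumes the package imports under the module name `brainstat` and that `python3` points at the interpreter of your active environment; `can_import` is a hypothetical helper.

```shell
# Hypothetical helper: succeed (exit status 0) if the given Python module
# can be imported by python3, and fail otherwise.
can_import() {
    python3 -c "import $1" >/dev/null 2>&1
}

# Example:
# can_import brainstat && echo "BrainStat is available"
```

If the check fails, make sure the environment from step 3 is still activated in the current shell.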
