Skip to content

Commit

Permalink
Merge pull request #13 from scientificcomputing/paper-with-code
Browse files Browse the repository at this point in the history
Add slides for paper with code
  • Loading branch information
finsberg authored Nov 20, 2023
2 parents f27a983 + aae8c57 commit 5fc41aa
Show file tree
Hide file tree
Showing 5 changed files with 104 additions and 20 deletions.
3 changes: 2 additions & 1 deletion .github/workflows/build_docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -47,11 +47,12 @@ jobs:

- name: Build slides
run: |
npx @marp-team/marp-cli docs/intro.md --html -o ${{ env.PUBLISH_DIR }}/intro-slides.html
npx @marp-team/marp-cli docs/repo.md --html -o ${{ env.PUBLISH_DIR }}/repo-slides.html
npx @marp-team/marp-cli docs/environments.md --html -o ${{ env.PUBLISH_DIR }}/environments-slides.html
npx @marp-team/marp-cli docs/documentation.md --html -o ${{ env.PUBLISH_DIR }}/documentation-slides.html
npx @marp-team/marp-cli docs/testing.md --html -o ${{ env.PUBLISH_DIR }}/testing-slides.html
npx @marp-team/marp-cli docs/distribution.md --html -o ${{ env.PUBLISH_DIR }}/paper-slides.html
npx @marp-team/marp-cli docs/paper.md --html -o ${{ env.PUBLISH_DIR }}/paper-slides.html
npx @marp-team/marp-cli docs/distribution.md --html -o ${{ env.PUBLISH_DIR }}/distribution-slides.html
Expand Down
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,12 +5,12 @@ This repository contains material for the seminar on *Elevating Scientific Compu
The seminar is organized in a very hands-on and interactive way where we will first present some theory and then do a live demo before we ask you to solve a related exercise.
The agenda is as follows

- Intro [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/intro-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/intro.html)
- Code and Data repositories [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/repo-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/repo.html)
- Reproducible environments [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/environments-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/environments.html)
- Linters, formatters and continuous integration [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/testing-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/testing.html)
- Documentation [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/documentation-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/documentation.html)
- Example paper with code [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/paper-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/paper.html)
- Packaging and Distribution (if time) [Slides](https://scientificcomputing.github.io/seminar-23-11-2023/distribution-slides.html) [Book](https://scientificcomputing.github.io/seminar-23-11-2023/docs/distribution.html)


## Other useful resources
Expand Down
1 change: 1 addition & 0 deletions _toc.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@ format: jb-article
root: README

sections:
- file: docs/intro
- file: docs/repo
- file: docs/environments
- file: docs/testing
Expand Down
1 change: 1 addition & 0 deletions intro.md → docs/intro.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,6 +27,7 @@ Henrik Finsberg, Jørgen Dokken and Min Ragan-Kelley
* We write code to create figures and tables (**post-processing**)

* Whenever you publish a paper, you need to have recipe for how to reproduce the results in the paper
- Many journals require that you make your code available

---

Expand Down
117 changes: 99 additions & 18 deletions docs/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,37 +21,118 @@ Henrik Finsberg

---

## Agenda

- How to set up a repo to reproduce results in a paper
- Starting a new repository
- Set up reproducible environment
- Create tests to make sure results remain valid
- Host results online via GitHub pages
- Make sure to specify how to cite
- Licensing
## To produce a paper we write code

* We write code to **pre-process** data
* We write code to **run simulations**
* We write code to create figures and tables (**post-processing**)

---

## You want to publish code along with your paper

The first time you can use

- cookiecutter
- Will prompt you with some questions
- https://github.com/scientificcomputing/generate-paper
- template
- Will copy the template repo
- https://github.com/scientificcomputing/example-paper

* The second time, it is OK to copy files from an old project

---

## Write a README file

The README file should contain info about

- Short description of the code / paper
- Which paper and how to cite
- How to install the dependencies
- How to reproduce the results / run the code

---


## Setting up a reproducible environment

Write a `pyproject.toml` with the dependencies you need

* Compile a `requirements.txt` with the exact versions you used when creating the results with `pip-compile` (from [`pip-tools`](https://github.com/jazzband/pip-tools))
```
pip-compile --output-file=requirements.txt pyproject.toml
```
* Compile extra dependencies
```
pip-compile --extra=docs --output-file=requirements-docs.txt pyproject.toml
```
---
## Starting a new project
## Publish docker image with exact dependencies
- Use cookiecutter
- Use template
- Copy files from existing project
* Write a Dockerfile which clones the repo and installs the dependencies
* Build and push the image to some public registry
- For example, you can set up a GitHub action to push a new image when you create a new release
---
## Demo
Example paper with pure python dependencies:
## Add the code for reproducing the results
Several ways to do this:
- Add scripts for reproducing figures and tables
- Add asserts that will raise an error if results have changed
- Example: https://github.com/scientificcomputing/example-paper or https://github.com/scientificcomputing/example-paper-fenics
- Add notebooks and execute them as part of building docs
- Also here you can add asserts
- Example: https://github.com/RangamaniLabUCSD/smart
---
https://github.com/scientificcomputing/example-paper
## Tips and tricks
- Make it possible to pass command line arguments to the scripts so that you can e.g change the path to the results or data
- This will also make it easier if you e.g need to run the scripts on a cluster where you need to get the data from a different path
- Set up CI to run the scripts
- Upload the artifacts after the run
---
## Demo
## Handling data
* Large datasets should not be stored in git
* Data can be stored locally, dropbox or google drive during development
* Ideally you should share the data on Zenodo (https://zenodo.org) when publishing the paper. This will make sure you get a DOI for the data.
- It is also possible to upload data with restricted data on Zenodo
* Create a script for downloading data
## Make sure to create a tag / release
A tag is a specific snapshot of your repository, and by creating a tag it makes it easy to check out that version of the code.
Example paper with dependencies in docker image
You should create a tag (and a release) of the code
- when you submit the paper
- when the paper is published (if there are any changes from submission)
- if there are bug fixes
https://github.com/scientificcomputing/example-paper-fenics
* Remember to write a changelog if you make a new release with info of what has changed since the previous version.
---
## License and Citation
Make sure that people can use your code and provide proper attribution
- License
- Without a license, other cannot use your code without asking you first
- Citation information
- Write a `CITATION.cff`
---

0 comments on commit 5fc41aa

Please sign in to comment.