Skip to content
This repository has been archived by the owner on Sep 30, 2022. It is now read-only.

Reproducible data science with targets #70

Open
3 of 17 tasks
maurolepore opened this issue Jan 26, 2021 · 0 comments
Open
3 of 17 tasks

Reproducible data science with targets #70

maurolepore opened this issue Jan 26, 2021 · 0 comments
Labels

Comments

@maurolepore
Copy link
Contributor

maurolepore commented Jan 26, 2021

Who is the audience?

R users who need to run complex and slow data pipelines multiple times. This includes folks at 2DII and beyond.

Why is this important?

We are facing a reproducibility crisis. Few analyses are easy or even possible to reproduce. The challenge is particularly hard for analyses involving data pipelines that are complex and slow to run. When data pipelines are complex the order in which to run the code is difficult to understand; and when they are slow to run their quality suffers because they are too expensive to maintain.
The targets package tackles these problems. It provides a way to explain in code the steps required to reproduce the analysis, and it saves time because it skips costly runtime for tasks that are already up to date.

See also https://books.ropensci.org/targets/index.html#motivation

What should be covered?

This series will cover the content from target's manual that is most relevant to the work we do at 2DII.

https://github.com/2DegreesInvesting/ds-targets#syllabus

Suggested speakers or contributors

@wlandau, @maurolepore, @cjyetman?

Resources

https://github.com/2DegreesInvesting/ds-targets#resources

Checklist

2h before

  • Open meetup issue, paste this template and edit as necessary
  • Announce on #coding
  • Add meetup date, title and link to the invitation on 2dii's calendar

10' before

  • Start the meeting.

  • Announce on #coding:

    The ds-incubator starts in 10'; please join early so we start on time.
    The meeting room is already open at:
    <LINK WITH PASSWORD>

Start

  • 00': Welcome
  • 02': Announcements
  • 04': Nominate a timekeeper
  • 05': Record
  • 05': Remind syllabus and requirements
  • 06': Explain benefit for the intended audience
  • 20': Show the relevance for 2DII (give an example)
  • 24': Point to resources to learn more (share materials)
  • 25': Round of voices
  • 30': End
  • 30': Upload video
  • 35': Link video under resources
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

1 participant