Skip to content

lincc-frameworks/nested-dask

This branch is up to date with main.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

f5019f7 · Nov 6, 2024
Oct 14, 2024
Oct 9, 2024
Oct 21, 2024
Nov 6, 2024
Nov 6, 2024
Oct 9, 2024
Apr 18, 2024
Apr 18, 2024
Apr 18, 2024
Oct 9, 2024
Oct 9, 2024
Apr 18, 2024
Oct 9, 2024
Jul 1, 2024
Nov 6, 2024

Repository files navigation

nested-dask

Template

PyPI GitHub Workflow Status Codecov Read The Docs Benchmarks

A dask extension of nested-pandas.

Nested-pandas is a pandas extension package that empowers efficient analysis of nested associated datasets. This package wraps the majority of the nested-pandas API with Dask, which enables easy parallelization and capacity for work at scale.

Dev Guide - Getting Started

Before installing any dependencies or writing code, it's a great idea to create a virtual environment. LINCC-Frameworks engineers primarily use conda to manage virtual environments. If you have conda installed locally, you can run the following to create and activate a new environment.

>> conda create env -n <env_name> python=3.10
>> conda activate <env_name>

Once you have created a new environment, you can install this project for local development using the following commands:

>> pip install -e .'[dev]'
>> pre-commit install
>> conda install pandoc

Notes:

  1. The single quotes around '[dev]' may not be required for your operating system.
  2. pre-commit install will initialize pre-commit for this local repository, so that a set of tests will be run prior to completing a local commit. For more information, see the Python Project Template documentation on pre-commit
  3. Install pandoc allows you to verify that automatic rendering of Jupyter notebooks into documentation for ReadTheDocs works as expected. For more information, see the Python Project Template documentation on Sphinx and Python Notebooks