Skip to content

Simple tools for reviewing the quality of Jupyter notebooks.

License

Notifications You must be signed in to change notification settings

innovationOUtside/nb_quality_profile

Repository files navigation

nb_quality_profile

Simple tools for reviewing the quality of Jupyter notebooks.

Binder

At the moment, only a few of simple tools are provided::

  • a utility to list package imports into one or more notebooks;
  • a simple reading time estimator based on words-per-minute calculations against markdown cells (code cells ignored);
  • a visualisation of notebook structure (relative length, structure in terms of markdown vs code cell). Visualisations are of the form:

As well as reporting on .ipynb notebooks, reports can also be generated for other document formats that are convertable to the Jupyter notebook .ipynb using jupytext.

Installation

Install from pip:

pip install nb_quality_profile

Install from this repo:

pip install git+https://github.com/innovationOUtside/nb_quality_profile.git

Usage

Base:

Usage: nb_quality [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  chart    Display notebook profile chart.
  imports  Display notebook imports.
  text-analysis  Report on text / markdown content.

Commands:

Usage: nb_quality chart [OPTIONS] PATH

  Display notebook profile chart from provided file or directory path.

Options:
  -o, --out TEXT                  Image outfile
  -g, --gap FLOAT                 Gap
  -G, --gapcolor TEXT             Gap colour
  -l, --linewidth INTEGER         Line width
  --text-formats / --no-text-formats
                                  Enable/disable Jupytext support.
  --help                          Show this message and exit.

So for example, to generate a chart of files in current directory:

nb_quality chart .

Check package imports:

Usage: nb_quality imports [OPTIONS] PATH

  Display notebook imports from provided file or directory path.

Options:
  --text-formats / --no-text-formats
                                  Enable/disable Jupytext support.
  --help                          Show this message and exit.

The imports report also generates a scatterplot (packages.png) showing package use across notebooks.

Check links (note: links included as "free text" are currently ignored):

Usage: nb_quality link-check [OPTIONS] PATH

  Check links.

Options:
  --all-links         Display all links.
  --grab-screenshots  Grab screenshots.
  --help              Show this message and exit.

To grab screenshots, playwright needs to be installed:

pip install playwright
playwright install

Check image alt text:

Usage: nb_quality alt-tags [OPTIONS] PATH

  Check image alt text.

Options:
  --help  Show this message and exit.

Check for errors and warnings (stderr messages in code cell outputs):

Usage: nb_quality check-warnings [OPTIONS] PATH

  Check code output cells for warnings.

Options:
  --help  Show this message and exit.

Reading time (based solely on markdown; code cells ignored):

Usage: nb_quality text-analysis [OPTIONS] PATH

  Report on text / markdown content.

Options:
  --text-formats / --no-text-formats
                                  Enable/disable Jupytext support.
  -r, --reading-rate INTEGER      Words per minute.
  -R, --rounded-minutes           Round up to minutes.
  --help                          Show this message and exit.

On a Mac, you may get a warning of the form:

2020-07-03 15:34:06.658 Python[46394:645167] ApplePersistenceIgnoreState: Existing state will not be touched. New state will be written to (null)

This seems to be a known matplotlib issue.

Use as an API

To generate simple visualisations of the relative size and structure (markdown vs code cells) of a single Jupyter notebook:

from  nb_quality_profile import nb_visualiser as nbv

nbv.nb_vis_parse_nb(PATH_TO_IPYNB_FILE)

See demo.ipynb for an example.

Related Blog Posts

The visualisation tool was originally described here: Fragment -Visualising Jupyter Notebook Structure

About

Simple tools for reviewing the quality of Jupyter notebooks.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published