bridge-ds is a lightweight Python framework designed to provide a unified interface to deep learning datasets from different modalities: Perform global operations, aggregations and queries with a Pandas-like experience, and handle individual samples and raw data using a class-based, tab-completion-ey interface.

Key Features

Browse

Browse through your datasets with ease using an intuitive interface.

Work with tables

View your data as tables.

Plot your data

Visualize your data quickly and effectively with the exposed Pandas Plotting API.

Assign, sort and filter

Perform common data operations like assigning new columns, sorting, and filtering with Pandas-like syntax.

Augment

Apply and visualize data augmentations directly within your workflow.

Installation

You can install the latest version of Bridge's from PyPI. It comes in a few flavors:

Core: The core package includes the basic functionality of Bridge.

$ pip install bridge-ds

Vision: The vision package includes the core package and additional (opinionated) functionality for working with image datasets.

$ pip install bridge-ds[vision]

NOTE: to run the demo notebooks locally, you'll need the vision package.

Documentation

To learn more about bridge-ds, please visit the official documentation.

Development

Setup

$ git clone https://github.com/guybuk/bridge-ds.git
$ cd bridge-ds
$ pip install -e ".[dev]"

# Testing
$ pytest tests/core

# Building the docs
$ sudo apt install pandoc
$ cd docs
$ make html

Roadmap

bridge-ds is under active development, currently in a pre-alpha stage.

The following is a rough roadmap of the planned features:

Video Support
- DataIO for video
- DisplayEngine (video player)
- DatasetProviders (for popular video datasets)
- Transforms (clipping, sampling, augmentation)
Text
- DatasetProviders
- DisplayEngine (adapt existing engine to work with classic text tasks: translation, Q&A, etc.)
Core
- DualDatasets (for tasks with two main elements e.g. image-image, image-text,text-text)
- Stress testing (currently have no capacity to test huge datasets)

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
bridge		bridge
docs		docs
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contents

Key Features

Installation

Documentation

Development

Setup

Roadmap

About

Languages

License

guybuk/bridge-ds

Folders and files

Latest commit

History

Repository files navigation

Contents

Key Features

Installation

Documentation

Development

Setup

Roadmap

About

Topics

Resources

License

Stars

Watchers

Forks

Languages