ella: Embedded Low-Latency Datastore

ella is in extremely early development. There is very little documentation and many parts of it do not work. Expect that any part of the API may change in the future.

ella is a streaming time-series datastore designed for:

Low (<1 ms) end-to-end latency.
First-class multidimensional tensor support.
Hybrid embedded/client-server deployment.

ella is not:

An ACID database.
A replacement for Delta Lake, Snowflake, or any other cloud data service.

Usage

Use Cases

The goal of ella is to simplify the storage and analysis of data in systems that require low-latency data access.

A typical workflow for such a system might be something like:

Data is ingested from sensors.
The data is sent to streaming consumers using a low-latency API (e.g. gRPC, LSL).
The data is written to persistent storage such as a SQL database or an HDF5 file.
Batch consumers read data from persistent storage using a separate API.

---
title: Typical Workflow
---
flowchart LR
    pub1[Data Source]
    sub1[Streaming Consumer]
    sub2[Batch Consumer]
    api1[[Streaming API]]
    api2[[Batch API]]
    store[(Storage)]

    pub1 --> api1
    api1 --> sub1
    pub1 --> api2
    api2 --> sub2
    api2 <-.-> store

In contrast, ella provides a unified API for both streaming and batch processing:

---
title: Unified Workflow
---
flowchart LR
    pub1[Data Source]
    sub1[Streaming Consumer]
    sub2[Batch Consumer]

    subgraph ella[ella]
        direction TB
        table1[[Table]]
        store[(Storage)]
        api[[API]]
    end

    pub1 ---> api
    table1 <--> api
    table1 <-.-> store
    api ---> sub2
    api ---> sub1

For example, consider the following queries:

-- Get all rows from the table
SELECT time,x,y FROM sensor

-- Only return new rows published after this query is executed
SELECT time,x,y FROM sensor WHERE time > now()

-- Return existing rows, but ignore any additional rows published after the query is executed
SELECT time,x,y FROM sensor WHERE time < now()

Concepts

Tables

Data in ella is grouped into tables. Each table is either a topic or a view.

Topics: collect rows of data written by publishers. Topics are stored to disk by default, but temporary topics are not.
Views: return the result of specific queries. By default a view is re-computed each time it's scanned, but views can also be materialized to disk. Views are read-only.

Organization

ella follows the Catalog → Schema → Table organizational model.

The default catalog is "ella" and the default schema is "public".

Columns and Indices

Ella is a time-series datastore, and all topics have a timestamp as their first column (named "time" by default but can be renamed) and primary index.

Views are not required to have a time column.

Name		Name	Last commit message	Last commit date
Latest commit History 144 Commits
.cargo		.cargo
.github		.github
docker		docker
ella-cli		ella-cli
ella-common		ella-common
ella-derive		ella-derive
ella-engine		ella-engine
ella-server		ella-server
ella-tensor		ella-tensor
ella		ella
pyella		pyella
tracing		tracing
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
cliff.toml		cliff.toml
deny.toml		deny.toml
justfile		justfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

ella: Embedded Low-Latency Datastore

Usage

Rust

Python

CLI

Docker

Use Cases

Concepts

Tables

Organization

Columns and Indices

About

Licenses found

Releases 2

Packages

Languages

License

Licenses found

CerebusOSS/ella

Folders and files

Latest commit

History

Repository files navigation

ella: Embedded Low-Latency Datastore

Usage

Rust

Python

CLI

Docker

Use Cases

Concepts

Tables

Organization

Columns and Indices

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages