Skip to content

Commit

Permalink
Docs: static data (#7856)
Browse files Browse the repository at this point in the history
Title.

* Fixes #7845
  • Loading branch information
teh-cmc authored Oct 24, 2024
1 parent 91e981a commit 64a27ee
Show file tree
Hide file tree
Showing 15 changed files with 438 additions and 299 deletions.
97 changes: 97 additions & 0 deletions docs/content/concepts/static.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,97 @@
---
title: Static data
order: 450
---


The Rerun SDK allows you to store data as _static_. Static data belongs to all timelines (existing ones, and ones not yet created) and shadows any temporal data of the same type on the same entity.

That is, any time you log static data to an entity path, all past, present and future temporal data on that same entity path and component is _semantically_ discarded in favor of the static one (which doesn't necessarily mean that it is _physically_ discarded, more on that below).


## How to store static data?

Internally, all data in Rerun is stored as chunks of columns. Specifically, each chunk holds zero or more time columns (the indices), and zero or more component columns (the data).
Static data is data that lives in a chunk whose set of time columns is the empty set.

The easiest way to create such chunks is by using the `log` family of methods, which exposes a `static` flag where appropriate:

snippet: concepts/static/log_static

The same can be achieved using the `send_columns` API by simply leaving the time column set empty:

snippet: concepts/static/send_static

(Using `send_columns` that way is rarely useful in practice, but is just a logical continuation of the data model.)


## When should I use static data?

There are two broad categories of situations where you'd want to use static data: scene setting and memory savings.


### Scene setting

Often, you'll want to store data that isn't part of normal data capture, but sets the scene for how it should be shown.
For instance, if you are logging cars on a street, perhaps you want to always show a street mesh as part of the scenery, and for that it makes sense for that data to be static.

snippet: concepts/static/log_static

The alternative would be to log that data at the beginning of every relevant timeline, which can be very problematic as the set of timelines might not even be known before runtime.

Similarly, [coordinate systems](spaces-and-transforms.md) or [annotation context](annotation-context.md) are typically stored as static.


### Memory savings

When you store _temporal_ data in Rerun, it is always appended to the existing dataset: there is no such thing as overwriting temporal data. The dataset only grows, it never shrinks.
To compensate for that, the Rerun viewer has a [garbage collection mechanism](../howto/limit-ram) that will drop the oldest data from the store when memory becomes scarce.

For example, the following snippet stores 10 images at index `4` on the `frame` [timeline](timelines.md):

snippet: concepts/static/log_temporal_10x

All these images are actually stored, and all of them can be visualized in the viewer independently, even though they share the same index.

Contrary to temporal data, static data is **never** garbage collected… but it can actually be overwritten!
_Semantically_, only a single piece of static data can exist at a given time for a specific component on a specific entity.

In the following snippet, only the data from latest log call (in execution order) will be inspectable in the viewer:

snippet: concepts/static/log_static_10x

In practice, the Rerun datastore will rely on these semantics to physically drop the superfluous static data where possible, therefore drastically reducing memory costs. See ["Understanding storage costs"](#understanding-storage-costs) for more information.


## Understanding storage costs

In ["Memory savings"](#memory-savings), we mentioned that the following snippet _semantically_ stores a single image:

snippet: concepts/static/log_static_10x

How these semantics actually translate to physical storage depends on the context.


### In recordings

Rerun recordings (`.rrd` files) are just streams of binary messages: they have no semantics whatsoever, therefore they don't know what static means and can't do anything about it.

If you were to log the snippet above to a file (using e.g. `rr.save()`), you'd find that the recording does in fact contains your 10 images.

If you wanted the recording file itself to only contain a single static value, you would need to either:
* Stream the data to the viewer, and then save the recording directly out of the viewer using `Menu > Save recording` (or the equivalent palette command).
* Manually recompact your recording using the [Rerun CLI](../reference/cli#rerun-rrd-compact) so that the data overwrite semantics can get appropriately applied, e.g.: `rerun rrd compact -o compacted.rrd myrecording.rrd`.


### In the viewer

The data store that backs the Rerun viewer natively understands these temporal/garbage-collected vs. static/overwritten semantics.

If you were to log the snippet above directly to the Rerun viewer (using e.g. `rr.connect()`), you'd notice that the viewer's memory usage stays constant: the data is automatically being overwritten as new updates come in.
For data where you don't need to keep track of historical values, this effectively to logs its new values indefinitely.

In the following example, you can see our [face tracking example]() indefinitely tracking my face while maintaining constant memory usage by logging all data as static:

<video width="100%" autoplay loop muted controls>
<source src="https://static.rerun.io/tutorials/infinite_face_tracking.mp4" type="video/mp4" />
</video>
3 changes: 3 additions & 0 deletions docs/content/concepts/timelines.md
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,10 @@ An _event_ refer to an instance of logging one or more component batches to one

The [`rr.log()`](https://ref.rerun.io/docs/python/stable/common/logging_functions/#rerun.log) function has a `static=False` default argument.
If `static=True` is used instead, the data logged becomes *static*. Static data belongs to all timelines (existing ones, and ones not yet created) and shadows any temporal data of the same type on the same entity.

This is useful for data that isn't part of normal data capture, but sets the scene for how it should be shown.
For instance, if you are logging cars on a street, perhaps you want to always show a street mesh as part of the scenery, and for that it makes sense for that data to be static.

Similarly, [coordinate systems](spaces-and-transforms.md) or [annotation context](annotation-context.md) are typically static.

You can read more about static data in the [dedicated section](static.md).
1 change: 1 addition & 0 deletions docs/snippets/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@ cmake_minimum_required(VERSION 3.16...3.27)
file(GLOB_RECURSE sources_list true ${CMAKE_CURRENT_SOURCE_DIR}/all/*.cpp)

# Not complete examples:
list(FILTER sources_list EXCLUDE REGEX .*/concepts/static/*)
list(FILTER sources_list EXCLUDE REGEX .*/tutorials/custom-recording-id.*)
list(FILTER sources_list EXCLUDE REGEX .*/tutorials/log_line.*)
list(FILTER sources_list EXCLUDE REGEX .*/tutorials/log-file.*)
Expand Down
1 change: 1 addition & 0 deletions docs/snippets/all/concepts/static/log_static.cpp
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rec.log_static("skybox", generate_skybox_mesh());
1 change: 1 addition & 0 deletions docs/snippets/all/concepts/static/log_static.py
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rr.log("skybox", static=True, generate_skybox_mesh())
1 change: 1 addition & 0 deletions docs/snippets/all/concepts/static/log_static.rs
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rec.log_static("skybox", &generate_skybox_mesh())?;
2 changes: 2 additions & 0 deletions docs/snippets/all/concepts/static/log_static_10x.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
for _ in range(10):
rr.log("camera/image", static=True, camera.save_current_frame())
3 changes: 3 additions & 0 deletions docs/snippets/all/concepts/static/log_temporal_10x.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
rr.set_time_sequence("frame", 4)
for _ in range(10):
rr.log("camera/image", camera.save_current_frame())
1 change: 1 addition & 0 deletions docs/snippets/all/concepts/static/send_static.cpp
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rec.send_columns("skybox", {}, generate_skybox_mesh());
1 change: 1 addition & 0 deletions docs/snippets/all/concepts/static/send_static.py
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rr.send_columns("skybox", times=[], components=generate_skybox_mesh())
1 change: 1 addition & 0 deletions docs/snippets/all/concepts/static/send_static.rs
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rec.send_columns("skybox", std::iter::empty(), generate_skybox_mesh())?;
20 changes: 20 additions & 0 deletions docs/snippets/snippets.toml
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,26 @@
# You should only ever use this if the test isn't implemented and cannot yet be implemented
# for one or more specific SDKs.
[opt_out.run]
"concepts/static/log_static" = [ # pseudo-code
"py",
"cpp",
"rust",
]
"concepts/static/log_temporal_10x" = [ # pseudo-code
"py",
"cpp",
"rust",
]
"concepts/static/send_static" = [ # pseudo-code
"py",
"cpp",
"rust",
]
"concepts/static/log_static_10x" = [ # pseudo-code
"py",
"cpp",
"rust",
]
"concepts/viscomp-base" = [
"cpp", # Blueprint API doesn't exist for C++/Rust
"rust", # Blueprint API doesn't exist for C++/Rust
Expand Down
Loading

0 comments on commit 64a27ee

Please sign in to comment.