Normalize Data on GPU #25

sadamov · 2024-05-03T13:30:23Z

Motivation
Data normalization can be done on the fly on GPU for each batch. It's faster on GPU than CPU and cleans up the dataset init method.

Implementation
Could very nicely use https://lightning.ai/docs/pytorch/stable/common/lightning_module.html#on-after-batch-transfer to normalize once data is on GPU. Makes sure that you never forget about it (all batches on GPU are normalized).

The stats could be provided by a yaml_object handler that can be accessed on the model's init

leifdenby · 2024-05-22T10:31:23Z

sounds cool @sadamov, are you thinking this for v0.3.0 or a later release? :)

sadamov · 2024-05-25T17:08:42Z

This feature is ready in #39 I don't have a strong opinion about the version it should be published in. :)

sadamov self-assigned this May 13, 2024

sadamov linked a pull request May 25, 2024 that will close this issue

25 normalize data on gpu #39

Open

joeloskarsson added this to the v0.5.0 milestone Nov 20, 2024

This was referenced Dec 6, 2024

Standardize or rescale also static variables #95

Open

Implement standardization of static features #96

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalize Data on GPU #25

Normalize Data on GPU #25

sadamov commented May 3, 2024

leifdenby commented May 22, 2024

sadamov commented May 25, 2024

Normalize Data on GPU #25

Normalize Data on GPU #25

Comments

sadamov commented May 3, 2024

leifdenby commented May 22, 2024

sadamov commented May 25, 2024