Jammy flow integration #728

RasmusOrsoe · 2024-05-29T08:05:01Z

This PR adds support for Normalizing Flows via the jammy_flows package, and therefore supersedes #649. The benefit of using jammy_flows is that it contains many different normalizing flows, and that we avoid maintaining that code ourselves :-).

The package is not listed as a direct dependency but used as an optional support package.

Specifically, this PR introduces the following major changes:

StandardFlowTask now uses jammy_flows to construct pdfs of any kind that it supports. These pdfs can be both conditional and non-conditional. Conditional flows can be conditioned on latent model output, event-level information or pulse-level information.
A new model class is added: NormalizingFlow which work with the StandardFlowTask. Usage is similar to StandardModel.
An example of training a conditional flow is added under examples/04_training/07_train_normalizing_flow.py
has_jammy_flows_package() is added under graphnet.utils.imports to check if its installed, and is used in a few places to make sure that the code runs also for people who choose not to install jammy_flows.

Minor changes:

repeat_labels is added as an argument to GraphDefinition - if True, event-level information, .e.g energy is repeated row-wise to match the number of pulses in the event. This feature was added in this PR to make it possible to build flows that learn pulse-level pdfs conditioned on event-level information.
Installation matrix is updated to provide a note on the installation of jammy flows
Github workflows is adjusted to run with jammy_flows installed.
**kwargs for Trainer is added for predict-methods in EasySyntax to allow the same level of control over Trainer arguments as we have for .fit

update main

RasmusOrsoe · 2024-08-06T09:26:38Z

@Aske-Rosted checks are now passing. Please let me know if you have any questions

Aske-Rosted

LGTM - A few questions here and there that do not necessarily require fixing.

Aske-Rosted · 2024-08-30T01:40:22Z

src/graphnet/models/graphs/graph_definition.py

-                sensor listed here will be removed from the graph. Defaults to None.
-            string_mask: A list of string id's to be masked from the graph. Defaults to None.
+                sensor listed here will be removed from the graph.
+                    Defaults to None.


Did some unintended formatting happen here?

Aske-Rosted · 2024-08-30T01:53:56Z

src/graphnet/models/task/task.py

+        return self._default_prediction_labels
+
+    def nb_inputs(self) -> Union[int, None]:  # type: ignore
+        """Return number of conditional inputs assumed by task."""


does non-conditional inputs also exist or what is the reason for the distinction of "conditional"?

The flow can be trained both as a conditional flow, where it's conditioned on a latent representation of an event e.g. p(target|latent_rep) which is the probability of seeing target for event represented by latent_rep, and as an unconditional probability distribution e.g. p(target) which would just learn the overall distribution of the variable in the dataset.

Aske-Rosted · 2024-08-30T01:54:39Z

src/graphnet/models/easy_model.py

Are these changes unrelated to jammy flows?

reread the minor changes and these changes are described there, you can disregard this question.

Aske-Rosted · 2024-08-30T01:54:58Z

src/graphnet/models/graphs/graphs.py

Are these changes also unrelated to Jammy flows?

reread the minor changes and these changes are described there, you can disregard this question.

jvaracarbonell · 2024-09-10T14:40:51Z

src/graphnet/models/task/task.py

+        labels = labels.to(self.dtype)
+        # Set the initial parameters of flow close to truth
+        # This speeds up training and helps with NaN
+        if self._initialized is False:


Just a quick note: I noticed that this part could override the learned weights whenever they are loaded from a state dict, which might affect further training or inference. It could be worth adding a global boolean to check if the weights have been loaded. Otherwise, it might be good at least to pass and self.training , which should work when the model is set to evaluation mode for predictions.

@jvaracarbonell thanks for pointing this out! I've added the check

…oe/graphnet into jammy_flow_integration

RasmusOrsoe and others added 28 commits May 20, 2024 16:37

revert changes on main

6a06d65

Merge branch 'graphnet-team:main' into main

8f7ea52

add NormalizingFlow

c1b4099

check

1e14f54

hooks

0c135b7

hooks

ce1223d

hooks

8886376

black

9d8b560

black

dbb02c4

black

9b90af4

black

382651f

polish dtype assignment

e299aac

add warning

9c0ad64

add check for flow package

f53bc1d

expand docstrings

a0afcc3

update workflow to install jammy_flows

845293d

add example

a71765c

check in example

eb15932

update example

4150f14

update example

8116c29

actions

d51f02c

update icetray action

210ef28

update install action

c32ffd1

fix has_jammy_flows_package

59870dd

polish

b953ff4

add doc string

5ab298a

update docstring

3bc33a3

update installation instruction

b74d9b2

RasmusOrsoe requested review from ArturoLlorente and Aske-Rosted May 30, 2024 13:56

RasmusOrsoe and others added 6 commits July 13, 2024 10:09

add normalization

49576cb

Merge branch 'jammy_flow_integration' into main

e3188e3

Merge pull request #29 from RasmusOrsoe/main

7f472cf

update main

increase batch size to avoid single event batch

53eefb4

revert change

dfee76d

increase batch size

dd41659

RasmusOrsoe mentioned this pull request Aug 7, 2024

NormalizingFlow #649

Closed

Aske-Rosted approved these changes Aug 30, 2024

View reviewed changes

jvaracarbonell reviewed Sep 10, 2024

View reviewed changes

RasmusOrsoe added 2 commits September 16, 2024 15:37

Merge branch 'jammy_flow_integration' of https://github.com/RasmusOrs…

ecd1627

…oe/graphnet into jammy_flow_integration

only initialize if training

9aa6936

RasmusOrsoe merged commit 4d9ca09 into graphnet-team:main Sep 18, 2024
13 of 14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jammy flow integration #728

Jammy flow integration #728

RasmusOrsoe commented May 29, 2024 •

edited

Loading

RasmusOrsoe commented Aug 6, 2024

Aske-Rosted left a comment

Aske-Rosted Aug 30, 2024

Aske-Rosted Aug 30, 2024

RasmusOrsoe Sep 16, 2024

Aske-Rosted Aug 30, 2024

Aske-Rosted Aug 30, 2024 •

edited

Loading

Aske-Rosted Aug 30, 2024

Aske-Rosted Aug 30, 2024 •

edited

Loading

jvaracarbonell Sep 10, 2024 •

edited

Loading

RasmusOrsoe Sep 16, 2024

Jammy flow integration #728

Jammy flow integration #728

Conversation

RasmusOrsoe commented May 29, 2024 • edited Loading

RasmusOrsoe commented Aug 6, 2024

Aske-Rosted left a comment

Choose a reason for hiding this comment

Aske-Rosted Aug 30, 2024

Choose a reason for hiding this comment

Aske-Rosted Aug 30, 2024

Choose a reason for hiding this comment

RasmusOrsoe Sep 16, 2024

Choose a reason for hiding this comment

Aske-Rosted Aug 30, 2024

Choose a reason for hiding this comment

Aske-Rosted Aug 30, 2024 • edited Loading

Choose a reason for hiding this comment

Aske-Rosted Aug 30, 2024

Choose a reason for hiding this comment

Aske-Rosted Aug 30, 2024 • edited Loading

Choose a reason for hiding this comment

jvaracarbonell Sep 10, 2024 • edited Loading

Choose a reason for hiding this comment

RasmusOrsoe Sep 16, 2024

Choose a reason for hiding this comment

RasmusOrsoe commented May 29, 2024 •

edited

Loading

Aske-Rosted Aug 30, 2024 •

edited

Loading

Aske-Rosted Aug 30, 2024 •

edited

Loading

jvaracarbonell Sep 10, 2024 •

edited

Loading