
Addition of the Loss derived type and of the MSE loss function #175

Merged: 8 commits into modern-fortran:main from loss_dt on Apr 19, 2024

Conversation

@jvdp1 (Collaborator) commented on Apr 16, 2024

As discussed with @milancurcic in #173:

  • addition of a derived type for loss functions (a sketch follows after the TODO list below)
  • addition of the Mean Square Error loss function

TODO:

  • Addition of docs
  • Addition of tests
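
A hedged sketch of the general shape such a derived type could take (all names here are illustrative assumptions, not necessarily the API merged in this PR):

module loss_sketch_m
  implicit none

  ! Interfaces that any loss function and its derivative must satisfy.
  abstract interface
    pure function loss_eval(true, predicted) result(res)
      real, intent(in) :: true(:)
      real, intent(in) :: predicted(:)
      real :: res
    end function loss_eval
    pure function loss_eval_derivative(true, predicted) result(res)
      real, intent(in) :: true(:)
      real, intent(in) :: predicted(:)
      real :: res(size(true))
    end function loss_eval_derivative
  end interface

  ! Abstract base type; concrete losses such as MSE extend it and
  ! provide the deferred bindings.
  type, abstract :: loss_type
  contains
    procedure(loss_eval), deferred, nopass :: eval
    procedure(loss_eval_derivative), deferred, nopass :: derivative
  end type loss_type

  type, extends(loss_type) :: mse
  contains
    procedure, nopass :: eval => mse_eval
    procedure, nopass :: derivative => mse_derivative
  end type mse

contains

  pure function mse_eval(true, predicted) result(res)
    real, intent(in) :: true(:)
    real, intent(in) :: predicted(:)
    real :: res
    res = sum((predicted - true)**2) / size(true)
  end function mse_eval

  pure function mse_derivative(true, predicted) result(res)
    real, intent(in) :: true(:)
    real, intent(in) :: predicted(:)
    real :: res(size(true))
    res = 2 * (predicted - true) / size(true)
  end function mse_derivative

end module loss_sketch_m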

@jvdp1 marked this pull request as ready for review on April 16, 2024 19:17
@jvdp1 (Collaborator, Author) commented on Apr 16, 2024

@milancurcic what is the strategy for the tests?

Comment on lines +28 to +33
pure module function mse_derivative(true, predicted) result(res)
real, intent(in) :: true(:)
real, intent(in) :: predicted(:)
real :: res(size(true))
res = 2 * (predicted - true) / size(true)
end function mse_derivative
@jvdp1 (Collaborator, Author):

This function should be checked to confirm it is valid.
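
For reference, a quick check, assuming MSE is defined as the mean of squared errors over a batch of size n:

\mathrm{MSE}(t, p) = \frac{1}{n} \sum_{i=1}^{n} (p_i - t_i)^2,
\qquad
\frac{\partial\, \mathrm{MSE}}{\partial p_i} = \frac{2\,(p_i - t_i)}{n}

which matches res = 2 * (predicted - true) / size(true) elementwise.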

@milancurcic self-requested a review on April 18, 2024 13:52
@milancurcic (Member) commented:
Thanks, @jvdp1, I'll start a test program.

@milancurcic (Member) commented:
@jvdp1 I put in a few very minimal tests that check the expected values given simple inputs. Feel free to add if you can think of better tests. I've also been thinking about how we can test the integration of these loss functions with the network; perhaps also using simple inputs and known outputs, but passing them through the network type.
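
A minimal sketch of such a value check (the inputs and expected outputs below are illustrative, not taken from the PR; the two functions are inlined so the sketch is self-contained):

program test_mse_sketch
  implicit none
  real :: t(2), p(2)
  real, parameter :: tol = 1e-6

  t = [0., 1.]
  p = [1., 1.]

  ! Expected by hand: MSE = ((1-0)**2 + (1-1)**2) / 2 = 0.5,
  ! and the derivative is 2*([1,1] - [0,1])/2 = [1., 0.].
  if (abs(mse(t, p) - 0.5) > tol) error stop 'mse value check failed'
  if (any(abs(mse_derivative(t, p) - [1., 0.]) > tol)) &
    error stop 'mse_derivative value check failed'
  print *, 'MSE checks passed.'

contains

  pure function mse(true, predicted) result(res)
    real, intent(in) :: true(:), predicted(:)
    real :: res
    res = sum((predicted - true)**2) / size(true)
  end function mse

  ! Same expression as in the diff above.
  pure function mse_derivative(true, predicted) result(res)
    real, intent(in) :: true(:), predicted(:)
    real :: res(size(true))
    res = 2 * (predicted - true) / size(true)
  end function mse_derivative

end program test_mse_sketch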

@jvdp1 (Collaborator, Author) commented on Apr 19, 2024

Feel free to add if you can think of better tests.

Thank you. These tests LGTM.

perhaps also using simple inputs and known outputs, but pass them through the network type.

It could be a possibility, but I guess that would test their integration in the implementation more than the functions themselves. If so, would such tests be more appropriate in, e.g., test_dense_network.f90?

@milancurcic (Member) commented:
On second thought, let's wait on testing the integration with the network (regardless of where those tests would be defined). As we implemented general mechanisms to specify and use losses and optimizers, it has become apparent to me that it's important to separate model creation (i.e., via the network_from_layers constructor) from the "compilation", as is done in more mature Python frameworks. In Keras, for example, you first create the model by specifying the architecture, and then in a separate step "compile" it by passing the loss function, the optimizer, and the evaluation metrics to use; this allows, for example, reusing the same network instance with different optimizers or losses.
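
Purely for illustration, a toy sketch of that two-step flow; neither network_type nor compile here is claimed to be the actual neural-fortran API:

module compile_sketch_m
  implicit none

  ! A network that is constructed first and "compiled" (given a loss)
  ! in a separate step, so the same instance can be reused with
  ! different losses or optimizers.
  type :: network_type
    character(:), allocatable :: loss_name
  contains
    procedure :: compile
  end type network_type

contains

  subroutine compile(self, loss_name)
    class(network_type), intent(inout) :: self
    character(*), intent(in) :: loss_name
    self % loss_name = loss_name
  end subroutine compile

end module compile_sketch_m

program compile_demo
  use compile_sketch_m
  implicit none
  type(network_type) :: net
  net = network_type()       ! step 1: create the model (architecture only)
  call net % compile('mse')  ! step 2: attach the loss in a separate step
  print *, 'compiled with loss: ', net % loss_name
end program compile_demo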

I'll merge this and open a separate issue. Thank you for the PR!

@milancurcic merged commit f7b6006 into modern-fortran:main on Apr 19, 2024 (2 checks passed)
@jvdp1 deleted the loss_dt branch on April 19, 2024 16:34