
MDS #205

Merged: josevalim merged 21 commits into elixir-nx:main from mds on Nov 15, 2023
Conversation

@msluszniak (Contributor)

Closes #163

Currently, because we call isotonic regression inside defn, I changed the implementation of isotonic regression so that MDS is able to run. Of course, commenting out whole sections of code is a temporary solution, and I'm wondering what we should do about it.

msluszniak marked this pull request as draft on November 3, 2023 at 18:15
@josevalim (Contributor)

Do we need to preprocess? Because if we need to preprocess, then the best approach is likely to give the result of IsotonicRegression as an input to MDS.

@msluszniak (Contributor, Author)

The preprocessing is generally needed for points that share the same x coordinate, which probably doesn't happen often in real cases. The problem with passing the results of IsotonicRegression is that the function is called multiple times inside a while loop.
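
For illustration, the kind of tie handling meant here, sketched eagerly in plain Elixir (the module and function names are hypothetical, not the PR's code):

```elixir
defmodule TiePreprocess do
  # Hypothetical sketch: isotonic regression expects strictly increasing x,
  # so points sharing an x coordinate are collapsed by averaging their ys.
  def average_duplicates(points) do
    points
    |> Enum.group_by(fn {x, _y} -> x end, fn {_x, y} -> y end)
    |> Enum.map(fn {x, ys} -> {x, Enum.sum(ys) / length(ys)} end)
    |> Enum.sort_by(fn {x, _y} -> x end)
  end
end

# TiePreprocess.average_duplicates([{1, 2.0}, {1, 4.0}, {2, 3.0}])
# #=> [{1, 3.0}, {2, 3.0}]
```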

@josevalim (Contributor)

So I think we can run the isotonic regression fit and then run the linear regression ourselves?

@msluszniak (Contributor, Author)

I don't think I understand the concept.

@josevalim (Contributor)

What I mean is that you define a defn special_preprocess in IsotonicRegression that does what you need. :)

@msluszniak (Contributor, Author)

Yeah, that's probably the best idea

msluszniak marked this pull request as ready for review on November 7, 2023 at 19:45
@msluszniak (Contributor, Author)

I don't know why the test results here are so strange. I know that when I invoke x() and key() directly inside EXLA.jit_apply, I get the same results as here, but they aren't reasonable (in contrast to what I got locally by calling x and key beforehand).
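
To make the two setups concrete, a hedged sketch (`fit` is a stand-in for the function under test, not the real code):

```elixir
# Stand-in for the MDS entry point (hypothetical).
fit = fn x -> Nx.sum(Nx.pow(x, 2)) end

# Variant 1: build the key and data eagerly, then pass tensors into the JIT
# (the setup that behaved as expected locally).
key = Nx.Random.key(42)
{x, _key} = Nx.Random.uniform(key, shape: {100, 3})
EXLA.jit_apply(fit, [x])

# Variant 2: generate the data inside the JIT boundary, as in the tests here.
gen_and_fit = fn key ->
  {x, _key} = Nx.Random.uniform(key, shape: {100, 3})
  fit.(x)
end

EXLA.jit_apply(gen_and_fit, [Nx.Random.key(42)])
```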

@josevalim (Contributor)

@msluszniak what exactly is strange about it?

@msluszniak (Contributor, Author)

For example, when I choose a smaller epsilon :eps (even a very small one), the result does not change in this setup, as if there were perfect convergence after only 5 or 6 steps, which is rather impossible.

@msluszniak (Contributor, Author) commented on Nov 7, 2023

Locally, the response for a smaller epsilon was as expected: many more iterations and a smaller stress loss. And the results correspond to what the sklearn implementation returns.

@josevalim (Contributor)

@msluszniak I see. In this case I can't help much; it seems to be related to the algorithm. :( Maybe print_value can help debug.
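
A minimal sketch of how print_value can be used inside defn (the "stress" here is a placeholder, not the real computation):

```elixir
defmodule Debug do
  import Nx.Defn

  defn debug_stress(x) do
    # Placeholder "stress" (sum of squares), just to have a value to print.
    stress = Nx.sum(Nx.pow(x, 2))

    # print_value/2 prints the runtime value each time the compiled function
    # runs, which helps trace where the iterations start to diverge.
    print_value(stress, label: "stress")
  end
end
```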

@msluszniak (Contributor, Author)

Yeah, there is definitely something wrong with the SMACOF procedure, since it's mathematically proven that the stress shouldn't increase, but currently it sometimes does for larger numbers of samples.
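
For reference, the property being invoked: SMACOF minimizes the raw stress by majorization, so (assuming uniform weights) each iteration can only lower it:

```latex
\sigma(X) = \sum_{i < j} \left( d_{ij}(X) - \hat{d}_{ij} \right)^2,
\qquad
\sigma\left(X^{(k+1)}\right) \le \sigma\left(X^{(k)}\right)
```

where `d_ij(X)` are the pairwise distances of the embedding and `d̂_ij` are the disparities.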

@msluszniak (Contributor, Author)

Actually, I debugged the sklearn implementation and it shows the same behaviour, so maybe there is nothing wrong here 😅
[screenshot]

@msluszniak (Contributor, Author)

Ok, I found the problem. It was in the pairwise distance. The function that calculates pairwise distances on a single tensor only checked for negative values (which sometimes occur because of numerical instability). However, a check is also needed for positive values on the main diagonal. Ideally there should be only zeros there, but because of numerical instability there are sometimes small positive values, which in my specific case caused the loss to explode to huge values.
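
A hedged sketch of the fix described above (not the exact code in this PR): clamp the numerically negative entries and force exact zeros on the diagonal.

```elixir
defmodule Pairwise do
  import Nx.Defn

  # Squared Euclidean pairwise distances of a single {n, d} tensor via
  # ||xi||^2 + ||xj||^2 - 2 * xi . xj. Floating-point error can leave small
  # negative values off-diagonal and small positive values on the diagonal,
  # so both are corrected.
  defn squared_distances(x) do
    sq = Nx.sum(Nx.pow(x, 2), axes: [1])

    dist =
      Nx.new_axis(sq, 1) + Nx.new_axis(sq, 0) -
        2 * Nx.dot(x, Nx.transpose(x))

    # Clamp negatives caused by numerical instability ...
    dist = Nx.max(dist, 0)

    # ... and zero out the main diagonal, which should be exactly zero.
    n = Nx.axis_size(x, 0)
    Nx.put_diagonal(dist, Nx.broadcast(0, {n}))
  end
end
```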

@msluszniak (Contributor, Author)

@josevalim why do two tests fail on one CI run and three on the other? And why do these tests fail at all? 😅

@josevalim (Contributor)

@msluszniak for what it's worth, it is very common to have precision differences across machines and compiler versions. You will have to either increase the tolerance of the matrix comparison or test based on a property (I don't know if there is such a property for MDS).
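
For example, a tolerance-based comparison with Nx.all_close/3 (the tensors and tolerances below are placeholders):

```elixir
# `embedding` and `expected` stand for the tensors under test.
test "MDS embedding matches the reference within tolerance" do
  close = Nx.all_close(embedding, expected, atol: 1.0e-3, rtol: 1.0e-3)
  assert Nx.to_number(close) == 1
end
```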

@msluszniak (Contributor, Author)

Ok, I understand. Unfortunately, there is no such property for MDS. The algorithm is very sensitive to any numerical changes, and the final result will be completely different (but also correct, just embedded in a different space). I guess we may flag these tests as @to_skip or something like that, because I think it's good to have them just in case.
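
ExUnit already has a built-in way to do this; a sketch using the standard :skip tag (test body reuses the hypothetical tensors from above):

```elixir
# The :skip tag keeps the test in the codebase but marks it as skipped
# on every run, so it still documents the expected values.
@tag :skip
test "MDS embedding matches the exact reference values" do
  assert Nx.to_number(Nx.all_close(embedding, expected)) == 1
end
```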

@josevalim (Contributor)

So how do you know MDS is correct? :D

@msluszniak (Contributor, Author)

Actually, via visualization and empirical checks. Even sklearn has barely any tests for MDS: https://github.com/scikit-learn/scikit-learn/blob/093e0cf14aff026cca6097e8c42f83b735d26358/sklearn/manifold/tests/test_mds.py. We can check the stress majorization procedure like they do, but that's all.
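
A hedged sketch of such a check (`run_mds/1` is a hypothetical helper returning `{embedding, stress}`): run the procedure with growing iteration caps and assert the reported stress never increases.

```elixir
test "stress is non-increasing with more iterations" do
  stresses =
    for max_iter <- [10, 50, 100] do
      {_embedding, stress} = run_mds(max_iter)
      Nx.to_number(stress)
    end

  assert stresses == Enum.sort(stresses, :desc)
end
```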

@josevalim (Contributor)

Sounds like a plan to me.

@msluszniak (Contributor, Author)

I think it's ready to go

josevalim merged commit dacf106 into elixir-nx:main on Nov 15, 2023
2 checks passed
@josevalim (Contributor)

💚 💙 💜 💛 ❤️

msluszniak deleted the mds branch on July 12, 2024