
Expose algorithm kwargs for Grassmann methods #11

Merged · 9 commits · Sep 2, 2024
Conversation

@Gertian (Contributor) commented Aug 6, 2024

fixed tiny bug
@lkdvos (Collaborator) commented Aug 7, 2024

These test failures seem unrelated; we should probably check whether master even runs...

As a side comment, this is not really a bug. The SDD algorithm should be a lot faster for large matrices, so this change does have some performance implications. Nevertheless, I agree that in our case, since the stability of SDD turns out to be insufficient for the problems at hand, we should probably default to the (slower) more stable SVD algorithm.

@Jutho (Owner) commented Aug 7, 2024

We should probably adopt a SafeSVD version that does

```julia
try
    # use SDD
catch
    # fall back to SVD
end
```

A singular value decomposition is probably sufficiently costly in most cases that the `try ... catch` overhead is negligible.
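The same fallback pattern can be sketched on a plain matrix with the LinearAlgebra drivers (`safe_svd` is a hypothetical helper name; in TensorKitManifolds the calls would be `tsvd` with TensorKit's `SDD()`/`SVD()` instead):

```julia
using LinearAlgebra

# Hypothetical sketch: try the fast divide-and-conquer LAPACK driver
# first, and rerun with the slower but more robust QR-iteration driver
# only when the first one fails to converge.
function safe_svd(A::AbstractMatrix)
    try
        return svd(A; alg=LinearAlgebra.DivideAndConquer())
    catch e
        # only swallow LAPACK convergence failures; rethrow real bugs
        e isa LinearAlgebra.LAPACKException || rethrow()
        return svd(A; alg=LinearAlgebra.QRIteration())
    end
end

F = safe_svd([3.0 0.0; 0.0 4.0])
sort(F.S)  # [3.0, 4.0]
```

Note that the `catch` deliberately rethrows anything that is not a convergence failure, so genuine errors are not hidden by the fallback.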

@Gertian (Contributor, Author) commented Aug 7, 2024

@lkdvos I'm quite certain that the failing tests are indeed unrelated, since I've run many hours of gradient descent with this change without any issues.

@Jutho This was my original patch. The problem is that SDD already modifies the input tensor, so we would really have to try/catch on a hard copy of the tensor, but that introduces substantial overhead, which is why I eventually just swapped out SDD.

@Jutho (Owner) commented Aug 7, 2024

Without the exclamation mark, `tsvd` should not modify its input. I agree that it is trickier for the mutating method, so maybe SafeSVD should only be available for `tsvd` but not for `tsvd!`.

@Gertian (Contributor, Author) commented Aug 12, 2024

Hi all,

I updated the PR.

From the TensorKitManifolds perspective, the only change compared to master is that all methods (I think) now take the `alg` keyword, which can be `SVD` or `SDD`.

I simultaneously made a PR in MPSKit which adds an extra argument `svd_alg` to the `GradientGrassmann` struct. This can be set to `SVD` or `SDD` and is passed on to the underlying TensorKitManifolds algorithms.

Gertian and others added 4 commits August 13, 2024 11:10
I removed some unnecessary comments and reinstated `alg=nothing` in those cases where `alg` is not used anyway.
@lkdvos lkdvos changed the title Update grassmann.jl Expose algorithm kwargs for Grassmann methods Aug 13, 2024
@lkdvos (Collaborator) commented Aug 13, 2024

I added some changes to make the tests run again:

  • `getproperty` is treated specially by the Julia compiler, in the sense that it has aggressive constant propagation by default. This ensures that it is type stable with the different symbols. Adding a keyword argument breaks this, so I had to revert that change. In principle we could consider manually adding `Base.@constprop :aggressive`, but somehow I don't think a `getproperty` with a keyword is very Julian.
  • I refactored the default algorithm out, and changed it from SDD to SVD. I think the stability issues are severe enough to warrant this.

@Gertian (Contributor, Author) commented Aug 13, 2024

This is probably a ridiculous edge case, but I'm wondering what might happen if a user sets the TensorKitManifolds default svd algorithm to SDD and then uses the dispatching in MPSKit to use SVD everywhere.

In that case, could the SDD call inside `getproperty` still lead to a crash?

Or do you think users should simply not use SDD anymore?

@lkdvos (Collaborator) commented Aug 14, 2024

The current approach does not actually have an option to set the default algorithm; it is a compile-time constant.

Note that your approach also did not really expose the `alg` keyword in the `getproperty` function, which is the function that gets called when you access fields of a tangent: `y.U` is completely equivalent to `getproperty(y, :U)`. (This is also why this function is treated differently.)

Thus, the current implementation has the stable algorithm as default, and changing that would require quite a bit of rewriting. I am more or less okay with anything, so I would wait for @Jutho's opinion.

@Gertian (Contributor, Author) commented Aug 14, 2024

Ok, that makes sense, thanks!

@Jutho (Owner) commented Sep 2, 2024

My apologies for not getting to this sooner, which will probably result in double work. I have the following requests / suggestions:

  • I would define a `default_svd_alg(A::TensorMap) = SVD()` in the main module file ("src/TensorKitManifolds.jl").
  • I would change the default value of the `alg` keyword of `projectisometric!` (also in the TensorKitManifolds.jl file) to `default_svd_alg(A)`. Also use this value in any other `tsvd` and `tsvd!` call that you can find; I think they are only in "src/grassmann.jl".
  • I would then remove the `alg` value (reset it to `nothing`) in `Grassmann.retract` and friends (`invretract`, `relativegauge`). This `alg` argument is actually aimed at different retraction schemes (which exist, for example, for the Stiefel case).

This makes all svd calls in TensorKitManifolds completely hard-coded, but you can still change the default by redefining `TensorKitManifolds.default_svd_alg(::TensorMap) = SDD()`.

Ultimately, we should have algorithm structures at the level of `retract` and `transport` and friends, where the algorithm to be used for the svd would be a field in that structure. But that should be part of a larger overhaul, possibly one where we consider buying in to the Manifolds.jl ecosystem.
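The redefinition hook suggested above can be illustrated with a self-contained toy (`FakeManifolds` and its functions are purely illustrative; the real hook would be `TensorKitManifolds.default_svd_alg` feeding into `tsvd`):

```julia
# The package exposes its default as an ordinary function; downstream
# code changes behaviour globally by overwriting that method.
module FakeManifolds
    default_svd_alg(A::AbstractMatrix) = :SVD          # stable default
    factorize_step(A) = (default_svd_alg(A), size(A))  # internal call site
end

FakeManifolds.factorize_step(ones(2, 2))   # (:SVD, (2, 2))

# A user who wants the fast divide-and-conquer algorithm everywhere
# overwrites the method (sanctioned piracy on the package's own hook):
FakeManifolds.default_svd_alg(A::AbstractMatrix) = :SDD
FakeManifolds.factorize_step(ones(2, 2))   # (:SDD, (2, 2))
```

Because the override replaces a method, it takes effect for calls in a newer world age; at the top level of a script or REPL session this just works.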

@Gertian (Contributor, Author) commented Sep 2, 2024

Thanks @Jutho for the suggestions and @lkdvos for implementing them faster than I could even read them 😓

Does this mean that

```julia
using TensorKitManifolds
TensorKitManifolds.default_svd_alg(::TensorMap) = SDD()

using MPSKit

# rest of my code
```

will result in using the stable version of svd inside all MPSKit algorithms?

@lkdvos (Collaborator) commented Sep 2, 2024

No, `SDD()` is the unstable/fast algorithm, so the stable version is already selected by default; you won't have to do anything.

@Gertian (Contributor, Author) commented Sep 2, 2024

Oh yes, indeed. Thanks for pointing that out!

@Jutho (Owner) commented Sep 2, 2024

Great. I will merge when the lights turn green. There seems to be one `svd!` call hidden in "src/auxiliary.jl" in `_stiefellog`, but I assume that is unused, so let's keep that for another time.

Also, we should probably switch to the LinearAlgebra names of these algorithms in TensorKit.jl at some point. `default_svd_alg` is also defined in LinearAlgebra, but it might actually be useful to have independent `default_svd_alg` routines in the different packages for more granular control, until we have a better strategy implemented.

@lkdvos (Collaborator) commented Sep 2, 2024

I saw the talk about ScopedValues at JuliaCon last week; it seems like these kinds of defaults and configurations are precisely what they target, and from what I understand this would solve our issue. Just something to keep in the back of our heads when we revisit this.
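For reference, a dynamically scoped default along those lines could look like this (a sketch only: `DEFAULT_SVD_ALG` and `current_svd_alg` are invented names, and `Base.ScopedValues` requires Julia ≥ 1.11, with the ScopedValues.jl package backporting it to older versions):

```julia
using Base.ScopedValues

# package-level default, overridable per dynamic scope instead of globally
const DEFAULT_SVD_ALG = ScopedValue(:SVD)

current_svd_alg() = DEFAULT_SVD_ALG[]

inner = with(DEFAULT_SVD_ALG => :SDD) do
    current_svd_alg()   # every call in this dynamic scope sees :SDD
end

current_svd_alg()       # back to :SVD outside the scope
```

Unlike method redefinition, this override is scoped to one call tree (and the tasks it spawns), so two callers can use different defaults concurrently.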

@Jutho Jutho merged commit 0623799 into Jutho:master Sep 2, 2024
10 checks passed