
[WIP] [ENH] add faster jvp computation for lasso type problems #17

Open
wants to merge 27 commits into main

Conversation


@QB3 QB3 commented Aug 26, 2021

Algorithmically, implicit differentiation consists of two steps:

  • 1 compute the solution of the optimization problem;
  • 2 solve a linear system.

This linear system can be large and expensive to solve.
When one differentiates the solution of a sparse optimization problem (such as the Lasso), it is possible to reduce the size of the linear system to solve (http://proceedings.mlr.press/v119/bertrand20a/bertrand20a.pdf, https://arxiv.org/pdf/2105.01637.pdf).
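For intuition, here is a minimal NumPy sketch (not the jaxopt API; the function name and setup are mine) of why sparsity shrinks the system: under support stability, the Jacobian of the Lasso solution with respect to the regularization strength is zero outside the support S, so the implicit-function linear system reduces from p × p to |S| × |S|.

```python
import numpy as np

# Hedged sketch. For the Lasso
#   min_w 0.5/n * ||X w - y||^2 + lam * ||w||_1,
# differentiating the optimality condition on the support S = {j : w*_j != 0}
# gives (X_S^T X_S / n) dw_S/dlam = -sign(w*_S), an |S| x |S| system
# instead of a p x p one.

def lasso_jacobian_wrt_lam(X, w_star, n_samples):
    support = np.flatnonzero(w_star)          # S, typically |S| << p
    X_S = X[:, support]
    rhs = -np.sign(w_star[support])
    # Solve the small |S| x |S| system on the support only.
    J_S = np.linalg.solve(X_S.T @ X_S / n_samples, rhs)
    J = np.zeros_like(w_star)                 # Jacobian is zero off-support
    J[support] = J_S
    return J
```

With n_features = 1000 and a support of size around 10, the system shrinks dramatically, which is the speedup this PR hopes to capture.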

The goal of this PR is to implement such an acceleration for sparse optimization problems.

To this end, I implemented a sparse_root_vjp function (https://github.com/QB3/jaxopt/blob/c9b0daea3f90dec392fbcb73097eb88d39010aa2/jaxopt/_src/implicit_diff.py#L125) in which I solve a smaller linear system (https://github.com/QB3/jaxopt/blob/c9b0daea3f90dec392fbcb73097eb88d39010aa2/jaxopt/_src/implicit_diff.py#L169).
The correctness of the implementation is tested here https://github.com/QB3/jaxopt/blob/c9b0daea3f90dec392fbcb73097eb88d39010aa2/tests/implicit_diff_test.py#L88.
Note that this implementation is very crude and not at all general; the goal is to see whether we observe any speedups.

To check the speed of the implementation I created a sparse_vjp.py file in the benchmarks directory (https://github.com/QB3/jaxopt/blob/add_sparse_vjp/benchmarks/sparse_vjp.py).
I differentiate the solution of a Lasso with (n_samples=10, n_features=1000), lam = lam_max / 2.
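For reference, here is a small sketch (function name is mine) of the lam_max the benchmark setting refers to: the smallest regularization strength for which the Lasso solution is exactly zero.

```python
import numpy as np

# Hedged sketch: for the Lasso objective
#   0.5/n * ||X w - y||^2 + lam * ||w||_1,
# the solution is identically zero as soon as lam >= ||X^T y||_inf / n.
# The benchmark then uses lam = lam_max / 2.

def lam_max(X, y):
    n_samples = X.shape[0]
    return np.max(np.abs(X.T @ y)) / n_samples
```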

I have the following results:
Time taken to solve the Lasso optimization problem: 0.008
Time taken to compute the Jacobian: 2.168
Time taken to compute the Jacobian with the sparse implementation: 2.168
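One thing worth ruling out before interpreting such numbers: JAX dispatches work asynchronously and compiles on the first call, so naive wall-clock timing can measure compilation or dispatch rather than the actual solve. A minimal timing sketch (helper name is mine):

```python
import time

# Hedged sketch: warm up once to trigger jit compilation, then block on the
# result before reading the clock. block_until_ready() exists on JAX arrays;
# the getattr fallback makes the helper a no-op for plain Python/NumPy values.

def timeit(fn, *args, n_repeats=3):
    fn(*args)  # warm-up: first call pays compilation cost
    start = time.perf_counter()
    for _ in range(n_repeats):
        result = fn(*args)
        getattr(result, "block_until_ready", lambda: result)()
    return (time.perf_counter() - start) / n_repeats
```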

This benchmark tells us 2 things:

  • there is room for improvement when computing the Jacobian (it takes roughly 270× longer than solving the optimization problem itself);
  • the sparse implementation does not provide any speedup.

I do not understand why we do not observe any speedups; does anyone have a lead?

@google-cla

google-cla bot commented Aug 26, 2021

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


What to do if you already signed the CLA

Individual signers
Corporate signers

ℹ️ Googlers: Go here for more info.

@google-cla google-cla bot added the cla: no label Aug 26, 2021
@google-cla google-cla bot added cla: yes and removed cla: no labels Aug 26, 2021
@mblondel
Collaborator

@QB3 updated the pull request description with the current state of this pull request. Currently, he doesn't observe any speedups despite restricting the linear system to the support.

@shoyer @froystig @fabianp If you have any idea what could make things faster, let us know.

@shoyer
Member

shoyer commented Aug 27, 2021

> @QB3 updated the pull request description with the current state of this pull request. Currently, he doesn't observe any speedups despite restricting the linear system to the support.

I believe the number of iterations required for CG to converge typically depends on the condition number of the linear operator, rather than the size of the system. It's possible that restricting the system to the support does not change that. More generally, it seems like a good idea to collect some metrics (e.g., via host_callback) about the rate of convergence.
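A minimal sketch of such a metric on the NumPy/SciPy side (helper name is mine; jaxopt's own solver is not used here), counting CG iterations via SciPy's callback so the full and support-restricted systems can be compared:

```python
import numpy as np
from scipy.sparse import linalg as splinalg

# Hedged sketch: SciPy's CG invokes the callback once per iteration, so a
# mutable counter records how many iterations the solve took. If the
# iteration count barely changes after restricting to the support, the
# condition number (not the size) is the bottleneck.

def cg_with_iteration_count(A, b):
    n_iters = [0]

    def callback(xk):
        n_iters[0] += 1

    x, info = splinalg.cg(A, b, callback=callback)
    return x, n_iters[0]
```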

I would also test with a direct solver such as jax.scipy.linalg.solve (on small systems), whose cost depends more directly on the size of the system. You might also try calculating the condition number of the linear operators, e.g., via the approximate eigenvalue calculations in scipy.sparse.linalg.
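A sketch of that condition-number estimate (helper name is mine; assumes a symmetric positive definite operator given as a matvec):

```python
import numpy as np
from scipy.sparse import linalg as splinalg

# Hedged sketch: wrap the matvec in a LinearOperator and use ARPACK (eigsh)
# to approximate the largest and smallest eigenvalues; their ratio is the
# condition number governing CG's convergence rate.

def estimate_condition_number(matvec, n):
    A = splinalg.LinearOperator((n, n), matvec=matvec)
    lam_max = splinalg.eigsh(A, k=1, which='LA', return_eigenvectors=False)[0]
    lam_min = splinalg.eigsh(A, k=1, which='SA', return_eigenvectors=False)[0]
    return lam_max / lam_min
```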

@froystig
Member

Might it also be useful to profile (tutorial, docs) for any lower-level surprises?

@mblondel
Collaborator

@shoyer In the reference NumPy-based implementation of our paper, which also uses CG, @QB3 apparently observed speedups by restricting to the support (please confirm, @QB3).

@QB3
Author

QB3 commented Aug 30, 2021

Thanks a lot for your answers! I tried a larger example (n_samples=10, n_features=10_000) and observed some nice speedups:

Time taken to solve the Lasso optimization problem: 0.015
Time taken to compute the Jacobian: 24.673
Time taken to compute the Jacobian with the sparse implementation: 1.435
