FIX ProxNewton solver with fixpoint strategy #259

mathurinm · 2024-06-01T08:29:28Z

fixes #256

Hard to test properly, but the following now works fine:

import numpy as np
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from numpy.linalg import norm
from skglm.utils.jit_compilation import compiled_clone

from skglm import GeneralizedLinearEstimator
from skglm import datafits
from skglm import penalties
from skglm.solvers import ProxNewton
from skglm.utils.data import make_correlated_data

X, y, _ = make_correlated_data(500, 5000, random_state=0)

y = np.abs(y) // 1

# datafit = compiled_clone(datafits.Quadratic())
datafit = compiled_clone(datafits.Poisson())
penalty = compiled_clone(penalties.L1(alpha=1))
alpha_max = penalty.alpha_max(datafit.gradient(X, y, np.zeros(len(y))))

penalty.alpha = alpha_max / 10

solver = ProxNewton(verbose=3, max_iter=20, warm_start=True, fit_intercept=False, tol=1e-4, ws_strategy="fixedpoint", max_pn_iter=20)

solver.solve(X, y, datafit, penalty)

skglm/solvers/anderson_cd.py

mathurinm · 2024-06-01T10:30:47Z

Frankly compared to the cost of the rest of the computation I expect this to be totally negligible Il sab 1 giu 2024, 12:15 Badr MOUFAD ***@***.***> ha scritto:

…

***@***.**** commented on this pull request. ------------------------------ In skglm/solvers/anderson_cd.py <#259 (comment)> : > @@ -184,7 +184,7 @@ def solve(self, X, y, datafit, penalty, w_init=None, Xw_init=None): opt_ws = penalty.subdiff_distance(w[:n_features], grad_ws, ws) elif self.ws_strategy == "fixpoint": opt_ws = dist_fix_point_cd( - w[:n_features], grad_ws, lipschitz, datafit, penalty, ws + w[:n_features], grad_ws, lipschitz[ws], datafit, penalty, ws I feel that we should expect performance regression in AnderonCD as after this we will be making copies WDYT @mathurinm <https://github.com/mathurinm>? — Reply to this email directly, view it on GitHub <#259 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACETTQUUZKB4LRBG6JXI54TZFGNLTAVCNFSM6AAAAABIT65EP6VHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDAOJSGA3TEMRQHA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

Badr-MOUFAD

We have also

skglm/skglm/solvers/multitask_bcd.py

Line 234 in 9682660

def dist_fix_point_bcd(W, grad_ws, lipschitz, datafit, penalty, ws):

We better adopt the same convention here as well.

I'm +1 with tackling it in this PR as this it touches the fixpoint ws strategy.

skglm/solvers/prox_newton.py

mathurinm · 2024-06-02T10:26:40Z

@Badr-MOUFAD merge if happy

mathurinm · 2024-06-02T10:41:17Z

@Badr-MOUFAD With L1 it works fine, but with L0_5 the solver is still stuck.
If I disable the working set (by passing p0 = n_features for solver), it works fine.

import numpy as np
from skglm.utils.jit_compilation import compiled_clone

from skglm import datafits
from skglm import penalties
from skglm.solvers import ProxNewton
from skglm.utils.data import make_correlated_data

X, y, _ = make_correlated_data(50, 100, random_state=0)

y = np.abs(y) // 1

datafit = compiled_clone(datafits.Quadratic())
penalty = compiled_clone(penalties.L0_5(alpha=1))
# penalty = compiled_clone(penalties.L1(alpha=1))
alpha_max = penalties.L1(alpha=1).alpha_max(datafit.gradient(X, y, np.zeros(len(y))))

penalty.alpha = alpha_max / 10


solver = ProxNewton(verbose=3, max_iter=20, warm_start=True, fit_intercept=False, tol=1e-4, ws_strategy="fixpoint", max_pn_iter=20, p0=10)
solver.solve(X, y, datafit, penalty)

Badr-MOUFAD · 2024-06-02T11:56:00Z

It seems that the solver get trapped in a working set that doesn't happen to be the support of the solution

I have added a print in ProxNewton code to see that.
To reproduce, use n_samples=10, n_feautures=30 (for concise logs) and set verbose=0

Iter 0  : [ 0 18  1 13 12 25 26  7  6 14]
Iter 1  : [12 11 10  6 18 25  7  3  2  1]
Iter 2  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 3  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 4  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 5  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 6  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 7  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 8  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 9  : [ 6 13  8  9 10 25 18  3 11 29]
Iter 10 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 11 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 12 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 13 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 14 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 15 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 16 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 17 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 18 : [ 6 13  8  9 10 25 18  3 11 29]
Iter 19 : [ 6 13  8  9 10 25 18  3 11 29]

Perhaps it is something related to the non-convexity of the penalty 🤔

Badr-MOUFAD

I feel that the issue related to the L_05 penalty is not closely related to this PR and suggest investigating it later in another PR.

Thanks for the fix @mathurinm 🚀

mathurinm added 2 commits June 1, 2024 10:24

FIX ProxNewton solver with fixpoint strategy

bdca5af

update other solvers

d33360d

mathurinm requested a review from Badr-MOUFAD June 1, 2024 08:29

Badr-MOUFAD reviewed Jun 1, 2024

View reviewed changes

skglm/solvers/anderson_cd.py Show resolved Hide resolved

Badr-MOUFAD reviewed Jun 1, 2024

View reviewed changes

skglm/solvers/prox_newton.py Outdated Show resolved Hide resolved

multitask bcd + lipschitz ws

11a110b

Badr-MOUFAD added 2 commits June 2, 2024 13:44

for debugging

6c00224

print ws

2ebddbc

Badr-MOUFAD added 5 commits June 2, 2024 13:57

rm stale compute_lipschitz

8804486

refactor name in sparse case

ecfa08c

ws_size ---> len(ws)

9a3b940

silent bug multitask

165ad25

rm debug print

62ecc75

Badr-MOUFAD approved these changes Jun 2, 2024

View reviewed changes

mathurinm merged commit ccc6344 into scikit-learn-contrib:main Jun 3, 2024
4 checks passed

mathurinm deleted the fix_pn_lipschitz branch June 3, 2024 05:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX ProxNewton solver with fixpoint strategy #259

FIX ProxNewton solver with fixpoint strategy #259

mathurinm commented Jun 1, 2024

mathurinm commented Jun 1, 2024 via email

Badr-MOUFAD left a comment

mathurinm commented Jun 2, 2024

mathurinm commented Jun 2, 2024

Badr-MOUFAD commented Jun 2, 2024 •

edited

Loading

Badr-MOUFAD left a comment

FIX ProxNewton solver with fixpoint strategy #259

FIX ProxNewton solver with fixpoint strategy #259

Conversation

mathurinm commented Jun 1, 2024

mathurinm commented Jun 1, 2024 via email

Badr-MOUFAD left a comment

Choose a reason for hiding this comment

mathurinm commented Jun 2, 2024

mathurinm commented Jun 2, 2024

Badr-MOUFAD commented Jun 2, 2024 • edited Loading

Badr-MOUFAD left a comment

Choose a reason for hiding this comment

Badr-MOUFAD commented Jun 2, 2024 •

edited

Loading