Issue loading mechanic optimizer state #2

yitzhaklevi · 2023-11-06T16:40:26Z

The following simple script reproduces the issue:

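import torch
from mechanic_pytorch import mechanize  # assumed import path (mechanic-pytorch package)

# Take one step with a Mechanic-wrapped AdamW optimizer.
model = torch.nn.Linear(1, 1)
optimizer = mechanize(torch.optim.AdamW)(model.parameters(), lr=1e-5)
x = torch.ones([5, 1])
out = torch.sum(model(x))
out.backward()
optimizer.step()
print('done first step')

# Build a second optimizer, load the saved state into it, and step again.
new_optimizer = mechanize(torch.optim.AdamW)(model.parameters(), lr=1e-5)
new_optimizer.load_state_dict(optimizer.state_dict())
out = torch.sum(model(x))
out.backward()
new_optimizer.step()  # the reported failure occurs after loading the state
print('done new steps using new optimizer loaded')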

yitzhaklevi · 2023-11-06T16:41:58Z

The issue is caused by state_dict['state']['_mechanic'] using tensor pointers as keys. Those tensors do not have the same addresses when a new Mechanic optimizer is initialized, so the saved keys no longer match.

(Other optimizers, e.g. AdamW, use indexes as keys for the state.)
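To illustrate the index-keyed behaviour mentioned above, here is a minimal sketch using plain torch.optim.AdamW only (no Mechanic wrapper). The variable names are illustrative and not taken from the Mechanic code. It shows that the live optimizer.state is keyed by parameter tensors, while state_dict() rewrites those keys as integer positions, which is what lets a freshly built optimizer load them.

```python
import torch

model = torch.nn.Linear(1, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
torch.sum(model(torch.ones([5, 1]))).backward()
optimizer.step()

# The live state is keyed by the parameter tensors themselves.
live_keys = list(optimizer.state.keys())

# state_dict() replaces each tensor key with its index in param_groups.
saved = optimizer.state_dict()
saved_keys = list(saved["state"].keys())
group_params = saved["param_groups"][0]["params"]

# Because the saved keys are indexes, a new optimizer can map them
# back onto its own parameters when loading.
new_optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
new_optimizer.load_state_dict(saved)
loaded_keys = list(new_optimizer.state.keys())

print(saved_keys, group_params)
```

A wrapper that stores extra state keyed by tensors would presumably need a similar tensor-to-index translation on save and the reverse on load; how Mechanic should do this is not shown here.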

acutkosky · 2023-11-09T23:57:43Z

Thanks for bringing this up! I'll take a look and make an update (or if you want to do so, please feel free to submit a PR).

yitzhaklevi · 2023-11-10T16:12:44Z

You're welcome. I actually fixed that (I will submit a PR next week), but it seems the issue does not reproduce on Windows (I tried on my local machine and it worked).

ogencoglu · 2024-07-29T20:07:38Z

Any update on this?
