Correct distributed_type in the accelerate test. #121

Merged
4 commits merged into GoogleCloudPlatform:master from fixAccelerateTest
Feb 16, 2024

@vanbasten23 (Collaborator) commented Feb 16, 2024

Description

Currently the HF accelerate test fails with the error ValueError: 'TPU' is not a valid DistributedType (log). The failure is caused by an upstream change in HF accelerate, huggingface/accelerate#2176, specifically https://github.com/huggingface/accelerate/blob/97d2168e5953fe7373a06c69c02c5a00a84d5344/src/accelerate/utils/dataclasses.py#L335.
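
For readers unfamiliar with this failure mode, here is a minimal sketch of how a value-based enum lookup produces that kind of ValueError. The `DistributedType` class below is a hypothetical stand-in, not accelerate's actual definition, and it assumes that the upstream change replaced the `TPU` member with one named `XLA`; the `lookup` helper is likewise illustrative only.

```python
import enum


class DistributedType(str, enum.Enum):
    # Hypothetical stand-in for accelerate's enum.
    # Assumption: the former TPU member is now called XLA.
    NO = "NO"
    MULTI_GPU = "MULTI_GPU"
    XLA = "XLA"


def lookup(value: str) -> DistributedType:
    """Look up a member by value; Enum raises ValueError for unknown values."""
    return DistributedType(value)


# A test that still passes the old name hits the ValueError.
try:
    lookup("TPU")
except ValueError as err:
    error_message = str(err)

# The same lookup succeeds once the test uses the new name.
xla_type = lookup("XLA")
```

Under that assumption, the fix on the test side is simply to pass the value the enum currently defines rather than the removed one.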

cc: @will-cromar

Tests

Please describe the tests that you ran on Cloud VM to verify changes.

Instruction and/or command lines to reproduce your tests: ...

List links for your tests (use go/shortn-gen for any internal link): ...

This is only the first error. Subsequent errors are likely; I have already seen some locally.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run one-shot tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed.

@will-cromar will-cromar merged commit 17f33c0 into GoogleCloudPlatform:master Feb 16, 2024
5 checks passed
@vanbasten23 vanbasten23 deleted the fixAccelerateTest branch February 16, 2024 17:32