Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add streaming finetuning example #935

Closed
wants to merge 32 commits into from

Conversation

bigning
Copy link
Contributor

@bigning bigning commented Feb 2, 2024

composer train.py yamls/finetune/gpt2-arc-easy--cpu-streaming-dataset.yaml

@bigning bigning changed the base branch from main to add_finetuning_streaming_dataset_conversion February 5, 2024 17:59
@bigning bigning force-pushed the add_finetuning_streaming_dataset_conversion branch from fbbe997 to bfe98bf Compare February 5, 2024 18:39
milocress and others added 14 commits February 5, 2024 11:55
* fix dash issue

* fix

* fix?

* added unit test

* fix fix

* fix tests

* fix fix tests
* added rich traceback

* sorted imports

* added rich to eval
Saw an automated ruff flag this, seems like a strict improvement and is marginally faster.

Co-authored-by: Daniel King <[email protected]>
…dictionary of metrics. (#938)

* ..

* undoing prev commit

* Refactoring the  function to accept list of metric names instead of dictionary

* ..

* ..

* ..

* ..
…hen sequence id is -1 due to attention masking (#940)

* ..

* undoing prev commit

* fixing the gen_attention_mask_in_length function to handle the case when sequence id is -1 due to attention masking

* Update modeling_mpt.py

* ..

---------

Co-authored-by: Daniel King <[email protected]>
@bigning bigning changed the base branch from add_finetuning_streaming_dataset_conversion to main February 5, 2024 20:05
@bigning bigning closed this Feb 8, 2024
@bigning bigning deleted the add_streaming_finetuning_example branch February 8, 2024 21:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

9 participants