Remove extra test suite #743

Merged
merged 37 commits into from
Nov 20, 2023
37 commits
7dc4531
remove test suite
dakinggg Nov 16, 2023
3592d88
wip
dakinggg Nov 16, 2023
3c973cb
Merge branch 'main' into remove-test-suites
dakinggg Nov 16, 2023
6b75e34
fix typos
dakinggg Nov 16, 2023
df25e48
Merge branch 'main' into remove-test-suites
dakinggg Nov 18, 2023
f23a3f6
fix
dakinggg Nov 18, 2023
c125b3b
wip
dakinggg Nov 18, 2023
564bfa7
precommit
dakinggg Nov 18, 2023
275f34c
fix comparison tests
dakinggg Nov 18, 2023
1913599
precommit
dakinggg Nov 18, 2023
4f21ede
attn cmp
dakinggg Nov 18, 2023
7ac1a1b
lion8b
dakinggg Nov 18, 2023
92bdcaf
training
dakinggg Nov 18, 2023
f8a2429
precommit
dakinggg Nov 18, 2023
d26f3b0
remove extra cpu workflow too
dakinggg Nov 18, 2023
fc0d944
more
dakinggg Nov 18, 2023
b54c837
rename workflow
dakinggg Nov 18, 2023
6c27302
fix?
dakinggg Nov 18, 2023
959a8de
fix
dakinggg Nov 18, 2023
b0b1637
fix auto packing on 1.13
dakinggg Nov 19, 2023
5d97575
speed up packing test
dakinggg Nov 19, 2023
16e58f6
icl speedup
dakinggg Nov 19, 2023
dff0fcf
precommit
dakinggg Nov 19, 2023
8cf8bdb
precommit
dakinggg Nov 19, 2023
52b11f5
type
dakinggg Nov 19, 2023
1a5301f
less gen
dakinggg Nov 19, 2023
eea448f
remove verbose
dakinggg Nov 19, 2023
d3d3bfe
clean up test model
dakinggg Nov 19, 2023
870441c
remove comment
dakinggg Nov 19, 2023
ff25766
fix flash2 mistaken override
dakinggg Nov 19, 2023
7a5a1f4
fix typo
dakinggg Nov 19, 2023
edacb16
fix
dakinggg Nov 19, 2023
0a9e0f2
less rope parametrization
dakinggg Nov 19, 2023
7a8bc43
precommit
dakinggg Nov 19, 2023
606f975
fix
dakinggg Nov 19, 2023
3e6ab20
Update .github/workflows/pytest-gpu.yaml
dakinggg Nov 20, 2023
dca0dc7
Merge branch 'main' into remove-test-suites
dakinggg Nov 20, 2023
5 changes: 4 additions & 1 deletion .github/mcp/mcp_pytest.py
@@ -54,6 +54,9 @@
  type=int,
  default=1800,
  help='Timeout for run (in seconds)')
+ parser.add_argument('--deps_group',
+ type=str,
+ help='Dependency group to install')
  args = parser.parse_args()

  name = args.name
@@ -89,7 +92,7 @@
  clear_tmp_path_flag = '-o tmp_path_retention_policy=none'
  command += f'''

- pip install --upgrade --user .[all]
+ pip install --upgrade --user .[{args.deps_group}]

  export COMMON_ARGS="-v --durations=20 -m '{args.pytest_markers}' {clear_tmp_path_flag}"
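The change above threads a dependency group from the command line into the `pip install` extras. A minimal sketch of that pattern, not the actual script: `build_install_command` and `parse_args` are hypothetical helpers, and `default='all'` is an assumption added here. The diff declares `--deps_group` without a default, so an omitted flag would interpolate `None` into the extras name.

```python
import argparse


def build_install_command(deps_group: str) -> str:
    """Return the pip command that installs the chosen extras group."""
    return f'pip install --upgrade --user .[{deps_group}]'


def parse_args(argv=None) -> argparse.Namespace:
    parser = argparse.ArgumentParser()
    # Assumption: fall back to 'all' when the flag is omitted.
    parser.add_argument('--deps_group',
                        type=str,
                        default='all',
                        help='Dependency group to install')
    return parser.parse_args(argv)


if __name__ == '__main__':
    args = parse_args(['--deps_group', 'all-flash2'])
    print(build_install_command(args.deps_group))
```

Marking the argument `required=True` instead of giving it a default would be an equally reasonable choice if every caller is expected to pass the group explicitly.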
8 changes: 2 additions & 6 deletions .github/workflows/pr-cpu.yaml
@@ -19,12 +19,8 @@ jobs:
  strategy:
  matrix:
  include:
- - name: 'cpu-latest'
- container: mosaicml/pytorch:latest_cpu # mosaicml/pytorch:1.13.1_cpu-python3.10-ubuntu20.04
- markers: 'not gpu'
- pytest_command: 'coverage run -m pytest'
- - name: 'cpu-2.0.1'
- container: mosaicml/pytorch:2.0.1_cpu-python3.10-ubuntu20.04
+ - name: 'cpu-1.13.1'
+ container: mosaicml/pytorch:1.13.1_cpu-python3.10-ubuntu20.04
  markers: 'not gpu'
  pytest_command: 'coverage run -m pytest'
  - name: 'cpu-2.1.0'
13 changes: 6 additions & 7 deletions .github/workflows/pr-gpu.yaml
@@ -18,24 +18,22 @@ jobs:
  uses: ./.github/workflows/pytest-gpu.yaml
  strategy:
  matrix:
- # TODO: After the PR with the flash attention 2 images goes in, add the new unit test suite
  include:
- - name: 'gpu-latest'
- container: mosaicml/pytorch:latest # mosaicml/pytorch:1.13.1_cu117-python3.10-ubuntu20.04
- markers: 'gpu'
- pytest_command: 'coverage run -m pytest'
- - name: 'gpu-2.0.1'
- container: mosaicml/pytorch:2.0.1_cu118-python3.10-ubuntu20.04
+ - name: 'gpu-1.13.1'
+ container: mosaicml/pytorch:1.13.1_cu117-python3.10-ubuntu20.04
  markers: 'gpu'
  pytest_command: 'coverage run -m pytest'
+ deps_group: 'all'
  - name: 'gpu-2.1.0'
  container: mosaicml/pytorch:2.1.0_cu121-python3.10-ubuntu20.04
  markers: 'gpu'
  pytest_command: 'coverage run -m pytest'
+ deps_group: 'all'
  - name: 'gpu-2.1.0-flash2'
  container: mosaicml/llm-foundry:2.1.0_cu121_flash2-latest
  markers: 'gpu'
  pytest_command: 'coverage run -m pytest'
+ deps_group: 'all-flash2'
  name: ${{ matrix.name }}
  if: github.repository_owner == 'mosaicml'
  with:
@@ -45,5 +43,6 @@ jobs:
  pytest-command: ${{ matrix.pytest_command }}
  pytest-markers: ${{ matrix.markers }}
  python-version: 3.9
+ deps-group: ${{ matrix.deps_group }}
  secrets:
  mcloud-api-key: ${{ secrets.MCLOUD_API_KEY }}
6 changes: 5 additions & 1 deletion .github/workflows/pytest-gpu.yaml
@@ -22,6 +22,9 @@ on:
  required: false
  type: string
  default: 3.9
+ deps-group:
+ required: true
+ type: string
  secrets:
  mcloud-api-key:
  required: true
@@ -77,4 +80,5 @@ jobs:
  --image '${{ inputs.container }}' \
  --pytest_markers '${{ inputs.pytest-markers }}' \
  --pytest_command '${{ inputs.pytest-command }}' \
- --timeout ${{ inputs.mcloud-timeout }} ${REF_ARGS}
+ --timeout ${{ inputs.mcloud-timeout }} ${REF_ARGS} \
+ --deps_group ${{ inputs.deps-group }}
3 changes: 2 additions & 1 deletion llmfoundry/data/packing.py
@@ -5,6 +5,7 @@

  import numpy as np
  import torch
+ from composer.utils import using_torch_2
  from omegaconf import DictConfig
  from transformers import PreTrainedTokenizerBase

@@ -347,7 +348,7 @@ def profile_packing(
  dataloader_cfg.dataset.packing_ratio = None
  dataloader_cfg.drop_last = False
  dataloader_cfg.num_workers = 0
- dataloader_cfg.prefetch_factor = None
+ dataloader_cfg.prefetch_factor = None if using_torch_2() else 2
  dataloader_cfg.persistent_workers = False

  # Determine the packing_ratio values we'll try
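The `prefetch_factor` change above accommodates a difference in `torch.utils.data.DataLoader`: with `num_workers=0`, PyTorch 1.13 only accepts the default `prefetch_factor=2`, while PyTorch 2.x only accepts `None`. A minimal sketch of the same decision without the Composer dependency; `prefetch_factor_for` is a hypothetical helper that takes a version string in place of `using_torch_2()`.

```python
from typing import Optional


def prefetch_factor_for(torch_version: str) -> Optional[int]:
    """Pick a prefetch_factor that is valid when num_workers == 0."""
    # Only the major version matters; suffixes such as '+cu121' are ignored.
    major = int(torch_version.split('.')[0])
    # PyTorch 2.x expects None without workers; 1.13 expects its default of 2.
    return None if major >= 2 else 2
```

In real code the argument would typically be `torch.__version__`.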
2 changes: 1 addition & 1 deletion llmfoundry/models/mpt/configuration_mpt.py
Original file line number Diff line number Diff line change
Expand Up @@ -109,7 +109,7 @@ def __init__(
init_device (str): The device to use for parameter initialization.
logit_scale (Optional[Union[float, str]]): If not None, scale the logits by this value.
no_bias (bool): Whether to use bias in all layers.
verbose (int): The verbosity level. 0 is silent.
verbose (int): Deprecated.
dakinggg marked this conversation as resolved.
Show resolved Hide resolved
embedding_fraction (float): The fraction to scale the gradients of the embedding layer by.
norm_type (str): choose type of norm to use
use_cache (bool): Whether or not the model should return the last key/values attentions
Expand Down
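The docstring now marks `verbose` as deprecated, but this hunk does not show how the argument is handled at runtime. One common way to keep a deprecated argument accepted while signalling that it is ignored is sketched below. `ExampleConfig` and `count_deprecation_warnings` are hypothetical names for illustration and are not taken from `configuration_mpt.py`.

```python
import warnings
from typing import Optional


class ExampleConfig:
    """Hypothetical stand-in for a config class with a deprecated argument."""

    def __init__(self, verbose: Optional[int] = None):
        if verbose is not None:
            # The value is accepted for backward compatibility but ignored.
            warnings.warn(
                'verbose is deprecated and has no effect.',
                DeprecationWarning,
                stacklevel=2,
            )


def count_deprecation_warnings(**kwargs) -> int:
    """Construct ExampleConfig and return how many DeprecationWarnings fired."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter('always')
        ExampleConfig(**kwargs)
    return sum(issubclass(w.category, DeprecationWarning) for w in caught)
```

Defaulting to `None` rather than `0` lets the class distinguish "not passed" from "explicitly passed", so existing configs that omit the key stay silent.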
1 change: 0 additions & 1 deletion mcli/mcli-1b-max-seq-len-8k.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -123,7 +123,6 @@ parameters:
activation_checkpointing_reentrant: false
activation_cpu_offload: false
limit_all_gathers: true
verbose: false

# Logging
progress_bar: false
Expand Down
1 change: 0 additions & 1 deletion mcli/mcli-llama2-finetune.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -127,7 +127,6 @@ parameters:
activation_checkpointing_reentrant: false
activation_cpu_offload: false
limit_all_gathers: true
verbose: false

# Logging
progress_bar: false
Expand Down
5 changes: 5 additions & 0 deletions scripts/data_prep/convert_dataset_hf.py
Original file line number Diff line number Diff line change
Expand Up @@ -186,6 +186,11 @@ def __init__(self,
folder_split='val_xsmall',
raw_samples=3000,
truncated_samples=3000)
c4constants.splits['val_xxsmall'] = DataSplitConstants(
dakinggg marked this conversation as resolved.
Show resolved Hide resolved
hf_split='validation',
folder_split='val_xxsmall',
raw_samples=100,
truncated_samples=100)

CONSTS = {'c4': c4constants, 'the_pile': pileconstants}

Expand Down
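The new `val_xxsmall` entry registers a 100-sample validation split alongside the existing ones. A rough, self-contained sketch of that registration pattern follows. The two dataclasses are simplified stand-ins written for this example (the script defines its own classes, whose full definitions are not shown in this hunk), and `DatasetConstants` is an assumed name.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class DataSplitConstants:
    """Stand-in holding the four fields used in the diff above."""
    hf_split: str
    folder_split: str
    raw_samples: int
    truncated_samples: int


@dataclass
class DatasetConstants:
    """Stand-in container mapping a split name to its constants."""
    splits: Dict[str, DataSplitConstants] = field(default_factory=dict)


c4constants = DatasetConstants()
# Same values as the added lines: a 100-sample slice of the validation split.
c4constants.splits['val_xxsmall'] = DataSplitConstants(
    hf_split='validation',
    folder_split='val_xxsmall',
    raw_samples=100,
    truncated_samples=100)
```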
32 changes: 32 additions & 0 deletions scripts/eval/local_data/world_knowledge/triviaqa_small.jsonl
@@ -0,0 +1,32 @@
{"context": "Question: Who was the man behind The Chipmunks?\nAnswer:", "answer": "David Seville", "aliases": ["David Seville"]}
{"context": "Question: What star sign is Jamie Lee Curtis?\nAnswer:", "answer": "Scorpio", "aliases": ["Scorpio", "Skorpio", "Scorpio (disambiguation)"]}
{"context": "Question: Which Lloyd Webber musical premiered in the US on 10th December 1993?\nAnswer:", "answer": "Sunset Boulevard", "aliases": ["Sunset Blvd", "West Sunset Boulevard", "Sunset Boulevard", "Sunset Bulevard", "Sunset Blvd."]}
{"context": "Question: Who was the next British Prime Minister after Arthur Balfour?\nAnswer:", "answer": "Campbell-Bannerman", "aliases": ["Sir Henry Campbell-Bannerman", "Campbell-Bannerman", "Campbell Bannerman", "Sir Henry Campbell Bannerman", "Henry Campbell Bannerman", "Henry Campbell-Bannerman"]}
{"context": "Question: Who had a 70s No 1 hit with Kiss You All Over?\nAnswer:", "answer": "Exile", "aliases": ["Internal exile", "Exiles", "Transported for life", "Exile (politics and government)", "Voluntary exile", "Sent into exile", "Exile and Banishment", "Self-exile", "Forced exile", "Exile", "Exile in Greek tragedy", "Banish", "Banishment"]}
{"context": "Question: What claimed the life of singer Kathleen Ferrier?\nAnswer:", "answer": "Cancer", "aliases": ["Cancer pathology", "Deaths by cancer", "Anti-cancer", "Cancer (disease)", "Cancerophobia", "Malignant lesion", "Cancer medication", "Malignant tumors", "Cancer signs", "Malignant neoplasm", "Invasive (cancer)", "Malignant Neoplasms", "Malignant growth", "Sporadic cancer", "Malignant cancer", "Tumour virus", "Cancer en cuirasse", "Microtumor", "Malignant neoplasms", "Malignant tumour", "Carcinophobia", "Malignacy", "Cancer patient", "Epithelial cancers", "Solid cancer", "Cancers", "Tumor medication", "Malignant neoplastic disease", "AIDS-related cancer", "Invasive cancer", "Cancer therapy", "Cancerous tumor", "Cancer", "Financial toxicity", "Cancer diagnosis", "Cancer (medicine)", "Malignant tumor", "Cancerous", "Borderline (cancer)", "Signs of cancer", "Malignancies", "Cancer aromatase"]}
{"context": "Question: Rita Coolidge sang the title song for which Bond film?\nAnswer:", "answer": "Octopussy", "aliases": ["Kamal kahn", "List of Bond girls in Octopussy", "Magda (James Bond)", "List of James Bond allies in Octopussy", "Vijay (James Bond)", "Bond 13", "Octopussy (character)", "Penelope Smallbone", "Octopussy", "General Orlov", "Kamal Khan", "Octopussy (film)", "List of James Bond villains in Octopussy", "Jim Fanning (James Bond)"]}
{"context": "Question: To the nearest million what is the population of Australia?\nAnswer:", "answer": "18 million", "aliases": ["18million", "18 million", "eighteen million"]}
{"context": "Question: What was the last US state to reintroduce alcohol after prohibition?\nAnswer:", "answer": "Utah", "aliases": ["Utah (State)", "Forty-Fifth State", "Sports in Utah", "Climate of Utah", "Education in Utah", "UT (state)", "Utahn", "Yutas", "Geography of Utah", "Utah", "Utah, United States", "Utah state nickname", "History of mining in Utah", "State of Utah", "Religion in Utah", "Utah (U.S. state)", "Transportation in Utah", "Beehive State", "US-UT", "Utah (state)", "Forty-fifth State", "Utahan", "Politics of Utah", "Salt Lake Seagulls", "45th State", "History of Utah (to 1847)", "The Beehive State", "Youtah", "Transport in Utah"]}
{"context": "Question: Which actress was voted Miss Greenwich Village in 1942?\nAnswer:", "answer": "Lauren Bacall", "aliases": ["Bacall", "Lauren Becal", "Lauren Bacall", "Lauren Becall", "Betty J. Perske", "Loren Bacall", "Betty Joan Perske", "Betty Perske", "Betty Joan Perski"]}
{"context": "Question: What is the Japanese share index called?\nAnswer:", "answer": "Nikkei", "aliases": ["Nikkei", "Nikkei (disambiguation)"]}
{"context": "Question: What was the name of Michael Jackson's autobiography written in 1988?\nAnswer:", "answer": "Moonwalk", "aliases": ["Walk on the Moon", "Walk on the moon", "Moonwalk (disambiguation)", "Lunar walks", "Moonwalk", "Moon Walk", "Moonwalking", "Lunar walk", "Moon walk", "Moonwalks", "Moon walks", "Lunar walking", "Moon walking"]}
{"context": "Question: In which decade did stereo records first go on sale?\nAnswer:", "answer": "1930s", "aliases": ["1930’s", "Thirties", "1930s literature", "Nineteen-thirties", "1930–1939", "1930-1939", "'30s", "1930s", "1930's", "%6030s", "1930s (decade)", "The Thirties"]}
{"context": "Question: What was golfing great Ben Hogan's famous reply when he was asked how to improve one's game?\nAnswer:", "answer": "Hit the ball closer to the hole", "aliases": ["Hit the ball closer to the hole"]}
{"context": "Question: In what year's Olympics were electric timing devices and a public-address system used for the first time?\nAnswer:", "answer": "In 1912, in Stockholm", "aliases": ["In 1912, in Stockholm"]}
{"context": "Question: Why is the site of a boxing match called a ring when it's square?\nAnswer:", "answer": "Boxing rings were originally circular", "aliases": ["Boxing rings were originally circular"]}
{"context": "Question: In the very first Boston Marathon, 15 runners competed. How many finished?\nAnswer:", "answer": "$85,000", "aliases": ["eighty-five thousand distance", "$85,000", "85000 distance"]}
{"context": "Question: \"How many different animal shapes are there in the \"\"Animal Crackers\"\" cookie zoo?\"\nAnswer:", "answer": "Eighteen--two bears (one walking, one seated), a bison, camel, cougar, elephant, giraffe, gorilla, hippopotamus, hyena , kangaroo, lion, monkey, rhinoceros, seal, sheep, tier, and zebra", "aliases": ["Eighteen--two bears (one walking, one seated), a bison, camel, cougar, elephant, giraffe, gorilla, hippopotamus, hyena , kangaroo, lion, monkey, rhinoceros, seal, sheep, tier, and zebra"]}
{"context": "Question: Which volcano in Tanzania is the highest mountain in Africa?\nAnswer:", "answer": "Kilimanjaro", "aliases": ["Mawensi", "Mt. Kilimanjaro", "Kibo (volcano)", "Mount killimanjaro", "Highest mountain in Africa", "Kilimanjaro Massif", "Stella Point", "Kilimandjaro", "Kilimonjaro", "Kilimanjaro", "Gilman's Point", "Killimanjaro", "Kilima-Njaro", "Kiliminjaro", "Mt Kilimanjaro", "Kilimanjaro Mountain", "Mount Kilimanjaro", "Mawenzi", "Uhuru Peak", "Kilimanjiro", "Kaiser-Wilhelm-Spitze", "Mt Kilamanjaro", "Mount Kiliminjaro", "Mount Kilimandjaro", "Mount Kilamanjaro", "Tussock Grassland (Tanzania)", "Kilamanjaro"]}
{"context": "Question: The flag of Libya is a plain rectangle of which color?\nAnswer:", "answer": "Green", "aliases": ["Greenishly", "Avacado (color)", "Green (color)", "Rgb(0, 255, 0)", "Greenishness", "The colour green", "Greenest", "List of terms associated with the color green", "The color green", "Green", "Pastel green", "(0, 255, 0)", "Green (colour)", "Greenness"]}
{"context": "Question: Of which African country is Niamey the capital?\nAnswer:", "answer": "Niger", "aliases": ["Niger Republic", "Nigerois", "Republic Of Niger", "Republic of Niger", "The Republic of Niger", "Nigerien", "Niger (country)", "République du Niger", "Republique du Niger", "ISO 3166-1:NE", "Niger", "NG-NI"]}
{"context": "Question: Who was the director of the CIA from 1976-81?\nAnswer:", "answer": "George Bush", "aliases": ["George Bush", "George bush", "Goerge Bush", "George W. Bush (disambiguation)", "GeorgeBush", "George Bushe", "Georgebush", "Georg bush", "G Bush", "George Bush, President", "George Bush (disambiguation)", "Bush, George", "Geroge Bush"]}
{"context": "Question: Which musical featured the song The Street Where You Live?\nAnswer:", "answer": "My Fair Lady", "aliases": ["My Fair Lady (2010 film)", "Enry Iggins", "Why Can't the English%3F", "My Fair Lady", "My Fair Lady (upcoming film)", "My Fair Lady (musical)", "My fair lady", "I'm an Ordinary Man", "My Fair Lady (2014 film)", "My Fair Lady (2012 film)", "My Fair Lady (2015 film)"]}
{"context": "Question: \"Who was the target of the failed \"\"Bomb Plot\"\" of 1944?\"\nAnswer:", "answer": "Hitler", "aliases": ["Hitlerian", "Adolph Schicklgruber", "HitlerAdolf", "Hitler's medical health", "Adolf Hitle", "Hitlar", "Adolph Hiedler", "Adolf Hiedler", "Adolph Hittler", "Day of Potsdam", "Adolpf Hitler", "Adolf Hister", "Adolf Hitlier", "Adolph Hitler's health", "Hitler's health", "Hitlers", "Aldof Hilter", "HITLER", "Hitler, Adolph", "History of Adolf Hitler", "Hitler,Adolph", "Adolph Hiter", "Adolf Hittler", "Herr Hitler", "Hitler,Adolf", "Adolf Schicklegruber", "Adolf hitler", "Adlof hitler", "Adolph Schickelgruber", "Hitler Adolf", "Hitlers medical health", "HitlerAdolph", "Adolph Schicklegruber", "Adolf Hiler", "Adolf Hitler's medical condition", "Hittler", "Adolf Schickelgruber", "Adolf Hitler", "Hitler's", "Hitler, adolf", "Nazi leader", "Hitler, Adolf", "Herr Wolf", "Adolph Hitler's medical health", "Adolph Hitler", "Adolf Hitler's health", "Adolf Schicklgruber", "AdolphHitler", "Adolf Hilter", "Health of Adolf Hitler", "Adolf Hitler's medical health", "Hitler Adolph", "AdolfHitler", "Adolf HItler", "Hitlet", "Hitler adolf", "Adoff Hitler", "Adolfus Hitler", "Hitler", "Adolph hitler"]}
{"context": "Question: Who had an 80s No 1 hit with Hold On To The Nights?\nAnswer:", "answer": "Richard Marx", "aliases": ["Richard Noel Marx", "Richard Marx"]}
{"context": "Question: Who directed the classic 30s western Stagecoach?\nAnswer:", "answer": "John Ford", "aliases": ["John Ford (1895-1973)", "Sean O'Feeney", "John Ford (film director)", "Ford, John (1895-1973)", "Argosy Pictures", "John Ford statue", "John Martin O'Feeney", "John Ford (director)", "Cavalry trilogy", "John O'Feeney", "Sean Aloysius O'Feeney", "Ford, John", "John Ford"]}
{"context": "Question: Dave Gilmore and Roger Waters were in which rock group?\nAnswer:", "answer": "Pink Floyd", "aliases": ["Grey Floyd", "Pink Floyd trivia", "The Screaming Ab Dabs", "Pink flowd", "The Meggadeaths", "The Architectural Abdabs", "PINK FLOYD", "Pink Flod", "Pink Floyd", "Pink Floyd Trivia", "The Pink Floyd", "Notable or frequent contributors to pink floyd", "The Tea Set", "Pinkfloyd", "Pi5", "Pink floid", "Pink Floyd (band)", "The T Set", "Screaming abdabs", "Notable or frequent contributors to Pink Floyd", "The Megadeaths", "Pik floyd", "The Pink Floyd Sound", "Pink floyd", "The T-Set", "The Screaming Abdabs", "Clive Metcalfe", "Meggadeaths"]}
{"context": "Question: Which highway was Revisited in a classic 60s album by Bob Dylan?\nAnswer:", "answer": "61", "aliases": ["61", "sixty-one"]}
{"context": "Question: Which was the only eastern bloc country to participate in the 1984 LA Olympics?\nAnswer:", "answer": "Rumania", "aliases": ["ISO 3166-1:RO", "Romanian state", "ROMANIA", "Roumania", "Etymology of Romania", "Romainia", "Romînia", "North Danubian region", "Carpathian Danubian space", "ROU", "România", "Romanian State", "Roumanie", "Country ROM", "Rromania", "Romania", "Republic of Romania", "RO (country)", "Rumänien", "Danubian-Carpathian Area", "Rumania", "Austro-Hungarian Empire (Romania)", "Rumunia"]}
{"context": "Question: Which 90s sci fi series with James Belushi was based on Bruce Wagner's comic strip of the same name?\nAnswer:", "answer": "Wild Palms", "aliases": ["Wild Palms"]}
{"context": "Question: If I Were A Rich Man Was a big hit from which stage show?\nAnswer:", "answer": "Fiddler on the Roof", "aliases": ["Fiddler on a Roof", "Fiddler on the roof", "Sprintze", "Fiddler On the Roof", "2 life", "Fiddler On The Roof", "The Fiddler on the Roof", "Fiddler on the Roof", "Fiddler on the reoof", "Anatevka"]}
{"context": "Question: Men Against the Sea and Pitcairn's Island were two sequels to what famous novel?\nAnswer:", "answer": "Mutiny On The Bounty", "aliases": ["HMS Bounty mutineers", "Mutiny on the Bounty", "Mutiny on Bounty", "Mutiny On The Bounty", "Mutiny on the Bounty (history)", "Mutiny on the bounty", "Bounty (vessel)", "Thomas Ledward"]}
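Each line of `triviaqa_small.jsonl` is a JSON object with `context`, `answer`, and `aliases` keys. A minimal sketch of reading such records and checking a prediction against the alias list; `load_examples` and `is_correct` are hypothetical helpers, and the case-insensitive exact match is a simplification chosen for this example rather than the scoring used by the evaluation code.

```python
import json
from typing import List


def load_examples(lines: List[str]) -> List[dict]:
    """Parse JSONL lines, skipping blank ones."""
    return [json.loads(line) for line in lines if line.strip()]


def is_correct(prediction: str, example: dict) -> bool:
    """Case-insensitive exact match against any alias (a simplification)."""
    normalized = prediction.strip().lower()
    return any(normalized == alias.strip().lower()
               for alias in example['aliases'])


# One record copied from the file above.
sample = ('{"context": "Question: What is the Japanese share index called?'
          '\\nAnswer:", "answer": "Nikkei", '
          '"aliases": ["Nikkei", "Nikkei (disambiguation)"]}')
examples = load_examples([sample])
```

To read the whole file, the same helper can be fed the lines of an open file handle, for example `load_examples(open(path, encoding='utf-8').readlines())`.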
1 change: 0 additions & 1 deletion scripts/train/finetune_example/mpt-7b-arc-easy--gpu.yaml
@@ -92,7 +92,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/finetune/1b_local_data_sft.yaml
@@ -111,7 +111,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/finetune/7b_dolly_sft.yaml
@@ -99,7 +99,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/finetune/mpt-30b-instruct.yaml
@@ -101,7 +101,6 @@ fsdp_config:
  activation_cpu_offload: false
  limit_all_gathers: true
  sync_module_states: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/finetune/mpt-7b_dolly_sft.yaml
@@ -105,7 +105,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/finetune/mpt-7b_domain_adapt.yaml
@@ -90,7 +90,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/finetune/t5-small_dolly_sft.yaml
@@ -76,7 +76,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/gpt-neo-125m.yaml
@@ -92,7 +92,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/gpt-neo-125m_eval.yaml
@@ -92,7 +92,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/gpt2-small.yaml
@@ -92,7 +92,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/mpt-125m.yaml
@@ -91,7 +91,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/mpt-13b.yaml
@@ -91,7 +91,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/mpt-1b.yaml
@@ -91,7 +91,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/mpt-30b.yaml
@@ -91,7 +91,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/mpt-350m.yaml
@@ -91,7 +91,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false
1 change: 0 additions & 1 deletion scripts/train/yamls/pretrain/mpt-3b.yaml
@@ -91,7 +91,6 @@ fsdp_config:
  activation_checkpointing_reentrant: false
  activation_cpu_offload: false
  limit_all_gathers: true
- verbose: false

  # Logging
  progress_bar: false