-
Notifications
You must be signed in to change notification settings - Fork 538
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Validation #844
Open
XiaohanZhangCMU
wants to merge
126
commits into
mosaicml:main
Choose a base branch
from
XiaohanZhangCMU:validation
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Validation #844
Changes from 69 commits
Commits
Show all changes
126 commits
Select commit
Hold shift + click to select a range
8cb6522
add validation script
xiaohanzhan-db c59c11f
update
xiaohanzhan-db 66f34eb
change token count function
2cd387b
reorganize cells
3eac3bf
Add unit tests
xiaohanzhan-db d2d9767
Add a printout for CPT
xiaohanzhan-db be25591
update question
xiaohanzhan-db 4651be7
Add questions
5cd6a94
Fix lints
xiaohanzhan-db 8e2c1f4
Merge branch 'main' into validation
XiaohanZhangCMU e6e4a81
update format
xiaohanzhan-db 34c5690
Merge branch 'validation' of github.com:XiaohanZhangCMU/llm-foundryX …
xiaohanzhan-db 1668b9a
update
xiaohanzhan-db 2219135
nb source
xiaohanzhan-db 86c6e87
add validation script
xiaohanzhan-db 678b376
update
xiaohanzhan-db 297e057
change token count function
09d0ebb
reorganize cells
460df65
Add unit tests
xiaohanzhan-db 3ffd200
Add a printout for CPT
xiaohanzhan-db 9362886
update question
xiaohanzhan-db 898e5ac
Add questions
a4bef71
Fix lints
xiaohanzhan-db 4ca9cc6
update format
xiaohanzhan-db d636a0f
update
xiaohanzhan-db 827d155
nb source
xiaohanzhan-db 6bbf3fc
Remove license insert for validation notebook
xiaohanzhan-db 4f6a4fb
Merge branch 'validation' of github.com:XiaohanZhangCMU/llm-foundryX …
xiaohanzhan-db 5966b68
Add validation utils
xiaohanzhan-db da17813
Merge branch 'main' into validation
xiaohanzhan-db 89fb909
Validation (#856)
XiaohanZhangCMU 55e4626
update utils/__init__.py to include extra validation functions
xiaohanzhan-db 45544a1
update notebook
d2797b3
update
xiaohanzhan-db 019da77
Merge branch 'validation' of github.com:XiaohanZhangCMU/llm-foundryX …
xiaohanzhan-db 756fdae
update
xiaohanzhan-db 93b5a9f
Add download remote function to util
xiaohanzhan-db b47c878
update
xiaohanzhan-db 13fd34c
update
xiaohanzhan-db 610f669
update
xiaohanzhan-db 9f2e51b
update
xiaohanzhan-db ec68f10
update
xiaohanzhan-db 1e76068
update
xiaohanzhan-db 7a5c164
update
xiaohanzhan-db e76038f
Merge branch 'main' into validation
xiaohanzhan-db 5b413f5
update
xiaohanzhan-db a1aa31f
update
xiaohanzhan-db d24fd5c
update
xiaohanzhan-db 55fce37
Add dask and dataframe_to_mds
xiaohanzhan-db 86e2412
update
xiaohanzhan-db bbfec65
update
xiaohanzhan-db b2e880d
update
xiaohanzhan-db 596443a
update
xiaohanzhan-db ea65187
Add notebook
xiaohanzhan-db 378a4e0
update
xiaohanzhan-db af6e9aa
update
4e286ec
remove script and tests, keep notebook
xiaohanzhan-db 09c4892
update
xiaohanzhan-db c82da6c
update
xiaohanzhan-db e5f83cc
update
xiaohanzhan-db 17d2b9f
update
xiaohanzhan-db 6579d55
Merge branch 'main' into validation
xiaohanzhan-db 56308ff
Merge branch 'byod/data_validation' into validation
XiaohanZhangCMU 00a51b5
Validation (#862)
XiaohanZhangCMU 4daa324
updated notebook
b809691
Merge branch 'main' into validation
xiaohanzhan-db 8b75f94
remove scripts keep notebook
xiaohanzhan-db 99bf2cd
merge with byod/data_validation
xiaohanzhan-db 9b37063
Validation (#866)
XiaohanZhangCMU 22014d6
update notebook. rephrase.
d9f28aa
merged
xiaohanzhan-db f1fa63c
Validation (#867)
XiaohanZhangCMU 43c8ac9
update
xiaohanzhan-db b8ac771
Add response tokens
xiaohanzhan-db 1b9681c
update
xiaohanzhan-db 16883c2
merge
xiaohanzhan-db a9218d6
Validation (#875)
XiaohanZhangCMU c7567f1
update
xiaohanzhan-db 1764b72
Disable MDSWrite, return token counts
xiaohanzhan-db 808ced5
Change plot settings
xiaohanzhan-db 26ae516
Fix conflict
xiaohanzhan-db a212ee8
update notebook
d279817
update
xiaohanzhan-db f1cfe9e
Validation (#898)
XiaohanZhangCMU dbe3f4e
update notebook
3005718
update
xiaohanzhan-db 8498662
Validation (#900)
XiaohanZhangCMU f5b900c
update
02d0979
Merge branch 'byod/data_validation' of https://github.com/mosaicml/ll…
xiaohanzhan-db 205e405
Validation (#901)
XiaohanZhangCMU 2f883a7
update notebook
0315caf
update
xiaohanzhan-db 1a510ff
update pip install link
xiaohanzhan-db 530a55a
Change done file location
xiaohanzhan-db 5493295
Validation (#902)
XiaohanZhangCMU 81c3757
Create the dest folder
xiaohanzhan-db 5090e13
Validation (#1025)
XiaohanZhangCMU f88917d
update notebook
xiaohanzhan-db 4c86f74
update
xiaohanzhan-db 962974b
Merge branch 'byod/data_validation' into validation
XiaohanZhangCMU 9fd91cf
Validation (#1027)
XiaohanZhangCMU 67f7b4c
Merge pull request #1 from mosaicml/byod/data_validation
XiaohanZhangCMU 28cd2e6
update notebook
xiaohanzhan-db 944b260
Validation (#1028)
XiaohanZhangCMU 9a19d8a
fix conflict
xiaohanzhan-db a6b2ae0
Validation (#1031)
XiaohanZhangCMU de90934
update token_counts
xiaohanzhan-db 5dfd30c
Validation (#1032)
XiaohanZhangCMU 61adb43
update pip install list
xiaohanzhan-db c404dc7
Validation (#1033)
XiaohanZhangCMU c77bdf6
fix
xiaohanzhan-db ad71cc0
update
xiaohanzhan-db 9bc3a39
fix token counts
xiaohanzhan-db 9ec582e
Expose validate chat
xiaohanzhan-db 734008e
Expose more
xiaohanzhan-db 51f2eef
update
xiaohanzhan-db 7b6956d
expose
xiaohanzhan-db 60ed7de
add collate
xiaohanzhan-db fba1dcb
Fix
xiaohanzhan-db 58185ba
Fix conflict
xiaohanzhan-db 8e8f431
Validation (#1034)
XiaohanZhangCMU 24f3d9e
update notebook
xiaohanzhan-db 714002d
Fix conflict
xiaohanzhan-db 1640f30
Validation (#1035)
XiaohanZhangCMU b053363
Merge branch 'byod/data_validation' of https://github.com/mosaicml/ll…
xiaohanzhan-db 7e1d567
update notebook
xiaohanzhan-db File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
was there a particular reason this was excluded just curious?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For databricks to render a python script as a notebook, it needs the script to start with #databricks notebook source. This change asks pre-commit to skip adding the license header to the script.