Enable ONNX export decoder only without need to merge (#1257)
* ONNX export decoder model refactorization
* fix IO bindings
* format
* enable mpt support
* format
* add trust remote code
* fix test
* format
* fix quantization
* add test
* format
* fix optimization
* fix compatibility with legacy models
* fix style
* add export to main_export
* add legacy to ONNX export
* patch model to fix causal lm generation
* add no post process
* remove bloom caching
* fix dynamic axis for position ids
* fix external data
* add model patcher
* format
* fix bart model patcher
* fix model patcher for opt models
* fix format
* add test
* format
* fix ort docker
* add test
* fix bart model patcher
* raise when unsupported model
* add cached file
* add position warning
* fixes
* enable post process after export to remove tied weights
* comment
* remove test
* fix test
* modify model
* remove deprecated use_merged in test
* Add mistral model patcher
* add slow test
* add workflow
Showing 22 changed files with 999 additions and 714 deletions.
@@ -0,0 +1,33 @@
name: ONNX Runtime slow / Python - Test

on:
  workflow_dispatch:
  schedule:
    - cron: 0 7 * * * # every day at 7am

concurrency:
  group: ${{ github.workflow }}-${{ github.head_ref || github.run_id }}
  cancel-in-progress: true

jobs:
  build:
    strategy:
      fail-fast: false
      matrix:
        python-version: [3.8, 3.9]
        os: [ubuntu-20.04]

    runs-on: ${{ matrix.os }}
    steps:
      - uses: actions/checkout@v2
      - name: Setup Python ${{ matrix.python-version }}
        uses: actions/setup-python@v2
        with:
          python-version: ${{ matrix.python-version }}
      - name: Install dependencies for export
        run: |
          pip install .[tests,onnxruntime]
      - name: Test with unittest
        working-directory: tests
        run: |
          RUN_SLOW=1 pytest onnxruntime -s -m "run_slow" --durations=0
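The workflow's last step sets `RUN_SLOW=1` before invoking pytest, which suggests the slow tests are gated on that environment variable. The diff shown here does not include the gating code, so the following is only a minimal sketch of one common way to do this. The names `parse_flag_from_env` and `slow`, and the set of accepted truthy values, are assumptions for illustration and are not taken from this commit.

```python
import os
import unittest


def parse_flag_from_env(key: str, default: bool = False) -> bool:
    """Interpret an environment variable such as RUN_SLOW as a boolean.

    Hypothetical helper: the accepted truthy spellings are an assumption.
    """
    value = os.environ.get(key)
    if value is None:
        return default
    return value.strip().lower() in {"1", "true", "yes", "y", "on"}


def slow(test_case):
    """Skip the decorated test unless RUN_SLOW is set to a truthy value.

    The environment is read once, when the decorator is applied.
    """
    run_slow = parse_flag_from_env("RUN_SLOW")
    return unittest.skipUnless(run_slow, "test is slow; set RUN_SLOW=1")(test_case)


class ExampleExportTest(unittest.TestCase):
    def test_fast(self):
        # Always runs.
        self.assertEqual(1 + 1, 2)

    @slow
    def test_slow(self):
        # Runs only when RUN_SLOW is truthy at import time.
        self.assertTrue(True)
```

With a gate like this, a plain test run skips `test_slow`, while a run with `RUN_SLOW=1` in the environment executes it. The `-m "run_slow"` option in the workflow additionally selects tests by pytest marker, which this sketch does not model.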