Add trt support for BF16 #195

andompesta · 2024-11-14T22:47:24Z

This pull request adds support for TensorRT integration.

Summary of Changes

CLI Support for TensorRT
- Added a new variable in src/flux/cli.py to enable TensorRT inference.
- The environment variable TRT_ENGINE_DIR specifies the directory for storing TensorRT engines.
- The environment variable ONNX_DIR specifies the directory for ONNX model exports.
- Currently supports bf16 only (fp16 and fp8 coming soon).
- Support for model offloading can be added.
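
As a rough illustration of how the two environment variables listed above could be resolved, here is a minimal sketch. The function name `resolve_trt_dirs`, the `TrtDirs` tuple, and the fallback paths are assumptions made for this example and are not taken from src/flux/cli.py.

```python
import os
from pathlib import Path
from typing import Mapping, NamedTuple


class TrtDirs(NamedTuple):
    """Hypothetical container for the two directories used by the TRT path."""

    engine_dir: Path
    onnx_dir: Path


def resolve_trt_dirs(env: Mapping[str, str] = os.environ) -> TrtDirs:
    """Read TRT_ENGINE_DIR and ONNX_DIR from the given environment mapping.

    The fallback values are assumed defaults for illustration only;
    the PR does not state which defaults (if any) are used.
    """
    engine_dir = Path(env.get("TRT_ENGINE_DIR", "./engines"))
    onnx_dir = Path(env.get("ONNX_DIR", "./onnx"))
    return TrtDirs(engine_dir=engine_dir, onnx_dir=onnx_dir)
```

Passing the environment as a mapping keeps the lookup easy to test without touching the real process environment.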
Modifications for ONNX Export
- Minor changes to src/flux/math and src/flux/modules/autoencoder.py to enable proper export to ONNX.
- Additional changes are required to address numerical stability issues within the Flux-Transformer model.
TensorRT Exporter
- Added the flux/trt/exporter package, containing code to export PyTorch models to ONNX and build TensorRT engines.
TensorRT Engine Execution
- The flux/trt/engine package is responsible for running inference with TensorRT.
TensorRT Mixin Classes
- Added the flux/trt/mixin package with mixin classes to share parameters between the model building and inference phases.
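
The idea of sharing parameters between the build and inference phases through a mixin can be sketched as follows. All class names, parameter names, and shapes here are hypothetical and only illustrate the pattern; they are not the classes in flux/trt/mixin.

```python
from typing import Tuple


class TextEncoderParamsMixin:
    """Hypothetical mixin: holds the parameters both phases must agree on."""

    def __init__(self, max_batch: int, text_maxlen: int, hidden_size: int) -> None:
        self.max_batch = max_batch
        self.text_maxlen = text_maxlen
        self.hidden_size = hidden_size

    def output_shape(self, batch_size: int) -> Tuple[int, int, int]:
        # Both exporter and engine derive the output shape from the same rule.
        assert 1 <= batch_size <= self.max_batch, "batch size out of range"
        return (batch_size, self.text_maxlen, self.hidden_size)


class TextEncoderExporter(TextEncoderParamsMixin):
    """Build phase: describes the sample input used when tracing the model."""

    def sample_input_shape(self, batch_size: int) -> Tuple[int, int]:
        return (batch_size, self.text_maxlen)


class TextEncoderEngine(TextEncoderParamsMixin):
    """Inference phase: sizes its output buffer from the shared parameters."""

    def output_buffer_numel(self, batch_size: int) -> int:
        b, t, h = self.output_shape(batch_size)
        return b * t * h
```

Because both classes inherit the same shape logic, a change to a parameter such as the maximum text length only needs to be made in one place.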
TensorRT Manager
- Introduced flux/trt/trt_manager.py as the main TensorRT management class. It handles the conversion of PyTorch models to TensorRT engines and manages the TensorRT context for inference.
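
One behavior mentioned in the commit history ("don't build already present engines") can be sketched like this. The `EngineManager` class, the `build_fn` callback, and the `.plan` file suffix are assumptions for illustration; the real flux/trt/trt_manager.py may be organized differently, and the actual TensorRT build step is replaced by a caller-supplied function.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable


class EngineManager:
    """Hypothetical manager that builds an engine only when its file is missing."""

    def __init__(self, engine_dir: Path, build_fn: Callable[[str, Path], None]) -> None:
        self.engine_dir = Path(engine_dir)
        # build_fn stands in for "export to ONNX and build a TensorRT engine".
        self.build_fn = build_fn

    def engine_path(self, stage: str) -> Path:
        # The ".plan" suffix is an assumed naming convention.
        return self.engine_dir / f"{stage}.plan"

    def ensure_engines(self, stages: Iterable[str]) -> Dict[str, Path]:
        """Return the engine path for each stage, building only missing ones."""
        self.engine_dir.mkdir(parents=True, exist_ok=True)
        paths: Dict[str, Path] = {}
        for stage in stages:
            path = self.engine_path(stage)
            if not path.exists():
                self.build_fn(stage, path)
            paths[stage] = path
        return paths
```

Calling `ensure_engines` a second time with the same stages then reuses the files written by the first call instead of rebuilding them.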

Add trt support push

andompesta added 30 commits November 14, 2024 22:32

add question logger

f8747c2

add cli input for TRT support

bf71e81

initial support for TRT engine builder

223017d

add onnx export functions taken from

fd7057e

base class for convert to onnx

0713049

add missing dependencies

d0ba09a

fix imports

959eaa7

moved to wrappers package

50149bd

moved to wrapper package and renamed into base wrapper

71c1b0d

implement load engines function

407a3ac

remove old wrapper

ca691f1

add additional parameters to base class constructor

78fbb25

implemented CLIP wrapper

3548085

remove model as a property

6b01977

enable float16 optimization

2477223

reorder arguments

969e06d

first wrapper fir onnx build

f8a183b

add load_engines with minimal parameters

5572ae7

fix get_sample_input interface format and add get_model_to_trace

713a66e

fix stage name

b4bebd2

ad imports for wrappers

e56078c

set assert error message for missing stage

f1fa53b

add assert to validate only 1 dtype is active

0afcc8e

add set_model_to_dtype function to set the correct dtype

9b600b3

call set_model_to_dtype instead of doing it manually

da78bef

T5wrapper as a copy of CLIPwrapper

2c7201f

rename get_model_to_trace to get_model

d2b2567

do_constant_folding should not be configurable

d6c18c8

removed custom configurations

89f616e

fix import

44a3280

andompesta and others added 28 commits November 14, 2024 22:36

disable autocast

f14de69

stronglytyped t5

3410d34

fix input type and max image size

2422538

max image size

9bef65b

T5 not strongly typed

a3bd8fc

testing

e615fa0

fix torch sycronize problem

611efed

don't build already present engines

13b1016

remove torch save

01b508c

removed onnx dependencies

f57b5a5

add trt dependencies

9cffa24

remove trt dependencies from toml

63e29cc

rename requirements and fix readme

c978cc3

remove unused files

3087c60

fix import format

5c2cba1

remove comments

08fbb60

add gitignore

1b4a41a

reset dependencies

a404144

add hidden setup files

a8b8478

solve ruff check

8fa1d22

fix imports with rufs

3f20508

run ruff formatter

7662313

update gitignore

4691502

simplify dependencies

deb5633

remove gitignore

1de2799

add cli formatting

64cbb8f

fix import orders

fd1455e

Merge pull request #1 from andompesta/add-trt-support-push

095ee89

Add trt support push

This was referenced Nov 24, 2024

Awful Flux Fill results. (blurry, grainy results) comfyanonymous/ComfyUI#5746

Open

Flux fill is prone to crashing comfyanonymous/ComfyUI#5715

Open
