Vectorized infer_beam_batch for improved performance #697
This pull request introduces key optimizations and improvements in `model_48px` aimed at increasing the efficiency of the OCR pipeline:

- **Vectorized `infer_beam_batch`:** The `infer_beam_batch` function in `model_48px` has been vectorized, significantly improving the speed of OCR inference. The original `infer_beam_batch` is also kept, so you can easily switch between the two for testing purposes.
- **Refactored encoders and decoders with `forward` methods:** Extracted the forward methods for both encoders and decoders. This refactoring enables more straightforward model exporting (e.g., to ONNX), allowing further optimization of inference performance and integration with deployment platforms like Triton Inference Server.
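The core idea behind vectorizing beam search is to fold the per-beam Python loops into array operations: cumulative beam scores are broadcast against next-token log-probabilities, and a single top-k per batch item replaces nested iteration. The sketch below is a minimal NumPy illustration of one such expansion step with hypothetical shapes; it is not the actual `model_48px` code, which operates on PyTorch tensors.

```python
import numpy as np

def beam_step(beam_scores, log_probs, beam_width):
    """One vectorized beam-search expansion step (illustrative).

    beam_scores: (batch, beam) cumulative log-probs of live beams
    log_probs:   (batch, beam, vocab) next-token log-probs
    Returns new scores, parent-beam indices, and token indices,
    each of shape (batch, beam_width).
    """
    batch, beam, vocab = log_probs.shape
    # Broadcast cumulative scores onto every candidate token, then
    # flatten (beam, vocab) so top-k runs once per batch item.
    cand = (beam_scores[:, :, None] + log_probs).reshape(batch, beam * vocab)
    # argpartition selects the k best per row without a full sort
    topk = np.argpartition(-cand, beam_width - 1, axis=1)[:, :beam_width]
    scores = np.take_along_axis(cand, topk, axis=1)
    # order the k survivors by score, descending
    order = np.argsort(-scores, axis=1)
    topk = np.take_along_axis(topk, order, axis=1)
    scores = np.take_along_axis(scores, order, axis=1)
    # integer div/mod recovers which beam and which token each came from
    return scores, topk // vocab, topk % vocab
```

The same pattern maps directly onto `torch.topk` over a `(batch, beam * vocab)` tensor, which is what makes the batched GPU version fast.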
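The reason extracting `forward` methods helps with exporting is that tracers like `torch.onnx.export` capture a pure tensors-in/tensors-out function, while beam search involves data-dependent Python control flow that cannot be baked into a static graph. The sketch below shows the resulting shape of the refactor with hypothetical stand-in classes (no real model code): each stage exposes a self-contained `forward`, and the search loop stays outside as a thin orchestrator.

```python
class Encoder:
    def forward(self, image):
        # stand-in for the real conv/transformer feature extractor;
        # a pure function of its inputs, so it is exportable on its own
        return [px * 0.5 for px in image]

class Decoder:
    def forward(self, memory, token):
        # stand-in: next-token scores from encoder memory + last token
        return [m + token for m in memory]

def infer(image, steps):
    """Orchestrator: the Python-side loop calls the exportable forwards.

    Only Encoder.forward / Decoder.forward would be exported (e.g. to
    ONNX for Triton); this loop runs as ordinary host code.
    """
    enc, dec = Encoder(), Decoder()
    memory = enc.forward(image)
    token, out = 0, []
    for _ in range(steps):
        scores = dec.forward(memory, token)
        token = scores.index(max(scores))  # greedy pick, for brevity
        out.append(token)
    return out
```

In the PyTorch version the two `forward`s become `nn.Module`s that can each be passed to `torch.onnx.export`, while the beam loop drives them from the serving side.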