feat: Add ModelStreamInfer to Triton MethodInfos #81

Open
wants to merge 1 commit into
base: main


@Legion2 Legion2 commented Jan 21, 2024

Motivation

Fix #80

Modifications

Add ModelStreamInfer to the Triton MethodInfos.

Result

inference.GRPCInferenceService/ModelStreamInfer gRPC requests can be sent to Triton, which enables the use of Triton backends and models that require streaming.
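
To illustrate the effect of the change, here is a minimal Python sketch of a method registry keyed by full gRPC method name. It is not the adapter's actual code: `MethodInfo`, `build_method_infos`, `is_routable`, and the `id_injection_path` value are hypothetical stand-ins, and it assumes that a method missing from the registry cannot be routed to the runtime. Only the service name and `ModelStreamInfer` come from this PR; `ModelInfer` and `ModelMetadata` are included as other methods of the same Triton gRPC service.

```python
from dataclasses import dataclass

# Service name as it appears in the PR description.
TRITON_SERVICE = "inference.GRPCInferenceService"


@dataclass(frozen=True)
class MethodInfo:
    """Hypothetical stand-in for the per-method metadata the adapter reports."""

    # Assumption: field path in the request message that carries the model id.
    id_injection_path: tuple = (1,)


def build_method_infos(include_stream_infer: bool) -> dict:
    """Build a registry mapping full gRPC method names to their metadata."""
    methods = ["ModelInfer", "ModelMetadata"]
    if include_stream_infer:
        # The entry this PR adds.
        methods.append("ModelStreamInfer")
    return {f"{TRITON_SERVICE}/{name}": MethodInfo() for name in methods}


def is_routable(method_infos: dict, full_method: str) -> bool:
    """Assumption: only methods listed in the registry are forwarded."""
    return full_method in method_infos


before = build_method_infos(include_stream_infer=False)
after = build_method_infos(include_stream_infer=True)
stream_method = f"{TRITON_SERVICE}/ModelStreamInfer"
```

Under these assumptions, `is_routable(before, stream_method)` is false and `is_routable(after, stream_method)` is true, which mirrors the behavior described in the Result section.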

@oss-prow-bot oss-prow-bot bot requested review from joerunde and njhill January 21, 2024 15:24

oss-prow-bot bot commented Jan 21, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: Legion2
Once this PR has been reviewed and has the lgtm label, please assign ckadner for approval by writing /assign @ckadner in a comment. For more information see: The Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@Legion2 Legion2 force-pushed the triton-stream-infer branch from 81f6d8b to 9ce4393 Compare January 21, 2024 15:44
@rafvasq rafvasq requested review from rafvasq and ckadner and removed request for joerunde January 29, 2024 14:57
@rafvasq rafvasq changed the title Add ModelStreamInfer to triton MethodInfos feat: Add ModelStreamInfer to Triton MethodInfos Jan 29, 2024
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer
1 participant