- Your `~/.kube/config` should point to a cluster with KFServing installed.
- Your cluster's Istio Ingress gateway must be network accessible.
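
A quick sanity check for these prerequisites (the `kfserving-system` namespace below assumes a default KFServing install; adjust if yours differs):

```bash
# Confirm kubectl points at the intended cluster
kubectl config current-context

# Confirm the KFServing controller is running (assumes the default kfserving-system namespace)
kubectl get pods -n kfserving-system
```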
- Apply the CRD

  ```bash
  kubectl apply -f onnx.yaml
  ```

  Expected output:

  ```
  inferenceservice.serving.kubeflow.org/style-sample configured
  ```
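
  For reference, a sketch of what `onnx.yaml` contains, assuming the `serving.kubeflow.org/v1alpha2` API that produced the output above; the `storageUri` is illustrative:

  ```yaml
  apiVersion: "serving.kubeflow.org/v1alpha2"
  kind: "InferenceService"
  metadata:
    name: "style-sample"
  spec:
    default:
      predictor:
        onnx:
          # Illustrative path; points at a directory containing model.onnx
          storageUri: "gs://example-bucket/style-model"
  ```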
- Set up env vars

  The first step is to determine the ingress IP and ports and set `INGRESS_HOST` and `INGRESS_PORT` (one way to do this is shown below), then set the model name and service hostname:

  ```bash
  export MODEL_NAME=style-sample
  export SERVICE_HOSTNAME=$(kubectl get inferenceservice ${MODEL_NAME} -o jsonpath='{.status.url}' | cut -d "/" -f 3)
  ```
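
  A common way to set `INGRESS_HOST` and `INGRESS_PORT`, assuming a LoadBalancer-type `istio-ingressgateway` service in the `istio-system` namespace (adjust for your ingress setup):

  ```bash
  # External IP of the Istio ingress gateway
  export INGRESS_HOST=$(kubectl -n istio-system get service istio-ingressgateway -o jsonpath='{.status.loadBalancer.ingress[0].ip}')
  # Port of the gateway's http2 listener
  export INGRESS_PORT=$(kubectl -n istio-system get service istio-ingressgateway -o jsonpath='{.spec.ports[?(@.name=="http2")].port}')
  ```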
- Verify the service is healthy

  Pass the service hostname as the `Host` header so the Istio ingress gateway routes the request to the right service:

  ```bash
  SERVICE_URL=http://${INGRESS_HOST}:${INGRESS_PORT}/v1/models/${MODEL_NAME}
  curl -H "Host: ${SERVICE_HOSTNAME}" ${SERVICE_URL}
  ```
- Install dependencies

  ```bash
  pip install -r requirements.txt
  ```
- Run the sample notebook in Jupyter

  ```bash
  jupyter notebook
  ```
The sample model for this example is already uploaded and available for use. However, if you would like to modify the example to use your own ONNX model, all you need to do is upload your model as `model.onnx` to S3, GCS, or an Azure Blob.
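
In that case, point the `storageUri` in `onnx.yaml` at the location of your model; the bucket path below is hypothetical:

```yaml
onnx:
  # Hypothetical path; the directory must contain model.onnx
  storageUri: "gs://your-bucket/your-model-dir"
```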