-
Create a cluster using the IBM Cloud Container Service. For this Code Pattern we will only need to use the Free tier which gives us access to a Kubernetes cluster with 1 worker node.
Note: This could take anywhere from 10 minutes to half an hour.
-
(Optional Step) The instructions to install the command line tools are optional and not required to deploy the model using this tutorial. While not required, you may choose to install them while waiting for the cluster to be created.
-
On your cluster dashboard, select
Worker Nodes
in the sidebar and take note of thePublic IP Address
for the node.Note: The IP address will not be displayed until the cluster is fully deployed.
-
Click the
Kubernetes Dashboard
button to open the dashboard. Hit the+ Create
button in the top right corner. Go toCreate from form
. Fill in the following fields:App name: <Select a name> Container image: quay.io/codait/max-image-caption-generator:latest Number of pods: 1 Service: External Port: 5000 | Target Port: 5000 | Protocol: TCP
Then click
Deploy
-
In the sidebar, scroll down to
Services
underDiscovery and Load Balancing
. Click on the service that was created in the previous step. Under theInternal endpoints
, make note of the port number that is within the range30000
to32676
. This port is theNodePort
. -
The MAX model should be accessible at
<Public IP>:<NodePort>
.