Build MxTransporter in GKE

This section describes the procedure for building MxTransporter in the GKE environment.
All commands to create each GCP resources are wrapped in Makefile.

Prepare

Command

Need to use following commands. The version listed is the verified version.

bq       v2.0.71
docker   v20.10.8
gcloud   v355.0.0
kubectl  v1.22.1
helm     v3.6.3
make     v3.81

Environment variables files

Before starting the construction, create .env and .secrets.env in the current directory by referring to .env.template and secrets.env.template.

If you want to export change streams to BigQuery or Pub/Sub, write the following description in .secrets.env.

EXPORT_DESTINATION=bigquery

or

EXPORT_DESTINATION=pubsub

or

EXPORT_DESTINATION=bigquery,pubsub

Pubsub Ordering (optional)

If you want to order message in pubsub, set PUBSUB_ORDERING_BY env.
Specify the field name of one of the Change Streams.
https://cloud.google.com/pubsub/docs/ordering

NOTICE ordering message can cause performance issues.
see https://medium.com/google-cloud/google-cloud-pub-sub-ordered-delivery-1e4181f60bc8

BigQuery schema (optional)

If you want to export change streams to BigQuery, specify the following table schema.

Table schema

[
    {
      "mode": "NULLABLE",
      "name": "id",
      "type": "STRING"
    },
    {
      "mode": "NULLABLE",
      "name": "operationType",
      "type": "STRING"
    },
    {
      "mode": "NULLABLE",
      "name": "clusterTime",
      "type": "TIMESTAMP"
    },
    {
      "mode": "NULLABLE",
      "name": "fullDocument",
      "type": "STRING"
    },
    {
      "mode": "NULLABLE",
      "name": "ns",
      "type": "STRING"
    },
    {
      "mode": "NULLABLE",
      "name": "documentKey",
      "type": "STRING"
    },
    {
      "mode": "NULLABLE",
      "name": "updateDescription",
      "type": "STRING"
    }
]

Procedure

Optional: Setup BigQuery

You need to create a dataset and a table.

Create a dataset. It's interactive, so specify the dataset name.

$ make create-bigquery-dataset

Create a table. It's interactive, so specify the dataset name and table name.

$ make create-bigquery-table

Set the expiration date of the partition set in table. The expiration value is specified in .env as BIGQUERY_TABLE_PARTITIONING_EXPIRATION_TIME. It's interactive, so specify the dataset name and table name.

$ make set-bigquery-table-expiration-date

1. Create GKE cluster, node group and IAM resources.

$ make build

2. Create kubernetes secrets.

Collect the environment variables written in secrets.env and create them in a cluster as kubernetes secrets.

$ make secrets

3. Deploy kubernetes resources.

If you have set an Optional environment variable in .secrets.env, edit the container env in ./templates/stateless.yaml. Set only the environment variables required for the container running on kubernetes.

Create kubernetes resources with helm.

Following command creates a StatefulSet, HeadlessService, Horizontal Pod Autoscaler, and PVC.

$ make deploy

The following processing is performed here.
・build docker image.
・push docker image to gcr repository.
・build helm variables.
・deploy with helm templates.

4. Upgrade kubernetes resources.

You can upgrade kubernetes resources with the following command.

Note that if you do not update GCR_REPO_TAG, the new docker image will be referenced and the container will not be created.

$ make upgrade

Architects

A pod is created for each collection, and a persistent volume is linked to each pod. Since the StatefulSet is created, even if the pod stops, you can get the change streams by referring to the resume token saved in the persistent volume again.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Build MxTransporter in GKE

Prepare

Command

Environment variables files

Pubsub Ordering (optional)

BigQuery schema (optional)

Procedure

Architects

Files

README.md

Latest commit

History

README.md

File metadata and controls

Build MxTransporter in GKE

Prepare

Command

Environment variables files

Pubsub Ordering (optional)

BigQuery schema (optional)

Procedure

Architects