There are a number of things we can do to improve our pipeline. They are not presented in the format of a lab, because otherwise the workshop would last for days. However, if time permits, attendees can choose which of the following topics interest them most and we can implement them.
- Use internal repo to resolve dependencies rather than Maven central repo
- Create scripts to facilitate setting pipelines
- Artifact versioning and releases
- Publish Unit Test Report
- Shared pipelines rather than one pipeline per application
- More structured, flexible and cleaner credentials files with less duplication
- Less cluttered pipelines
- Customizable pipelines depending on the type of application
- Use dedicated pipelines to build custom images
- Automatically track feature branches
- Provision services
- Blue/Green deployments
So far we have managed to build and verify our Java application, but it was too slow because Maven had to download all the dependencies from the central repository. We need to configure Maven to use our internal repository.
Hints:
- We need a new script, tasks/generate-settings.sh, that generates a standard Maven settings.xml file with the location of our local Maven repository.
- It is fully explained here
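A possible sketch of that script, assuming the internal repository URL is injected as a task parameter (MAVEN_REPO_URL is a hypothetical name, not part of the workshop code). Maven picks up $HOME/.m2/settings.xml by default, or it can be passed explicitly with mvn -s.

#!/bin/bash -e
# Sketch of tasks/generate-settings.sh: write a settings.xml whose mirror
# points at our internal repository. MAVEN_REPO_URL is assumed to be set
# by the task definition.
mkdir -p "$HOME/.m2"
cat > "$HOME/.m2/settings.xml" <<EOF
<settings>
  <mirrors>
    <mirror>
      <id>internal</id>
      <name>Internal Maven repository</name>
      <url>${MAVEN_REPO_URL}</url>
      <mirrorOf>central</mirrorOf>
    </mirror>
  </mirrors>
</settings>
EOF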
The idea is to call a script like scripts/set-pipeline.sh local app1 and have it automatically set the pipeline with the name app1 using the target local. The script assumes that we are currently logged in. It also assumes we have a local secrets.yml file; it is up to us how we generate it, e.g. using LastPass or decrypting it from an encrypted version.
The implementation of scripts/set-pipeline.sh is quite simple:
#!/bin/bash -e
syntax() {
echo "Usage: set-pipeline.sh concourse-target pipelineName"
}
PIPELINE_DIR=$(dirname "$0")/..
if [ "$#" -ne 2 ]; then
syntax
exit 1
fi
FLY_TARGET=$1
PIPELINE=$2
CREDENTIALS=credentials.yml
SECRETS=secrets.yml
echo "Setting $PIPELINE pipeline in Concourse ..."
fly -t "$FLY_TARGET" set-pipeline -p "$PIPELINE" \
  -c "$PIPELINE_DIR"/ci/application/pipeline.yml -l "$PIPELINE_DIR"/$CREDENTIALS -l "$PIPELINE_DIR"/$SECRETS
So far we have produced a snapshot version of our artifact. That snapshot version is fine for continuous integration, where we don't really care about the actual version number. However, once our product is ready to be released we need to use release versions, not snapshots. We can use any versioning scheme we like; semantic versioning is one of them: major.minor.patch.
Concourse has a resource, semver, which helps with the task of tracking the last version and incrementing it.
Our application has the current version hard-coded in the pom.xml: it is 0.0.1-SNAPSHOT. When we release that snapshot, we produce an artifact like demo-0.0.1.jar and we have to edit the pom.xml to bump the patch number to 0.0.2-SNAPSHOT.
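The relevant fragment of the pom.xml would look roughly like this (group and artifact ids taken from the demo application used throughout the workshop):

<groupId>com.example</groupId>
<artifactId>demo</artifactId>
<version>0.0.1-SNAPSHOT</version>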
Briefly about the semver resource: the idea is to keep somewhere (git, s3, swift) a file which holds a version number. If the file does not exist, we can configure the first version. The process of modifying the version number is very simple; we don't have to do it manually, the semver resource does it for us.
Let's start by configuring the version number of our application in an external file using the semver resource. The initial version number is 0.0.1. We assume that the build task always produces snapshots. Hence, it will append the tag -SNAPSHOT at the end of the current version.
- Add semantic version resource.
- name: version
  type: semver
  source:
    driver: git
    initial_version: 0.0.1
    uri: {{source-code-url}}
    private_key: {{source-code-private-key}}
    branch: {{source-code-branch}}
    file: version
- Configure the source-code resource to ignore the version file (i.e. it does not trigger when this file changes)
- name: source-code
  type: git
  source:
    uri: {{source-code-url}}
    branch: {{source-code-branch}}
    ignore_paths:
    - version
- Fetch the version and pass it on to the build task
- name: build-and-verify
  plan:
  - get: source-code
    trigger: true
  - get: version
  - task: build-and-verify
    file: source-code/tasks/build.yml
  - put: artifact-repo
    params:
      file: build-artifact/*.jar
      pom_file: source-code/pom.xml
- Tell Maven to use the semantic version
VERSION=`cat version/number`-SNAPSHOT
cd source-code
echo "Setting maven with version ${VERSION}"
mvn versions:set -DnewVersion=${VERSION}
mvn package
echo "Copying artifacts ..."
cp -r target/*.jar ../build-artifact
- Add a private key variable to secrets.yml. This is necessary so that semver can commit changes to the source-code repo where we track the version file.
...
source-code-private-key: |
  -----BEGIN RSA PRIVATE KEY-----
  ....
  ...
  -----END RSA PRIVATE KEY-----
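If we don't have a suitable key yet, one way to produce it (the key file name is an assumption, purely illustrative):

# Generate a dedicated key pair for the semver resource
ssh-keygen -t rsa -b 4096 -N '' -C "concourse-semver" -f semver_key
# Register semver_key.pub as a deploy key with write access on the git server,
# then paste the contents of semver_key into secrets.yml as source-code-private-key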
- Commit the changes
git add tasks/build.sh tasks/build.yml
git add ci/application/pipeline.yml
git commit -m "Use semver to set maven artifact versions"
- Update pipeline
scripts/set-pipeline.sh local pipeline
We should see the build task printing out the version found in the file and setting Maven to use that version.
Let's continue adding a few jobs that will assist us when we need to increment the minor or major parts.
- Add 2 jobs to the pipeline
- name: increase-major
  serial: true
  plan:
  - put: version
    params: { bump: major }
- name: increase-minor
  serial: true
  plan:
  - put: version
    params: { bump: minor }
- Update the pipeline
scripts/set-pipeline.sh local pipeline
- Bump up the minor by triggering the job increase-minor. Check that there is a file called version with the bumped-up version.
The next move is to think about the release process. After every release we are going to bump the patch number. But the release process is too complex to deal with now, and it varies depending on who you talk to.
There are no nice dashboards with JUnit reports like in Bamboo or similar tools. If we don't want to dig through the build logs to find out which test cases failed, we can add a task that builds the Maven site with just the JUnit reports and publishes the site to PCF. That site would only show the latest build, though, not a history.
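A rough sketch of such a task script, assuming the cf CLI is already logged in and the staticfile buildpack is available (the app name is an assumption). It re-runs the tests and renders the JUnit results as HTML.

#!/bin/bash -e
cd source-code
# surefire-report:report runs the tests and renders the results under target/site/;
# the extra site goal (with report generation disabled) copies the CSS skin
mvn surefire-report:report site -DgenerateReports=false
# Publish the generated site as a static app on PCF
touch target/site/Staticfile
cf push demo-test-report -p target/site -b staticfile_buildpack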
Let's recap a number of good practices we introduced here:
- Pipeline and variable files (--load-vars-from) must be version controlled
- Sensitive data (like passwords and private keys) stored in variable files should never be version controlled (or at least not in clear text)
- Pipelines and variable files should be stored along with the application (or microservice) we are building
- We should not reinvent the wheel on each application. We should build pipelines in such a way that we can use them to build any application
- We are aiming for consistent builds. Lock down pipeline and resource types' versions too
- Tasks should be defined in "Task Definition" files rather than inline in the pipeline
- Create a new git repo for the pipelines, e.g. concourse-workshop-ci
- Move the folders ci, scripts, and tasks to the pipelines repo
- The pipeline repo becomes another resource
- name: pipeline
  type: git
  source:
    uri: {{pipeline-url}}
    branch: {{pipeline-branch}}
    private_key: {{pipeline-private-key}}
- Fetch the pipeline resource because tasks are no longer in source-code but in pipeline:
- name: build-and-verify
  plan:
  - get: source-code
    trigger: true
  - get: pipeline
  - get: version
  - task: build-and-verify
    file: pipeline/tasks/build.yml
  - put: artifact-repo
    params:
      file: build-artifact/*.jar
      pom_file: source-code/pom.xml
We did not include the deploy job for brevity's sake.
- Update the task definition files so that they take pipeline as an input, because the scripts are now in that input folder.
platform: linux
image_resource:
  type: docker-image
  source:
    repository: maven
    tag: 3.3.9-jdk-8
inputs:
- name: pipeline
- name: source-code
- name: version
outputs:
- name: build-artifact
run:
  path: pipeline/tasks/build.sh
We need to make this change to the other task definition files.
- Add the new credentials to the application's credentials file. Ideally, we want to lock down the version of the pipeline rather than use the latest.
pipeline-url: [email protected]:MarcialRosales/concourse-workshop-ci
pipeline-branch: master
- Change the set-pipeline.sh script so that we can call it from the application's root folder
#!/bin/bash -e
syntax() {
echo "Usage: set-pipeline.sh concourse-target pipelineName"
}
PIPELINE_DIR=$(dirname "$0")/..
if [ "$#" -ne 2 ]; then
syntax
exit 1
fi
FLY_TARGET=$1
PIPELINE=$2
CREDENTIALS=credentials.yml
SECRETS=secrets.yml
echo "Setting $PIPELINE pipeline in Concourse ..."
fly -t "$FLY_TARGET" set-pipeline -p "$PIPELINE" \
-c "$PIPELINE_DIR"/ci/application/pipeline.yml -l $CREDENTIALS -l $SECRETS \
- Update pipeline from the application's root folder where the credentials files are.
../concourse-workshop-ci/scripts/set-pipeline.sh local app1
To add a new Java application, we would only need to:
- Add a credentials.yml which refers to the git URL of the Java application
- Add a secrets.yml
- Check out the pipeline repo so that we can call set-pipeline.sh
- Call ../concourse-workshop-ci/scripts/set-pipeline.sh local applicationName
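For example, onboarding a hypothetical app2 could look like this (file contents and placeholders are purely illustrative):

# Check out the pipeline repo next to the application
git clone <pipeline-repo-url> concourse-workshop-ci
cd app2
# credentials.yml points at app2's git repository
cat > credentials.yml <<EOF
source-code-url: <app2-git-url>
source-code-branch: master
EOF
# secrets.yml is generated out of band, e.g. from LastPass
../concourse-workshop-ci/scripts/set-pipeline.sh local app2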
Variable interpolation is quite limited in Concourse:
- flat namespace
- string manipulation is not possible, e.g. string concatenation
- lots of variable value duplication because we cannot use the value of one variable to define another
Wouldn't it be better if credentials.yml looked like this:
app:
  name: demo
  initial_version: 0.0.1
  artifact: com.example:demo:jar
source:
  uri: [email protected]:MarcialRosales/concourse-workshop-app1
  branch: master
pipeline:
  source:
    uri: http://192.168.1.36:8081/nexus/content/repositories/snapshots
    branch: master
repository:
  uri: https://registry.npm.r3pi.net
deployment:
  pcf:
    api: https://api.system-dev.chdc20-cf.solera.com
    organization: [email protected]
    space: sandbox
    skip_cert_check: false
    host: mr-demo
    domain: apps-dev.chdc20-cf.solera.com
We are going to use Spruce. Spruce is a tool that allows us to work with YAML files very easily. We can use it for simple things like variable interpolation up to more complex tasks like injecting YAML blobs, resolving variables from Vault, and many more.
- Install Spruce (https://github.com/geofffranks/spruce/releases/). Download it, move it to /usr/local/bin/spruce and chmod u+x /usr/local/bin/spruce.
- We are going to use hierarchical YAML in the credentials and secrets files.
- We use Spruce to inject variables into the pipeline.
resources:
- name: pipeline
  type: git
  source:
    uri: (( grab pipeline.source.uri ))
    branch: (( grab pipeline.source.branch ))
    private_key: (( grab pipeline.source.private_key ))
Replace every {{var}} with the corresponding (( grab equivalent.var ))
- Use Spruce to render the pipeline in set-pipeline.sh
....
tmp=$(mktemp $TMPDIR/pipeline.XXXXXX.yml)
trap 'rm $tmp' EXIT
PIPELINE_FILES="$PIPELINE_DIR/ci/application/pipeline.yml"
echo "Generating $PIPELINE pipeline ..."
spruce merge --prune meta --prune pipeline --prune app --prune deployment $PIPELINE_FILES $CREDENTIALS $SECRETS > $tmp
echo "Setting $PIPELINE pipeline in Concourse ..."
fly -t "$FLY_TARGET" set-pipeline -p "$PIPELINE" -c $tmp
As the number of jobs increases, it is better to split them into several views, where each view groups the jobs related to a certain aspect of the pipeline. We are going to create 2 groups: main and versioning. Into the versioning group we move all the jobs related to version handling.
- Add the following to the pipeline:
groups:
- name: main
  jobs:
  - build-and-verify
  - deploy
- name: versioning
  jobs:
  - increase-minor
  - increase-major
resource-types:
....
The idea is to build pipelines like Lego. Rather than having one big pipeline file, we want to build it from smaller pipeline files. This has 2 advantages:
- Pipelines are easier to read because each pipeline file focuses on one simple functionality
- We can easily customize pipelines by selecting the pieces we want to use
Say we have 3 types of applications:
- Java executable applications, i.e. those we deploy to PCF. We need to build, test, publish to the central repo, deploy, and verify that the deployed app works.
- Java libraries, i.e. pure jars of common infrastructure stuff like caching, etc. We need to build and test them and eventually publish them to the central repo.
- Static web sites. We need to package them up, publish to the central repo, deploy, and verify that they are running.
First we need to create the various pipeline files for each type of functionality:
- build Java applications and/or libraries
- deploy to PCF
- build Static sites
Second, we need scripts to set the pipeline for the different types of applications. Each script calls Spruce to merge the corresponding pipeline files into a single pipeline file, e.g. set-java-app-pipeline.sh, set-java-lib-pipeline.sh, set-static-site-pipeline.sh. A possible sketch follows.
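For instance, set-java-app-pipeline.sh might look roughly like this; the fragment file names are assumptions, and the flags mirror the existing set-pipeline.sh. Spruce merges lists of named entries (jobs, resources) across the fragments, so jobs with distinct names simply get appended.

#!/bin/bash -e
# Sketch: compose the pipeline for a Java executable application from smaller fragments
FLY_TARGET=$1
PIPELINE=$2
PIPELINE_DIR=$(dirname "$0")/..
# Hypothetical fragment files, one per concern
PIPELINE_FILES="$PIPELINE_DIR/ci/java-app/build.yml $PIPELINE_DIR/ci/java-app/deploy-pcf.yml"
tmp=$(mktemp $TMPDIR/pipeline.XXXXXX.yml)
trap 'rm $tmp' EXIT
spruce merge --prune meta --prune pipeline --prune app --prune deployment \
  $PIPELINE_FILES credentials.yml secrets.yml > $tmp
fly -t "$FLY_TARGET" set-pipeline -p "$PIPELINE" -c $tmp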
Using custom build images, as opposed to public ones, is considered a best practice: we are in full control of what's inside.
We should have one pipeline to build all the images required by the rest of the pipelines. We propose to place it under the ci/images folder.
We should also have a docker folder where we place all the Dockerfiles. The pipeline monitors these files.
Say we want to build a docker image to run Terraform. Terraform is a tool that allows us to write, plan, and create Infrastructure as Code. It is going to be extremely useful to provision the PCF services, either managed or user-provided ones.
- Create the Dockerfile docker/terraform/Dockerfile. It downloads the Terraform binary, downloads the source code of the Cloud Foundry provider, compiles it, and registers it as a plugin with Terraform.
- Create a pipeline that monitors the Dockerfile
- name: terraform-dockerfile
  type: git
  source:
    uri: (( grab pipeline.source.uri ))
    branch: (( grab pipeline.source.branch ))
    private_key: (( grab pipeline.source.private_key ))
    paths: [ docker/terraform/* ]
- Add docker image resource we use to publish it
- name: terraform-image
  type: docker-image
  source:
    username: (( grab pipeline.registry.username ))
    password: (( grab pipeline.registry.password ))
    repository: (( concat pipeline.registry.root "/terraform" ))
- Add job that fetches the dockerfile when it changes, builds the docker image and pushes it.
- name: terraform
  public: true
  plan:
  - aggregate:
    - get: terraform-dockerfile
      trigger: true
    - get: pipeline
  - put: terraform-image
    params:
      build: terraform-dockerfile/docker/terraform
- Declare credentials within the pipeline repository. We need the following structure:
pipeline:
  registry:
    root: marcialfrg
    username: dummy
    password: dummy
- Add a script set-images-pipeline.sh to set this pipeline. It is very similar to set-pipeline.sh except that we use ci/images/pipeline rather than ci/application/pipeline.
- Set up the pipeline:
scripts/set-images-pipeline.sh local images
If our applications require a number of services in PCF, such as a managed service like a RabbitMQ vhost/user or a MySQL database, or a user-provided service with the credentials to an external service, we need to provision those automatically before we deploy the application. We cannot expect to do it manually.
We will use Terraform to provision those services. We have built the Terraform docker image so we are ready to use it.
We should have a job to provision services and the deploy job should only trigger when the provision job has successfully completed.
In terraform we use a DSL to describe the final infrastructure we wish to have and Terraform builds that final infrastructure.
If we focus on Cloud Foundry, we want to declare a number of services. To talk to Cloud Foundry we need to configure a Terraform Provider. There is an open source (work in progress) provider for Cloud Foundry that allows us to create services, among many other things, in Cloud Foundry.
In Terraform we declare the infrastructure in .tf files. The file below declares the Cloud Foundry provider. Terraform has the concept of variables; for instance, we want to externalize the api endpoint, user and password so that we can use this same file for any environment.
provider "cf" {
api_url = "${var.api_url}"
user = "${var.user}"
password = "${var.password}"
}
Along with the provider.tf file we have vars.tf, where we must declare the variables:
variable "api_url" {}
variable "user" {}
variable "password" {}
variable "org" { }
variable "space" { }
Each application may have in its repo a folder (terraform) which hosts its infrastructure.
Previously we said that the pipeline will have a provision job between the build-and-verify and deploy jobs. The provision job needs the terraform files in order to create the corresponding infrastructure. How we make those terraform files available to the provision job is totally up to us. 2 ideas:
- Include them in the application's artifact, e.g. within the jar. All we have to do is configure the pom.xml to add the terraform folder:
<build>
  <resources>
    <resource>
      <targetPath>terraform</targetPath>
      <directory>terraform</directory>
    </resource>
  </resources>
  ...
</build>
- The build-and-verify job produces a release file (zip) which contains the jar and the terraform files.
The provision job calls a task, terraform, which extracts the terraform folder from the zip file and invokes terraform apply.
Terraform produces a local file which contains the state of the infrastructure after we run terraform apply. We need to save this file in a central location, called remote state. A few stores are supported: S3, Swift, Artifactory, Consul, and a few others.
If we want to use Terraform we have to configure it with a remote store, otherwise it will always try to recreate all the infrastructure.
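For example, with Terraform 0.9 or later a backend block in the terraform folder stores the state in S3 (the bucket and key here are assumptions); older versions configure remote state with the terraform remote config command instead. After adding it, terraform init has to run once so that the backend is initialized.

terraform {
  backend "s3" {
    bucket = "our-terraform-state"
    key    = "demo/terraform.tfstate"
    region = "us-east-1"
  }
}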
- Add the terraform task
platform: linux
image_resource:
  type: docker-image
  source:
    repository: marcialfrg/terraform
    # TODO PUT A TAG
inputs:
- name: pipeline
- name: artifact
params:
  TERRAFORM_PATH: "BOOT-INF/classes/terraform"
run:
  path: pipeline/tasks/terraform.sh
- Add the task script tasks/terraform.sh
#!/bin/bash
# Print the Terraform variables passed in as TF_VAR_ environment variables
env | grep TF_VAR
cd artifact
ARTIFACT=`ls *`
# Extract only the terraform folder from the (zip) artifact
unzip $ARTIFACT "$TERRAFORM_PATH/*"
cd $TERRAFORM_PATH
terraform plan
- Add the provision job to the pipeline.
- We want it to trigger when we have a new artifact built by build-and-verify
- We pass the artifact and a number of environment variables to the terraform task
- The environment variables are Terraform variables. We need to define as many variables as are defined in the terraform/vars.tf file.
- The actual values for those variables come from the application's credentials file. This is OK for now, but in the long term we don't want to make our pipeline aware of deployment credentials, mainly because environments come and go and it is a big hassle to update the pipeline when that occurs.
- name: provision
  plan:
  - get: artifact-repo
    trigger: true
    passed: [build-and-verify]
  - get: pipeline
  - task: apply
    file: pipeline/tasks/terraform.yml
    input_mapping: {artifact: artifact-repo}
    params:
      TF_VAR_api_url: (( grab deployment.dev.pcf.api ))
      TF_VAR_user: (( grab deployment.dev.pcf.username ))
      TF_VAR_password: (( grab deployment.dev.pcf.password ))
      TF_VAR_org: (( grab deployment.dev.pcf.organization ))
      TF_VAR_space: (( grab deployment.dev.pcf.space ))
- Build a docker image with the cf CLI
- Build a task that determines the current version and the next version. Current/next can be blue or green. If the app is not deployed yet, we use either.
- Update the task that builds the manifest so that it takes the name of the version to use from a file
- Build a task that promotes the new version: it plays with the routes to promote it and deletes the old version (a sketch follows this list)
- Build the pipeline so that it deploys the new version, followed by a job that verifies the new version, followed by a job that promotes the new version, and finally deletes the old version
- Build a task that executes test cases. We should bundle each release with a set of tests.
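A sketch of what the promotion task could do with the cf CLI; the app, host and domain names are assumptions and would come in as task parameters.

#!/bin/bash -e
# Promote the freshly deployed colour and retire the old one.
# NEW (e.g. demo-green), OLD (e.g. demo-blue), HOST and DOMAIN are assumed task params.
cf map-route "$NEW" "$DOMAIN" --hostname "$HOST"    # expose the new version on the public route
cf unmap-route "$OLD" "$DOMAIN" --hostname "$HOST"  # stop routing traffic to the old version
cf delete "$OLD" -f                                 # remove the old version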