diff --git a/teps/0090-matrix.md b/teps/0090-matrix.md new file mode 100644 index 000000000..5899ef57d --- /dev/null +++ b/teps/0090-matrix.md @@ -0,0 +1,629 @@ +--- +status: proposed +title: Matrix +creation-date: '2021-10-13' +last-updated: '2021-10-27' +authors: +- '@jerop' +- '@pritidesai' +--- + +# TEP-0090: Matrix + + +- [Summary](#summary) +- [Motivation](#motivation) + - [Goals](#goals) + - [Non-Goals](#non-goals) + - [Use Cases](#use-cases) + - [1. Parallel Kaniko Build](#1-parallel-kaniko-build) + - [2. Parallel Docker Build](#2-parallel-docker-build) + - [3. Parallel Vault Reading](#3-parallel-vault-reading) + - [4. Testing Strategies](#4-testing-strategies) + - [5. Test Sharding](#5-test-sharding) + - [6. Build and Test Combination](#6-build-and-test-combination) + - [Requirements](#requirements) + - [Related Work](#related-work) + - [GitHub Actions](#github-actions) + - [Jenkins](#jenkins) + - [Argo Workflows](#argo-workflows) + - [Ansible](#ansible) +- [References](#references) + + +## Summary + +Today, users cannot supply varying `Parameters` to execute a `PipelineTask`, that is, fan out a `PipelineTasks`. +To solve this problem, this TEP aims to enable executing the same `PipelineTask` with different combinations of +`Parameters` specified in a `matrix`. `TaskRuns` or `Runs` will be created with variables substituted with each +combination of the `Parameters` in the `matrix`. This `matrix` construct will enable users to specify concise but +powerful `Pipelines`. Moreover, it would improve the composability, scalability, flexibility and reusability of +*Tekton Pipelines*. + +## Motivation + +Users can specify `Parameters`, such as artifacts' names, that they want to supply to [`PipelineTasks`][tasks-docs] +at runtime. However, they don't have a way to supply varying `Parameters` to the a `PipelineTask`. Today, users would +have to duplicate that `PipelineTask` in the `Pipelines` specification as many times as the number of varying +`Parameters` that they want to pass in. This is limiting and challenging because: +- it is tedious and creates large `Pipelines` that are hard to understand and maintain. +- it does not scale well because users have to add a `PipelineTask` entry to handle an additional `Parameter`. +- it is error-prone to duplicate `PipelineTasks`' specifications, and it may be challenging to debug those errors. + +A common scenario is [a user needs to build multiple images][kaniko-example-1] from one repository using the +[kaniko][kaniko-task] `Task` from the *Tekton Catalog*. Let's assume it's three images. The user would have to specify +that `Pipeline` with the kaniko `PipelineTask` duplicated, as such: + +```yaml +apiVersion: tekton.dev/v1beta1 +kind: Pipeline +metadata: + name: kaniko-pipeline +spec: + workspaces: + - name: shared-workspace + params: + - name: image-1 + description: reference of the first image to build + - name: image-2 + description: reference of the second image to build + - name: image-3 + description: reference of the third image to build + tasks: + - name: fetch-repository + taskRef: + name: git-clone + workspaces: + - name: output + workspace: shared-workspace + params: + - name: url + value: https://github.com/tektoncd/pipeline + - name: subdirectory + value: "" + - name: deleteExisting + value: "true" + - name: kaniko-1 + taskRef: + name: kaniko + runAfter: + - fetch-repository + workspaces: + - name: source + workspace: shared-workspace + params: + - name: IMAGE + value: $(params.image-1) + - name: kaniko-2 + taskRef: + name: kaniko + runAfter: + - fetch-repository + workspaces: + - name: source + workspace: shared-workspace + params: + - name: IMAGE + value: $(params.image-2) + - name: kaniko-3 + taskRef: + name: kaniko + runAfter: + - fetch-repository + workspaces: + - name: source + workspace: shared-workspace + params: + - name: IMAGE + value: $(params.image-3) +``` + +As shown in the above example, the user would have to add another `PipelineTask` entry to build another image. Moreover, +they can easily make errors while duplicating the *kaniko* `PipelineTasks`' specifications. A user described their +experience with these challenges and limitations as such: + +> "Right now I'm doing all of this by just having a statically defined single `Pipeline` with a `Task` and then +delegating to code/loops within that single `Task` to achieve the `N` things I want to do. This works, but then +I'd prefer the concept of a single Task does a single thing, rather than overloading it like this. Especially +when viewing it in the dashboard etc, things get lost" ~ [bitsofinfo][kaniko-example-2] + +To solve this problem, this TEP aims to enable executing the same `PipelineTask` with different combinations of +`Parameters` specified in a `matrix`. `TaskRuns` or `Runs` will be created with variables substituted with each +combination of the `Parameters` in the `matrix`. This `matrix` construct will enable users to specify concise but +powerful `Pipelines`. Moreover, it would improve the composability, scalability, flexibility and reusability of +*Tekton Pipelines*. + +Note that one of the use cases we aim to cover before the [*Tekton Pipelines V1*][v1] release includes: + +> "A `matrix` build pipeline (build, test, … with some different env’ variables — using CustomResource)". + +### Goals + +The main goal of this TEP is to enable executing a `PipelineTask` with different combinations of `Parameters` +specified in a `matrix`. + +### Non-Goals + +The following are out of scope for this TEP: + +1. Terminating early when one of the `TaskRuns` or `Runs` created in parallel fails. As is currently, running `TaskRuns` +and `Runs` have to complete execution before termination. +2. Controlling the concurrency of `TaskRuns` or `Runs` created in a given `matrix`. This will be addressed more broadly +in Tekton Pipelines ([tektoncd/pipeline: issue#2591][issue-2591], [tektoncd/experimental: issue#804][issue-804]). +3. Configuring the `TaskRuns` or `Runs` created in a given `matrix` to execute sequentially. This remains an option that +we can explore this later. +4. Dynamically generating `TaskRuns` or `Runs` by using `Results` based on previous `TaskRuns` or `Runs` in the `matrix`. +This is a feature we can support later based on experiences and feedback supporting `matrix` execution without `Results`. +5. Excluding generating a `TaskRun` or `Run` for a specific combination in the `matrix`. This can be handled using +guarded execution through `when` expressions. This remains an option we can explore later if needed. +6. Including generating a `TaskRun` or `Run` for a specific combination in the `matrix`. This can be handled by adding +the items that produce that combination into the `matrix`, and using guarded execution through `when` expressions to +exclude the combinations that should be skipped. This remains an option we can explore later if needed. + +### Use Cases + +#### 1. Parallel Kaniko Build + +As a `Pipeline` author, I [need to build multiple images][kaniko-example-1] from one repository using the same +`PipelineTask`. I use the [*kaniko*][kaniko-task] `Task` from the *Tekton Catalog*. Let's assume it's three images. + +```text +image-1 +image-2 +image-3 +... +``` + +I want to pass in varying `Parameter` values for `IMAGE` to create three `TaskRuns`, one for each image. + +``` + clone + | + v + -------------------------------------------------- + | | | + v v v + ko-build-image-1 ko-build-image-2 ko-build-image-3 +``` + +Read more in [user experience report #1][kaniko-example-1] and [user experience report #2][kaniko-example-2]. + +In the future, I may need to specify a *get-images* `PipelineTask` that fetches the images from a configuration file in +my repository and produces a `Result` that is used to dynamically execute `TaskRuns` for each image. +This is a non-goal (#5) in this TEP. + +``` + clone + | + v + get-dir + | + v + -------------------------------------------------- + | | | + v v v + ko-build-image-1 ko-build-image-2 ko-build-image-3 +``` + +#### 2. Parallel Docker Build + +As a `Pipeline` author, I have several dockerfiles in my repository. + +```text +/ docker / Dockerfile + python / Dockerfile + Ubuntu / Dockerfile +... +``` + +I have a *clone* `PipelineTask` that fetches the repository to a shared `Workspace`. I want to pass in an array +`Parameter` with directory names of the dockerfiles to *docker-build* `PipelineTask` which runs docker build and push. + +``` + clone + | + v + get-dir + | + v + -------------------------------------------------- + | | | + v v v + docker-build-1 docker-build-2 docker-build-3 +``` + +Read more in the [user experience report][docker-example]. + +In the future, I may need to specify a *get-dir* `PipelineTask` that fetches the dockerfiles directory names from a +configuration file in my repository and produces a `Result` that is used to dynamically execute `TaskRuns` for +each dockerfile. This is a non-goal (#5) in this TEP. + +``` + clone + | + v + get-dir + | + v + -------------------------------------------------- + | | | + v v v + docker-build-1 docker-build-2 docker-build-3 +``` + +#### 3. Parallel Vault Reading + +As a `Pipeline` author, I have several vault paths in my repository. + +```text +path1 +path2 +path3 +... +``` + +I have a *vault-read* `PipelineTask` that I need to run for every vault path and get the secrets in each of them. +As such, I need to fan out the *vault-read* `PipelineTask` N times, where N is the number of vault paths. + +``` + clone + | + v + -------------------------------------------------- + | | | + v v v + vault-read-1 vault-read-2 vault-read-3 +``` + +Read more in the [user experience report][vault-example]. + +In the future, I may need to specify a *get-vault-paths* `PipelineTask` that fetches the vault paths from a configuration +file in my repository and produces a `Result` that is used to dynamically execute `TaskRuns` for each vault path. +This is a non-goal (#5) in this TEP. + +``` + clone + | + v + get-vault-paths + | + v + -------------------------------------------------- + | | | + v v v + vault-read-1 vault-read-2 vault-read-3 +``` + +#### 4. Testing Strategies + +As a `Pipeline` author, I have several test types that I want to run. + +```text +code-analysis +unit-tests +e2e-tests +... +``` + +I have a *test* `PipelineTask` that I need to run for each test type - the `Task` runs tests based on a `Parameter`. +I need to run this *test* `PipelineTask` for multiple test types that are defined in my repository. + +``` + clone + | + v + -------------------------------------------------- + | | | + v v v + test-code-analysis test-unit-tests e2e-tests +``` + +In the future, I may need to specify a *tests-selector* `PipelineTask` that fetches the test types from a configuration +file in my repository and produces a `Result` that is used to dynamically execute the `TaskRuns` for each test type. +This is a non-goal (#5) in this TEP. + +``` + clone + | + v + tests-selector + | + v + -------------------------------------------------- + | | | + v v v + test-code-analysis test-unit-tests e2e-tests +``` + +#### 5. Test Sharding + +As a `Pipeline` author, I have a large test suite that's slow (e.g. browser based tests) and I need to speed it up. +I need to split up the test suite into groups, run the tests separately, then combine the results. + +```text +[ +[test_a, test_b], +[test_c, test_d], +[test_e, test_f], +] +``` + +I choose to use the [Golang Test][golang-test] `Task` from the *Tekton Catalog*. Let's assume we've updated it to +support running a subset of tests. So I pass in divide the tests into shards and pass them to the `PipelineTask` +through an array `Parameter`. + +``` + clone + | + v + -------------------------------------------------- + | | | + v v v + test-ab test-cd test-ef +``` + +In the future, I may need to specify a *test-sharding* `PipelineTask` that divides the tests across shards and produces +a `Result` that is used to dynamically execute the `TaskRuns` for each shard. This is a non-goal (#5) in this TEP. + +``` + clone + | + v + tests-sharding + | + v + -------------------------------------------------- + | | | + v v v + test-ab test-cd test-ef +``` + +#### 6. Build and Test Combination + +As a `Pipeline` author, I need to run tests on a combination of platforms and browsers. + +```text +# platforms +linux +windows +mac + +# browsers +chrome +firefox +safari +``` + + +``` + clone + | + v + -------------------------------------------------------------------------------------------------------------------------- + | | | | | | | | | + v v v v v v v v v +linux-chrome linux-firefox linux-safari windows-chrome windows-firefox windows-safari mac-chrome mac-firefox mac-safari +``` + +In the future, I may need to specify a *get-platforms* and *get-browsers* `PipelineTask` that fetches the platforms and +browsers from a configuration file in my repository and produces `Results` that is used to dynamically execute the +`TaskRuns` for each combination of platform and browser. This is a non-goal (#5) in this TEP. + +``` + clone + | + v + -------------------------------------------------- + | | + v v + get-plaforms get-browsers + | | + v v + -------------------------------------------------------------------------------------------------------------------------- + | | | | | | | | | + v v v v v v v v v +linux-chrome linux-firefox linux-safari windows-chrome windows-firefox windows-safari mac-chrome mac-firefox mac-safari +``` + +### Requirements + +1. A `matrix` of `Parameters` can be specified to execute a `PipelineTask` in `TaskRuns` or `Runs` with variables +substituted with the combinations of `Parameters` in the `matrix`. +2. The `TaskRuns` or `Runs` executed from the `matrix` of `Parameters` should be run in parallel. +3. The `Parameters` in the `matrix` should not use `Results` from previous `PipelineTasks` to dynamically generate +`TaskRuns` and `Runs` for the current `PipelineTask`. +4. Excluding the execution of a `TaskRun` or `Run` with a specific combination in the `matrix` using `when` expressions +should be supported. + +### Related Work + +The `matrix` construct is related to the `map`, `fan out` and `matrix` constructs available in programming languages and +computer systems. In this section, we explore related work on `matrix` constructs in other continuous delivery systems. + +#### GitHub Actions + +GitHub Actions allows users to define a `matrix` of job configurations - which creates jobs with after substituting +variables in each job. It also allows users to include or exclude combinations in the build `matrix`. + +For example: + +```yaml +runs-on: ${{ matrix.os }} +strategy: + matrix: + os: [macos-latest, windows-latest, ubuntu-18.04] + node: [8, 10, 12, 14] + exclude: + # excludes node 8 on macOS + - os: macos-latest + node: 8 + include: + # includes node 15 on ubuntu-18.04 + - os: ubuntu-18.04 + node: 15 +``` + +GitHub Actions workflows syntax also allows users to: +- cancel in-progress jobs is one of the `matrix` jobs fails +- specify maximum number of jobs to run in parallel + +Read more in the [documentation][github-actions]. + +#### Jenkins + +Jenkins allows users to define a configuration `matrix` to specify what steps to duplicate. It also allows users to +exclude certain combinations in the `matrix` + +For example: + +```yaml +pipeline { + agent none + stages { + stage("build") { + matrix { + axes { + axis { + name 'OS_VALUE' + values "linux", "windows", "mac" + } + axis { + name 'BROWSER_VALUE' + values "firefox", "chrome", "safari", "ie" + } + } + excludes { + exclude { + axis { + name 'OS_VALUE' + values 'linux' + } + axis { + name 'BROWSER_VALUE' + values 'safari' + } + } + exclude { + axis { + name 'OS_VALUE' + notValues 'windows' + } + axis { + name 'BROWSER_VALUE' + values 'ie' + } + } + } + + stages { + stage("build") { + steps { + echo "Do build for OS=${OS_VALUE} - BROWSER=${BROWSER_VALUE}" + } + } + } + } + } + } +} +``` + +Read more in the [documentation][jenkins-docs] and related [blog][jenkins-blog]. + +#### Argo Workflows + +Argo Workflows allows users to iterate over: +- a list of items as static inputs +- a list of sets of items as static inputs +- parameterized list of items or list of sets of items +- dynamic list of items or lists of sets of items + +Here's an example from the [documentation][argo-workflows]: +```yaml +apiVersion: argoproj.io/v1alpha1 +kind: Workflow +metadata: + generateName: loops-param-result- +spec: + entrypoint: loop-param-result-example + templates: + - name: loop-param-result-example + steps: + - - name: generate + template: gen-number-list + # Iterate over the list of numbers generated by the generate step above + - - name: sleep + template: sleep-n-sec + arguments: + parameters: + - name: seconds + value: "{{item}}" + withParam: "{{steps.generate.outputs.result}}" + + # Generate a list of numbers in JSON format + - name: gen-number-list + script: + image: python:alpine3.6 + command: [python] + source: | + import json + import sys + json.dump([i for i in range(20, 31)], sys.stdout) + + - name: sleep-n-sec + inputs: + parameters: + - name: seconds + container: + image: alpine:latest + command: [sh, -c] + args: ["echo sleeping for {{inputs.parameters.seconds}} seconds; sleep {{inputs.parameters.seconds}}; echo done"] +``` + +Read more in the [documentation][argo-workflows]. + +#### Ansible + +Ansible allows users to execute a task multiple times using `loop`, `with_` and `until` keywords. + +For example: + +```yaml +- name: Show the environment + ansible.builtin.debug: + msg: " The environment is {{ item }} " + loop: + - staging + - qa + - production +``` + +Read more in the [documentation][ansible]. + +## References + +- [Task Loops Experimental Project][task-loops] +- Issues: + - [#2050: `Task` Looping inside `Pipelines`][issue-2050] + - [#4097: List of `Results` of a `Task`][issue-4097] + +[task-loops]: https://github.com/tektoncd/experimental/tree/main/task-loops +[issue-2050]: https://github.com/tektoncd/pipeline/issues/2050 +[issue-4097]: https://github.com/tektoncd/pipeline/issues/4097 +[tasks-docs]: https://github.com/tektoncd/pipeline/blob/main/docs/tasks.md +[custom-tasks-docs]: https://github.com/tektoncd/pipeline/blob/main/docs/pipelines.md#using-custom-tasks +[kaniko-example-1]: https://github.com/tektoncd/pipeline/issues/2050#issuecomment-625423085 +[kaniko-task]: https://github.com/tektoncd/catalog/tree/main/task/kaniko/0.5 +[kaniko-example-2]: https://github.com/tektoncd/pipeline/issues/2050#issuecomment-671959323 +[docker-example]: https://github.com/tektoncd/pipeline/issues/2050#issuecomment-814847519 +[vault-example]: https://github.com/tektoncd/pipeline/issues/2050#issuecomment-841291098 +[tep-0050]: https://github.com/tektoncd/community/blob/main/teps/0050-ignore-task-failures.md +[argo-workflows]: https://github.com/argoproj/argo-workflows/blob/7684ef4a0c5f57e8723dc8e4d3a17246f7edc2e6/examples/README.md#loops +[github-actions]: https://docs.github.com/en/actions/learn-github-actions/workflow-syntax-for-github-actions +[ansible]: https://docs.ansible.com/ansible/latest/user_guide/playbooks_loops.html#loops +[jenkins-docs]: https://plugins.jenkins.io/matrix-project/ +[jenkins-blog]: https://www.jenkins.io/blog/2019/11/22/welcome-to-the-matrix/ +[golang-test]: https://github.com/tektoncd/catalog/tree/main/task/golang-test/0.2 +[issue-804]: https://github.com/tektoncd/experimental/issues/804 +[issue-2591]: https://github.com/tektoncd/pipeline/issues/2591 +[v1]: https://github.com/tektoncd/pipeline/issues/3548 diff --git a/teps/README.md b/teps/README.md index d0c9f84e2..8e03858d5 100644 --- a/teps/README.md +++ b/teps/README.md @@ -225,3 +225,4 @@ This is the complete list of Tekton teps: |[TEP-0081](0081-add-chains-subcommand-to-the-cli.md) | Add Chains sub-command to the CLI | proposed | 2021-08-31 | |[TEP-0084](0084-endtoend-provenance-collection.md) | end-to-end provenance collection | proposed | 2021-09-16 | |[TEP-0085](0085-per-namespace-controller-configuration.md) | Per-Namespace Controller Configuration | proposed | 2021-10-14 | +|[TEP-0090](0090-matrix.md) | Matrix | proposed | 2021-10-27 |