Support Tasks with custom parameters #413

adrianna-chang-shopify · 2021-05-13T20:04:27Z

This PR introduces support for custom Task parameters by leveraging ActiveModel attributes & validations.

Task authors use ActiveModel::Attributes for their parameters, and can define their own validations to ensure any parameters that are passed from the UI conform to their expectations (ie. Maintenance::ParamsTask uses a presence validator and a regex to ensure that post ids is a comma-delimited list of integers).

Approach taken

MaintenanceTasks::Task includes relevant ActiveModel modules
In the view, we render a text field for every parameter the Task supports
These params are passed from controller => Runner#run. MaintenanceTasks::Run has a field for parameters now, and we pass the parameters to the constructor
On run creation, we assign the parameters (as attributes) to a Task instance for the run, and then validate whether the attributes are valid. If they aren't, we take the errors and add them to the run, which gets propagated to the UI in the form of a flash alert.
The run how has parameters persisted, and we've also set the parameters on the Task so that any of the methods in the Task (#process, #collection, #count) now have access to these attribute

🎩

Go to http://127.0.0.1:3000/maintenance_tasks/tasks/Maintenance::ParamsTask
Run it with 1,2,3 in the Post ids field
Run it with an invalid string in the Post ids field
Run it with no value in the Post ids field

To Do

Documentation

Let me know what you think!

etiennebarrie

I think we're missing a CSS class, the field is almost invisible:

Does it make sense to start the job and raise if parameters are invalid?

Potentially the validations would be quite costly, so I think it's ok to make them in before_perform. In the mean time, if the validations are costly, it also means we do them at each interruption. Validations have a concept of context which we could use to decide which ones to run when.

I guess we could assume they're the same kind of validations that would happen in a regular controller flow, without any job involved, for now. If we realize there's a need for deeper more intensive validations, we could add them in the on_start callback to run only once.

Looks great! I have a few comments but overall really neat.

app/controllers/maintenance_tasks/tasks_controller.rb

app/models/maintenance_tasks/task_data.rb

test/dummy/app/tasks/maintenance/params_task.rb

test/jobs/maintenance_tasks/task_job_test.rb

app/tasks/maintenance_tasks/task.rb

samuelgiles

This is exciting! We were talking in https://github.com/Shopify/arrive-server/pull/17707 about how we needed maintenance_tasks to support parameters to be able to convert some ShipIt tasks and now its here 👏

app/views/maintenance_tasks/tasks/show.html.erb

test/dummy/app/tasks/maintenance/params_task.rb

dirceu · 2021-05-14T14:39:00Z

Looking at our internal usage, it seems like the most commonly used param types are:

String—usually in a constant, so validation should be both straightforward and inherently custom
Integer
Array of String
Array of Integer

The latter two are super common; so common, in fact, that it makes me think that people will just copy that validation regex over and over again. Would it make sense to have first-class support for array of string and array of numbers (maybe with a default, named validation)?

I don't think it's a blocker for this PR though!

adrianna-chang-shopify · 2021-05-14T14:42:50Z

I think we're missing a CSS class, the field is almost invisible:

@etiennebarrie what browser are you using? It renders with a dark outline for me in Chrome:

I guess we could assume they're the same kind of validations that would happen in a regular controller flow, without any job involved, for now. If we realize there's a need for deeper more intensive validations, we could add them in the on_start callback to run only once.

Good point, I overlooked the fact that these validations would run every time the job interrupts. But, this is potentially a benefit as you mentioned, since they might be evaluated in to a context that changes based on when the job first runs vs when it's being resumed. Let's leave it in #before_perform for now, and revisit moving it to #on_start if they are proving costly.

adrianna-chang-shopify · 2021-05-14T14:54:58Z

@dirceu that's a great idea! The whole reason I went with post_ids for the test Task was because I noticed that Arrays of Integers was one of the most frequent use cases. 😁 It definitely makes sense to simplify this for users and offer them an API that takes care of these details for them, even if ActiveModel::Attributes won't give us Array support directly.

Lemme see what I can whip up

etiennebarrie · 2021-05-14T16:11:28Z

what browser are you using?

Firefox!

Let's leave it in #before_perform for now

I feel like this is the least interesting choice. Either we're worried about the potential cost of validations and we do it once before the whole run, or we're not and we can do it at the controller level. "Validation" code can run inside collection to run at each interruption, or inside process to run for each item. So I think overall the most useful place for us to run the validations is directly before creating a Run in the controller. We should just document it clearly so that it's clear.

Would it make sense to have first-class support for array of string and array of numbers

Yeah that would be cool, but either we introduce our own DSL (instead of being a simple Active Model), or we add a type to ActiveModel::Registry, which is too invasive for a gem IMO. The last solution I see would be to have an extensive example creating a Type, but that's a bit overblown I think. If we're worried about the regex being wrongly copied, we could remove it 😄

Active Record deals pretty well with invalid data:

Post.where(id: "2,foo,4".split(",")).to_sql
# => "SELECT \"posts\".* FROM \"posts\" WHERE \"posts\".\"id\" IN (2, 4)"

One other thing I had missed is that this doesn't persist the parameters, so we can't pause/resume unless I'm missing something?

adrianna-chang-shopify · 2021-05-14T17:42:52Z

Okay, might be a Bulma + Firefox thing in that case @etiennebarrie ?

Gotcha, yeah I misinterpreted what you were getting at before re: validations needing to run in certain contexts. Let's just move it into the controller for now.

In terms of types, IMO a nice way to do it would be to to add ActiveModel::Type::Value classes that Task authors can use if they want, without actually adding them to the registry. This is what I've been mocking up:

module MaintenanceTasks
  module Parameters
    class IntegerArrayType < ActiveModel::Type::Value
      def cast(input)
        return unless input.present?

        validate_input(input)
        input.split(",").map(&:to_i)
      end

      private

      def validate_input(input)
        unless /\A(\s?\d+(,\s?\d+\s?)*\s?)\z/.match?(input)
          error_message = "IntegerArrayType expects expects a comma-delimited "\
            "string of integers. Input received: #{input}"
          raise TypeError.new(error_message)
        end
      end
    end
  end
end

module Maintenance
  class ParamsTask < MaintenanceTasks::Task
    attribute :post_ids, MaintenanceTasks::Parameters::IntegerArrayType.new
    validates :post_ids, presence: true
    ....

IMHO this is a much better experience than requiring every user to validate / sanitize input strings from the UI or write their own type.

One other thing I had missed is that this doesn't persist the parameters, so we can't pause/resume unless I'm missing something?

LOL I completely overlooked this 😆 Should we persist params on the run as well? It might be a nice feature to be able to see that data historically in the UI as well (beyond just logging params, making them visible in the UI should increase confidence in terms of being able to track down what happened if someone ran a Task with faulty params)

adrianna-chang-shopify · 2021-05-17T20:33:37Z

I've extracted playing around with ActiveModel::Type::Value objects for complex parameter support here: https://github.com/Shopify/maintenance_tasks/pull/414/files

And then made changes to this PR:

Added column to maintenance_tasks_runs so we can persist parameters to the Run
Instantiating the Task is now the responsibility of the Run (it no longer happens in the job), and parameter validation happens on creation of the Run
Instantiated Run in Runner with parameters, removed the passing around of parameters to the Job class

cc @etiennebarrie ready for some more feedback 🙇‍♀️

app/controllers/maintenance_tasks/tasks_controller.rb

etiennebarrie

Two main things:

CLI support
parameters vs arguments naming

But it's looking great! ✨

app/controllers/maintenance_tasks/tasks_controller.rb

etiennebarrie · 2021-05-20T14:51:23Z

app/models/maintenance_tasks/run.rb


    attr_readonly :task_name

    serialize :backtrace
+    serialize :parameters, Hash


I noticed it still gets saved as:

--- !ruby/hash:ActiveSupport::HashWithIndifferentAccess post_ids: '4,2'

Can we turn the arguments Hash into a proper Hash somehow?

Why not support a hash with indifferent access though, since this is how Rails represents hashes by default?

app/models/maintenance_tasks/run.rb

app/models/maintenance_tasks/runner.rb

app/views/maintenance_tasks/tasks/show.html.erb

app/models/maintenance_tasks/run.rb

test/dummy/app/tasks/maintenance/params_task.rb

adrianna-chang-shopify · 2021-05-20T16:44:53Z

@etiennebarrie addressed most of your comments, there were a couple I'm not sure about, feel free to follow up 😄

Re: CLI support, I think it should be almost all the way there (probably just need to add the option + document it in the CLI?) , but I'd prefer to ship that separately and move forward with this.

app/models/maintenance_tasks/run.rb

etiennebarrie

I'm still not a fan of persisting ruby/hash:ActiveSupport::HashWithIndifferentAccess in the database, and I think we should support CLI before shipping a new gem version supporting tasks with parameters, but we can figure it out later.

We still sleep where we shouldn't though.

test/application_system_test_case.rb

adrianna-chang-shopify · 2021-05-25T20:33:26Z

@etiennebarrie we could serialize it to JSON if you'd prefer?
Yes, definitely intend to have CLI support prior to cutting a new version, but this PR is large enough as is and would prefer to start a clean PR for that 😄

etiennebarrie · 2021-05-26T14:19:12Z

we could serialize it to JSON if you'd prefer

Maybe that would make more sense 🤔

app/models/maintenance_tasks/run.rb

test/models/maintenance_tasks/run_test.rb

etiennebarrie · 2021-05-27T14:37:37Z

Since we have parameters, should we still prevent running multiple tasks with different arguments? It probably requires a few changes because we've always assumed "one Task = zero or one Run", but conceptually, two runs with different arguments are similar to two different Tasks.
It came up to me while writing about #422, users could work around the lack of parallelism by having start/end parameters and running the same Task multiple time with different arguments.

I don't think it's reasonable to tackle this here, but just leaving this here.

app/models/maintenance_tasks/run.rb

adrianna-chang-shopify · 2021-05-27T18:48:28Z

@etiennebarrie re: parallelism, I'd like for us to take the time to explore viable solutions for that, with one of them being allowing multiple instances of the same Task to be run concurrently (with args). It does change some fundamental assumptions within our system as you mentioned, so I think we want to commit to it as the single solution we're going with for concurrency, rather than building it in as a temporary workaround if we intend to build out concurrency fully.

With batch support built in, I suspect a lot of the use cases pushing for concurrency support will actually just be able to use batches.

etiennebarrie · 2021-05-27T19:25:42Z

app/controllers/maintenance_tasks/tasks_controller.rb

@@ -38,6 +39,12 @@ def run

    private

+    def task_arguments
+      return {} unless params[:task_arguments].present?
+      task_attributes = Task.named(params[:id]).attribute_names


Previously a wrong Task name would result in a ActiveRecord::RecordInvalid raised from Runner and error caught in the action, now it will raise here and will result in a 500. We don't have tests for this because we didn't want controller tests and there's no way to test this with the UI (with a browser test, you'd need to see the UI and remove the task before clicking on Run). We could ignore that for now because like I mentioned in a previous review, parts of this logic should move to the Runner, with the CLI job needing similar handling, and fix it once we support arguments from the CLI.

Hum, interesting. Let's leave it for now, I can remove it in the CLI PR if we choose to go that route and we feel it's redundant. Alternatively, we could keep the use of the Strong Parameters API here and just rescue Task::NotFoundError and return {}. I'm suspecting that we may just want to check the keys in validate_task_arguments though, since we're already rescuing on Task::NotFoundError there. But I'll defer to the CLI PR.

app/models/maintenance_tasks/run.rb

app/models/maintenance_tasks/runner.rb

app/views/maintenance_tasks/tasks/show.html.erb

db/migrate/20210517131953_add_arguments_to_maintenance_tasks_runs.rb

test/application_system_test_case.rb

test/dummy/db/schema.rb

test/system/maintenance_tasks/runs_test.rb

adrianna-chang-shopify · 2021-05-27T20:59:11Z

@etiennebarrie latest round of changes in 😁 Can you let me know:

How the textarea looks in Firefox for you now with the added Bulma class? (we can also make it larger / smaller, see https://bulma.io/documentation/form/textarea/#sizes)
Validating the length of arguments - see Support Tasks with custom parameters #413 (comment)

adrianna-chang-shopify · 2021-05-28T20:33:25Z

I'm going to 🚢 , if anything additional comes up we can fix forward (along with CLI support).

adrianna-chang-shopify requested review from etiennebarrie and dirceu May 13, 2021 20:04

etiennebarrie reviewed May 13, 2021

View reviewed changes

samuelgiles reviewed May 14, 2021

View reviewed changes

app/views/maintenance_tasks/tasks/show.html.erb Outdated Show resolved Hide resolved

test/dummy/app/tasks/maintenance/params_task.rb Show resolved Hide resolved

adrianna-chang-shopify force-pushed the tasks-with-params-active-model branch 8 times, most recently from 096367c to 3c68d96 Compare May 17, 2021 20:18

adrianna-chang-shopify mentioned this pull request May 17, 2021

Introduce value objects for complex parameter types #414

Closed

adrianna-chang-shopify force-pushed the tasks-with-params-active-model branch 2 times, most recently from d1048c7 to 3701bf1 Compare May 19, 2021 17:28

adrianna-chang-shopify commented May 19, 2021

View reviewed changes

app/controllers/maintenance_tasks/tasks_controller.rb Outdated Show resolved Hide resolved

etiennebarrie reviewed May 20, 2021

View reviewed changes

adrianna-chang-shopify force-pushed the tasks-with-params-active-model branch 3 times, most recently from 86967ed to c2cb0b0 Compare May 25, 2021 16:20

adrianna-chang-shopify requested a review from etiennebarrie May 25, 2021 16:22

adrianna-chang-shopify commented May 25, 2021

View reviewed changes

app/models/maintenance_tasks/run.rb Outdated Show resolved Hide resolved

etiennebarrie reviewed May 25, 2021

View reviewed changes

test/application_system_test_case.rb Outdated Show resolved Hide resolved

adrianna-chang-shopify requested a review from etiennebarrie May 26, 2021 14:26

adrianna-chang-shopify mentioned this pull request May 26, 2021

CLI support for parameters #420

Closed

rafaelfranca reviewed May 27, 2021

View reviewed changes

app/models/maintenance_tasks/run.rb Outdated Show resolved Hide resolved

rafaelfranca reviewed May 27, 2021

View reviewed changes

app/models/maintenance_tasks/run.rb Outdated Show resolved Hide resolved

test/models/maintenance_tasks/run_test.rb Outdated Show resolved Hide resolved

test/models/maintenance_tasks/run_test.rb Outdated Show resolved Hide resolved

rafaelfranca reviewed May 27, 2021

View reviewed changes

app/models/maintenance_tasks/run.rb Outdated Show resolved Hide resolved

rafaelfranca reviewed May 27, 2021

View reviewed changes

app/models/maintenance_tasks/run.rb Outdated Show resolved Hide resolved

rafaelfranca approved these changes May 27, 2021

View reviewed changes

etiennebarrie reviewed May 27, 2021

View reviewed changes

adrianna-chang-shopify added 5 commits May 28, 2021 16:29

Add arguments column to maintenance_tasks_runs

8cbe810

Define API and add sample params task

c15394e

Tasks support parameters

b4365a6

System tests don't need to sleep in Tasks

ff41ba6

Rescue and alert on ActiveRecord::ValueTooLong errors

0d548e3

adrianna-chang-shopify force-pushed the tasks-with-params-active-model branch from 35a4314 to 0d548e3 Compare May 28, 2021 20:31

adrianna-chang-shopify merged commit 6c3b8c0 into main May 28, 2021

adrianna-chang-shopify deleted the tasks-with-params-active-model branch May 28, 2021 20:36

This was referenced Jun 2, 2021

Support Tasks with Params with CLI #428

Merged

Complex Parameter Type Support #432

Closed

shopify-shipit bot temporarily deployed to rubygems June 7, 2021 15:25 Inactive

etiennebarrie mentioned this pull request Jul 8, 2021

Show Task arguments #447

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Tasks with custom parameters #413

Support Tasks with custom parameters #413

adrianna-chang-shopify commented May 13, 2021 •

edited

Loading

etiennebarrie left a comment

samuelgiles left a comment •

edited

Loading

dirceu commented May 14, 2021 •

edited

Loading

adrianna-chang-shopify commented May 14, 2021

adrianna-chang-shopify commented May 14, 2021

etiennebarrie commented May 14, 2021

adrianna-chang-shopify commented May 14, 2021

adrianna-chang-shopify commented May 17, 2021

etiennebarrie left a comment

etiennebarrie May 20, 2021

adrianna-chang-shopify May 20, 2021

adrianna-chang-shopify commented May 20, 2021

etiennebarrie left a comment

adrianna-chang-shopify commented May 25, 2021

etiennebarrie commented May 26, 2021

etiennebarrie commented May 27, 2021

adrianna-chang-shopify commented May 27, 2021

etiennebarrie May 27, 2021

adrianna-chang-shopify May 27, 2021

adrianna-chang-shopify commented May 27, 2021

adrianna-chang-shopify commented May 28, 2021

Support Tasks with custom parameters #413

Support Tasks with custom parameters #413

Conversation

adrianna-chang-shopify commented May 13, 2021 • edited Loading

Approach taken

🎩

To Do

etiennebarrie left a comment

Choose a reason for hiding this comment

samuelgiles left a comment • edited Loading

Choose a reason for hiding this comment

dirceu commented May 14, 2021 • edited Loading

adrianna-chang-shopify commented May 14, 2021

adrianna-chang-shopify commented May 14, 2021

etiennebarrie commented May 14, 2021

adrianna-chang-shopify commented May 14, 2021

adrianna-chang-shopify commented May 17, 2021

etiennebarrie left a comment

Choose a reason for hiding this comment

etiennebarrie May 20, 2021

Choose a reason for hiding this comment

adrianna-chang-shopify May 20, 2021

Choose a reason for hiding this comment

adrianna-chang-shopify commented May 20, 2021

etiennebarrie left a comment

Choose a reason for hiding this comment

adrianna-chang-shopify commented May 25, 2021

etiennebarrie commented May 26, 2021

etiennebarrie commented May 27, 2021

adrianna-chang-shopify commented May 27, 2021

etiennebarrie May 27, 2021

Choose a reason for hiding this comment

adrianna-chang-shopify May 27, 2021

Choose a reason for hiding this comment

adrianna-chang-shopify commented May 27, 2021

adrianna-chang-shopify commented May 28, 2021

adrianna-chang-shopify commented May 13, 2021 •

edited

Loading

samuelgiles left a comment •

edited

Loading

dirceu commented May 14, 2021 •

edited

Loading