Skip to content

Commit

Permalink
add note on avoiding low data training runs
Browse files Browse the repository at this point in the history
  • Loading branch information
camallen committed Mar 30, 2023
1 parent ca2c4b2 commit a3271fa
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions app/sidekiq/retrain_zoobot_job.rb
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,12 @@ def perform(context_id)
Export::TrainingData.new(training_data_export).run
end

# this is where we could intercept the training job submission
# to avoid a training run if there isn't enough data for a viable model
# one idea would be to check the number of rows in the training data export attached file
# or even better we store the number of exported rows in the training data export model
# https://github.com/zooniverse/kade/issues/62

# create a new training job record to track the batch training job
training_job = create_training_job(training_data_export.storage_path, training_context.workflow_id)
# submit the export training job to the batch training service
Expand Down

0 comments on commit a3271fa

Please sign in to comment.