
Bulk Running the Inference Pipeline #387

Open
2 tasks
CarsonDavis opened this issue Sep 20, 2023 · 0 comments
Labels: PI 24.1 Oct, Nov, Dec 2023

Comments


Description

Right now, the inference pipeline has only been tested on small batches of URLs, like 150. Since we will need to run it on the millions of URLs that exist in the SDE, it will need to be able to run without overloading the server.

For this issue we need to do three things

  • test the pipeline on the bigger collections that will surface the current failure types
  • make any modifications to the batch-size processing needed to run it successfully
  • write code that can run the pipeline in batch on all our collections
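The batch run described above might be sketched as follows. This is a hypothetical outline, not the actual SDE codebase: the `run_inference` call is a placeholder for the real pipeline entry point, and the default batch size of 150 simply mirrors the size already tested. The throttling pause between batches is one simple way to avoid overloading the server.

```python
import time
from typing import Iterator, List


def chunked(urls: List[str], batch_size: int) -> Iterator[List[str]]:
    """Yield successive batches of at most batch_size URLs."""
    for start in range(0, len(urls), batch_size):
        yield urls[start:start + batch_size]


def run_in_batches(urls: List[str],
                   batch_size: int = 150,
                   pause_seconds: float = 0.0) -> int:
    """Run the (hypothetical) inference pipeline over all URLs in batches.

    Returns the number of batches processed.
    """
    batches = 0
    for batch in chunked(urls, batch_size):
        # run_inference(batch)  # placeholder for the real pipeline call
        batches += 1
        if pause_seconds:
            # throttle between batches so the server is not overloaded
            time.sleep(pause_seconds)
    return batches
```

Running all collections would then be a loop over this helper, one collection's URL list at a time, with `batch_size` and `pause_seconds` tuned based on what the larger-collection tests reveal.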

Implementation Considerations

  • type your first consideration here

Deliverable

  • code to run on all our data
  • any updates to the batch process that are necessary

Dependencies

depends on

@code-geek code-geek added the PI 24.1 Oct, Nov, Dec 2023 label Jan 9, 2024