Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

investigate expected cost of cloud deployment, and as well as possible approaches for measuring cost #1425

Open
Tracked by #1
jmartin-sul opened this issue Sep 25, 2024 · 2 comments

Comments

@jmartin-sul
Copy link
Member

jmartin-sul commented Sep 25, 2024

from sul-dlss/speech-to-text#3

see if back of the envelope estimates could rule out some of our possible implementations, or if they all look close enough in cost to keep pursuing all paths.

this involves two orthogonal sets of choices: how to deploy, and how to measure cost of deployed services.

deployment choices (not all mutually exclusive, but some are):

cost estimate approach:

@jmartin-sul
Copy link
Member Author

@edsu feel free to claim this and put it in "in progress" if you're already working on it. anyone else, feel free to also assign yourself simultaneously and coordinate, if you're exploring paths besides tagged deployment of a docker container (or @edsu can say if he'd like to claim this whole issue for now, to prevent duplicated effort)

@jmartin-sul jmartin-sul changed the title turn cost investigation bullet point from https://github.com/sul-dlss/speech-to-text/issues/3 into separate ticket, see if back of the envelope estimates could rule out some of our possible implementations, or if they all look close enough in cost to keep pursuing all paths (ECS, always on EC2 VM, thin vs heavy whisper docker container, etc). cost calculator vs deploy and test w/ tagged resources? investigate expected cost of cloud deployment, and as well as possible approaches for measuring cost Sep 25, 2024
@jmartin-sul
Copy link
Member Author

we might need a bit more of a handle on what ECS deployment and scaling look like, before we can really measure cost.

but we can also get a sense of how much it costs to caption a certain duration of media in ECS, and same in EC2. might be able to get a rough sense of cost by measuring that in the absence of ability to scale.

first few months might see the service on all the time anyway, since we probably have a substantial backlog to caption.

@andrewjbtw is interested in unit cost, and using that to determine how much we need to throttle e.g. bulk actions or preassembly batches for captioning. throttling may come naturally from our ECS or EC2 setup? e.g. set a limit on the number of STT containers, which are essentially single threaded (poll, work, poll, work, ...).

@jmartin-sul jmartin-sul transferred this issue from sul-dlss/speech-to-text Nov 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant