Add health check when deploying services and rollback in case of errors #28

imnotteixeira · 2022-02-12T17:57:46Z

Currently, when deploying services, we donºt know if the application is actually running. We only know the container was started. Recently we had an issue in configuration, where the NIJobs API couldn't connect to the DB, causing it to restart indefinitely.

To fix this, we should add a line to the project configs file specifying a health check endpoint (this only works for web applications, but that's the only services we have right now, so it should be fine).

Then, the deployment script would call that endpoint (using curl, for example), and if it didn't return 200, it would cause the deployment to fail, which would trigger a rollback (it would go back to the previous commit, and deploy that one)

Since services can take some time to boot up, this health check should have a retry mechanism (try to call the endpoint every 10 seconds up to 5 minutes, and if no call returned 200, rollback)

The text was updated successfully, but these errors were encountered:

imnotteixeira linked a pull request Feb 12, 2022 that will close this issue

Implement Health Checks w/ Rollback #29

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add health check when deploying services and rollback in case of errors #28

Add health check when deploying services and rollback in case of errors #28

imnotteixeira commented Feb 12, 2022

Add health check when deploying services and rollback in case of errors #28

Add health check when deploying services and rollback in case of errors #28

Comments

imnotteixeira commented Feb 12, 2022