Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate the slowdown of SMS while testing in production #484

Open
1 task
jimleroyer opened this issue Dec 18, 2024 · 0 comments
Open
1 task

Investigate the slowdown of SMS while testing in production #484

jimleroyer opened this issue Dec 18, 2024 · 0 comments
Labels
Bug | Bogue Bug related task. High Priority | Haute priorité Reliability Task related to reliability.

Comments

@jimleroyer
Copy link
Member

jimleroyer commented Dec 18, 2024

Describe the bug

We could not uphold the higher SMS sending limit in a recent performance test in production. We want to make sure that we can uphold it and the system is behaving as expected.

Bug Severity

(SEV-1 Critical, SEV-2 Major, SEV-3 Minor, SEV-4 Low)

SEV-2

To Reproduce

  1. Launch performance test in production (4000 fake 613-500-01** numbers.
  2. See the send rate be around 530, well below the target 2000.

Expected behavior

The sending rate limit should reach above 2,000 sending rate. But we currently see 500-600 sending rate during the tests which is much lower than our previous results and expectations.

Impact

Describe overall user/system impact to consider when prioritizing this issue.

If applicable

Impact on Notify users: Potential lower SMS sending rate limit.

Impact on Recipients: Slower reception of SMS

Impact on Notify team: Outside normal parameter of the system, leading to less reliable system and less trust and capability for our team.

QA

QA items to verify the bug is resolved:

  • Run performance test in production.

Additional context

  • In August we determined that a task rate limit of 10/s and 6 sms delivery pods (3 primary and 3 scalable) would reach a send rate of approximately 2000 sms / minute (test: upload a csv of 40000 sms).
  • However in December we saw that the same test only sent about 600 sms / minute.
@jimleroyer jimleroyer added Bug | Bogue Bug related task. High Priority | Haute priorité Reliability Task related to reliability. labels Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug | Bogue Bug related task. High Priority | Haute priorité Reliability Task related to reliability.
Projects
None yet
Development

No branches or pull requests

1 participant