-
Notifications
You must be signed in to change notification settings - Fork 591
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The first query failed right after the RW started #15730
Comments
Might be caused by Frontend error happens at
Meanwhile, in Meta:
Explore-logs-2024-03-18 14_00_50.txt |
Just confirmed with @tabVersion that Also note that the cc. @shanicky as well |
One more strange thing is that |
Get the same dns error. |
DNS error only means a node was down and other nodes can't reach it. Please seek for logs before the Node went down. |
This happened during cluster initialization when CNs register to meta and meta tried to create a control stream with CN. There seems to be some latency on dns propagation in the cluster, and then the meta failed to resolve the dns of newly registered CN. It might be the same problem as #15650 |
@wenym1 All the scheduled pipelines failed at starting the RW cluster last night like this one https://buildkite.com/risingwave-test/backfill/builds/407#018e584d-7aa2-4415-923b-5f17de0cb641. The compute node restarted several times. And the whole RW cluster can't be ready. According to the Grafana logs |
Close this one since we didn't hit this last night. Did we have any additional fixes merged for this one? @wenym1 |
Describe the bug
We have hit this error for the first queries in our daily and weekly pipelines a lot of times.
I don't think it's an environmental issue because all the pipelines failed only at the first query in the pipelines right after the RW clusters started. And it's highly reproducible. It happened in our many builds.
We have filed an issue for this #15650. And it's fixed for one or two days.
https://buildkite.com/risingwave-test/nexmark-benchmark/builds/3300
https://buildkite.com/risingwave-test/nexmark-benchmark/builds/3302
https://buildkite.com/risingwave-test/nexmark-benchmark/builds/3304
https://buildkite.com/risingwave-test/nexmark-benchmark/builds/3303
https://buildkite.com/risingwave-test/nexmark-benchmark/builds/3307
https://buildkite.com/risingwave-test/nexmark-benchmark/builds/3311
Error message/log
To Reproduce
No response
Expected behavior
No response
How did you deploy RisingWave?
No response
The version of RisingWave
nightly-20240314
Additional context
No response
The text was updated successfully, but these errors were encountered: