Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

print the connecting IP whenever possible for better diagnosis into network issues #14207

Closed
lmatz opened this issue Dec 26, 2023 · 5 comments
Closed
Assignees
Milestone

Comments

@lmatz
Copy link
Contributor

lmatz commented Dec 26, 2023

2023-12-26T06:56:37.992865433Z ERROR pgwire::pg_protocol: error when process message error=Failed to run the query: Scheduler error: gRPC request failed: Unknown error: transport error error.sources=[Scheduler error: gRPC request failed: Unknown error: transport error, Scheduler error: gRPC request failed: Unknown error: transport error, gRPC request failed: Unknown error: transport error, transport error, connection error: host unreachable, host unreachable]

One user reported this. We are not sure what led to this error yet, but it is likely to be some network glitch caused by environmental issues or possibly even the cloud vendor. Haven't seen this error before.

The host unreachable error makes it unclear to tell which host it is, which makes it harder to debug.

@MrCroxx
Copy link
Contributor

MrCroxx commented Dec 27, 2023

Close via #14227 #14229

@MrCroxx MrCroxx closed this as completed Dec 27, 2023
@hzxa21
Copy link
Collaborator

hzxa21 commented Jan 29, 2024

Do we still need #14227 given that #14229 is merged? I noticed that there are many logs related to hyper::client::connect::http: connecting to xxx shown up even when the cluster is running happily.

@lmatz
Copy link
Contributor Author

lmatz commented Feb 2, 2024

True, observed the same a while ago

Do we still need #14227 given that #14229 is merged?

technically I am not so sure if the latter covers the former? 🤔 @MrCroxx do you know

@MrCroxx
Copy link
Contributor

MrCroxx commented Feb 7, 2024

The latter one reports all resolved DNS addresses. The former reports reports the exact address that the client actually used.

Disable one of them is okay if the information is enough for debugging.

@hzxa21
Copy link
Collaborator

hzxa21 commented Mar 15, 2024

The latter one reports all resolved DNS addresses. The former reports reports the exact address that the client actually used.

Disable one of them is okay if the information is enough for debugging.

+1 for disabling this hyper log because hyper is used in gRPC/aws-sdk/opendal and the connection creation log can flood the log file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants