-
Notifications
You must be signed in to change notification settings - Fork 255
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Frequent "failed to exchange" errors #396
Comments
We'll need more information about this, could you please record a full log? |
Does this help?
|
Not really:( We'll need to see the whole lifetime of a connection to figure out what's going on with them. Could you please share the log in private via |
Can this not be reproduced/debugged on your side by using |
All those related to H3/Quic may be caused by different quic-go library issues, I am tbh more interested in what you had with DoT. Errors in the log may be reproduced, at some point the server closes the connection and we handle this gracefully, just repeating the request. I don't yet understand where the 6 seconds timeout is getting from. If it happens with DoT than this is very strange. If it happens with H3/DoQ then it's understandable since quic-go does not handle GOAWAY frames from the server properly at the moment. |
The timeouts (or errors that take a long time) happen with TLS too:
|
I inspected the DoT upstream code and tbh I believe this is some network issue and not a bug in dnsproxy. DoT retries establishing a new connection when there's any error with the cached one but according to your log even though the new connection was established it's getting closed right away. I don't see how dnsproxy itself could do that in the code. |
I was suddenly seeing timeouts too with DoT using v0.71.1, so removed all DoT from upstream-servers last week and only using Quic/H3 now without any issues. Strange... |
@iJorgen the same DNS upstream? |
I observe the same issues on 2 different cloud provider networks, both IPv4 and IPv6, 2 different architectures (Linux amd64 and arm64), 2 different upstreams (Quad9, Cloudflare) and for both DoT and DoH. Maybe there are no actual issues and it's just log noise. |
I'm also seeing this with multiple different udp upstreams, v0.71.2 . All requests fail
Edit: nevermind, it was my firewall issue |
dnsproxy[1303399]: 2024/05/06 00:40:59 [error] dnsproxy: upstream https://1.1.1.2:443/dns-query failed to exchange
dnsproxy v0.71.1
Requests also take a long time to time out when this happens, around 6s.
It happens for both DoH (
h3://
) and DoT (tls://
) upstreams, on Linux both amd64 and arm64. Observed for both Quad9 and Cloudflare upstreams.The text was updated successfully, but these errors were encountered: