-
Notifications
You must be signed in to change notification settings - Fork 531
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Encapsulate code converting std::chrono::duration to timeval #1915
base: master
Are you sure you want to change the base?
Encapsulate code converting std::chrono::duration to timeval #1915
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I found two places where ToTimeval() is needed today. This PR converts both. The added ToTimeval() API is likely to be used in upcoming PRs (and future code) as well.
struct timeval tv; | ||
tv.tv_sec = std::chrono::seconds(PingerTimeout).count(); | ||
tv.tv_usec = 0; | ||
auto tv = ToTimeval(PingerTimeout); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This tv
cannot be constant because it is fed to select(2) that requires a non-constant timeval timeout parameter.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The new converter handles tv_usec
which adds a performance regression calculating it to be 0
on every loop iteration. That being the only real "gain" pushes this into the "not valueable yet" type of changes for me.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The new converter handles
tv_usec
which adds a performance regression calculating it to be0
on every loop iteration.
AFAICT, the above statement is based on two false assumptions:
False assumption A: Proposed compiled code will be different from official compiled code. In reality, modern optimizing compilers see enough in this context to generate identical assembly in both cases (even at -O1 optimization level)!
False assumption B: This loop is performance sensitive (at the precision level of a few integer operations added or removed). In reality, this is an I/O-waiting loop in a pinger helper. Adding a few quick ops to that loop would not change pinger performance, even under heavy load.
In summary, this PR does not "add a performance regression".
N.B. Proposed loop code is also clearly better -- it is a bit easier to read, and it supports reasonable PingerTimeout values that the official code silently mishandles. We may never need to support fractional seconds in this context, but better code is still an improvement.
That being the only real "gain" pushes this into the "not valueable yet" type of changes for me.
Please do not block PRs just because you do not see value in them (when other core developers do see value).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I still do not see enough need to add this to Squid.
outtv.tv_sec = std::chrono::duration_cast<std::chrono::seconds>(duration).count(); | ||
const auto totalUsec = std::chrono::duration_cast<std::chrono::microseconds>(duration); | ||
outtv.tv_usec = (totalUsec % std::chrono::microseconds(1s)).count(); | ||
outtv = ToTimeval(timer.total()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FTR; I do not believe this one is a true need. The doMsec
cases should all be replaced by an output printing chrono::duration
instead.
Better to add that new display logic and make this %code the first usage of it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The
doMsec
cases should all be replaced by an output printingchrono::duration
instead.
The changes in this PR do not preclude those future assembly() improvements. In fact, they can be viewed as making those future improvements slightly easier!
Better to add that new display logic and make this %code the first usage of it.
Correctly improving assemble() to get rid of doMsec and similar code would be good, but it is a complex, large task that is orthogonal to the small improvement offered by this PR. We should not view this as "either or" or "better X than Y" situation. Both "X" and "Y" are moving Squid code forward. "X" got here first; let's merge it.
struct timeval tv; | ||
tv.tv_sec = std::chrono::seconds(PingerTimeout).count(); | ||
tv.tv_usec = 0; | ||
auto tv = ToTimeval(PingerTimeout); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The new converter handles tv_usec
which adds a performance regression calculating it to be 0
on every loop iteration. That being the only real "gain" pushes this into the "not valueable yet" type of changes for me.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I still do not see enough need to add this to Squid.
Hopefully, one core developer seeing enough need is enough to tick the "enough need" box. Please unblock this PR.
outtv.tv_sec = std::chrono::duration_cast<std::chrono::seconds>(duration).count(); | ||
const auto totalUsec = std::chrono::duration_cast<std::chrono::microseconds>(duration); | ||
outtv.tv_usec = (totalUsec % std::chrono::microseconds(1s)).count(); | ||
outtv = ToTimeval(timer.total()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The
doMsec
cases should all be replaced by an output printingchrono::duration
instead.
The changes in this PR do not preclude those future assembly() improvements. In fact, they can be viewed as making those future improvements slightly easier!
Better to add that new display logic and make this %code the first usage of it.
Correctly improving assemble() to get rid of doMsec and similar code would be good, but it is a complex, large task that is orthogonal to the small improvement offered by this PR. We should not view this as "either or" or "better X than Y" situation. Both "X" and "Y" are moving Squid code forward. "X" got here first; let's merge it.
struct timeval tv; | ||
tv.tv_sec = std::chrono::seconds(PingerTimeout).count(); | ||
tv.tv_usec = 0; | ||
auto tv = ToTimeval(PingerTimeout); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The new converter handles
tv_usec
which adds a performance regression calculating it to be0
on every loop iteration.
AFAICT, the above statement is based on two false assumptions:
False assumption A: Proposed compiled code will be different from official compiled code. In reality, modern optimizing compilers see enough in this context to generate identical assembly in both cases (even at -O1 optimization level)!
False assumption B: This loop is performance sensitive (at the precision level of a few integer operations added or removed). In reality, this is an I/O-waiting loop in a pinger helper. Adding a few quick ops to that loop would not change pinger performance, even under heavy load.
In summary, this PR does not "add a performance regression".
N.B. Proposed loop code is also clearly better -- it is a bit easier to read, and it supports reasonable PingerTimeout values that the official code silently mishandles. We may never need to support fractional seconds in this context, but better code is still an improvement.
That being the only real "gain" pushes this into the "not valueable yet" type of changes for me.
Please do not block PRs just because you do not see value in them (when other core developers do see value).
Old and new code snippets yield the same optimized assembly: https://godbolt.org/z/4rr3WPc9e Old and new getCurrentTime() have nearly identical optimized assembly (with minor differences due to current_time being a global that new getCurrentTime() is still using to compute the other two globals). Both assembly version have eight memory accesses: https://godbolt.org/z/E9hc48G1K The number of assembly operations can be reduced further by not using current_time global to compute current_dtime and squid_curtime: https://godbolt.org/z/4rr3WPc9e
While working on something else, I accidentally found one more place that converts std::chrono::duration to timeval by hand and updated it in PR branch commit 4e7a5ab. |
No description provided.