Move the device timer conversion in cvk_device instead of commands #630

rjodinchr · 2023-10-30T21:27:50Z

No description provided.

kpet

What's the motivation for this change? Performance improvements?

kpet · 2024-03-17T11:57:10Z

src/device.cpp

-    if (sync_host > sync_dev) {
-        return (sync_host - sync_dev) + dev;
+cl_ulong cvk_device::device_timer_to_host(cl_ulong dev) {
+    if (dev > m_sync_dev) {


Suggested change

-    if (dev > m_sync_dev) {
+    if (dev <= m_sync_dev) {

Isn't your condition backwards? You want a new pair of aligned timestamps only when you've wrapped around and the timestamp to convert to the host's time base is smaller than the device timestamp used as a reference.

I don't think it is backwards.
The idea is to get new sync points when the timestamp we want to convert is greater than the last device timestamp we synced against, because there might have been a slight divergence since the last sync point.

So you'll end up requesting a new pair of timestamps for pretty much each command since the device timestamp will almost always be greater than the last one obtained when getting a host/device pair. The only time where you're not requesting a host/device pair is when the device timestamp wraps around. At which point I don't see much motivation for the change.
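To make the disagreement above concrete, here is a minimal sketch that counts how many host/device timestamp pairs each condition requests. It is not clvk's implementation: `timer_converter`, `resync_policy`, `sync_source` and `count_resyncs` are hypothetical names, and the fake clock (host running 1000 ticks ahead of the device) is an assumption made only for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <utility>

// Illustrative only: a stand-in for the device-level converter, not clvk's code.
enum class resync_policy {
    when_newer,          // resync if dev > m_sync_dev (the patch as written)
    when_older_or_equal  // resync if dev <= m_sync_dev (the suggested change)
};

class timer_converter {
public:
    // Returns a {device, host} timestamp pair sampled at the same instant.
    using sync_source = std::function<std::pair<uint64_t, uint64_t>()>;

    timer_converter(sync_source source, resync_policy policy)
        : m_source(std::move(source)), m_policy(policy) {}

    uint64_t device_timer_to_host(uint64_t dev) {
        bool stale = (m_policy == resync_policy::when_newer)
                         ? dev > m_sync_dev
                         : dev <= m_sync_dev;
        if (!m_have_sync || stale) {
            std::pair<uint64_t, uint64_t> sync = m_source();
            m_sync_dev = sync.first;
            m_sync_host = sync.second;
            m_have_sync = true;
            ++m_resyncs;
        }
        // Unsigned arithmetic wraps modulo 2^64, so this single expression
        // covers both sync_host > sync_dev and sync_host <= sync_dev.
        return dev + (m_sync_host - m_sync_dev);
    }

    unsigned resyncs() const { return m_resyncs; }

private:
    sync_source m_source;
    resync_policy m_policy;
    uint64_t m_sync_dev = 0;
    uint64_t m_sync_host = 0;
    bool m_have_sync = false;
    unsigned m_resyncs = 0;
};

// Converts three increasing device timestamps and reports how many sync
// pairs were requested under the given policy.
unsigned count_resyncs(resync_policy policy) {
    uint64_t clock = 0; // fake device clock; host runs 1000 ticks ahead
    timer_converter conv(
        [&clock] { return std::make_pair(clock, clock + 1000); }, policy);
    const uint64_t stamps[] = {10, 20, 30};
    for (uint64_t dev : stamps) {
        clock = dev + 5; // conversion happens shortly after the command ran
        assert(conv.device_timer_to_host(dev) == dev + 1000);
    }
    return conv.resyncs();
}
```

With this toy clock, `count_resyncs(resync_policy::when_newer)` requests a new pair for every command, while `count_resyncs(resync_policy::when_older_or_equal)` requests one only for the first. Because the fake clocks never drift, the sketch does not model the divergence concern raised in the reply; it only illustrates the request frequency.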


rjodinchr · 2024-03-17T12:32:30Z

> What's the motivation for this change? Performance improvements?

From what I remember, this is needed to fix issues introduced by the new pattern that comes with using timeline semaphores.
But since, without timeline semaphores, this is just a different implementation that works equally well, I thought it would be better to submit it first in order to reduce the size of the timeline semaphore PR.

rjodinchr · 2024-03-18T17:22:28Z

I have rerun the CTS without this patch (but with the timeline semaphore changes) to remind myself why this patch is needed with timeline semaphores.

In the implementation I have, there is a race condition between the application thread and the executor thread. It can lead to the application thread querying an event's profiling info, because the event is complete, before the executor has computed that profiling info.
But this event might be related to a cvk_command inside of a cvk_command_batch. As the sync points are taken care of by the batch command, that made things complicated. This is how I dealt with it: moving the sync point to the device instead of the batch command.
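The idea described above can be sketched as follows: if the sync point lives in the device, an event only needs to keep its raw device timestamp, and whichever thread asks for profiling info can convert it on demand. This is a hedged illustration, not clvk's code. `device_sketch`, `event_sketch` and `profiling_start` are hypothetical names, the mutex is an assumption about how shared state might be protected, the fake sync pair replaces a real calibrated-timestamp query, and the resync policy debated in the review is deliberately left out.

```cpp
#include <cstdint>
#include <mutex>

// Hypothetical device that owns the host/device sync point.
class device_sketch {
public:
    // The fake sync pair and the fail flag stand in for a real driver query.
    device_sketch(uint64_t sync_dev, uint64_t sync_host, bool fail = false)
        : m_fake_dev(sync_dev), m_fake_host(sync_host), m_fail(fail) {}

    // Converts a device timestamp to the host time base.
    // Returns false if no sync pair could be obtained.
    bool device_timer_to_host(uint64_t dev, uint64_t& host) {
        std::lock_guard<std::mutex> lock(m_lock); // callable from any thread
        if (!m_have_sync) {
            if (m_fail) {
                return false; // report the error instead of converting
            }
            m_sync_dev = m_fake_dev;
            m_sync_host = m_fake_host;
            m_have_sync = true;
        }
        host = dev + (m_sync_host - m_sync_dev);
        return true;
    }

private:
    std::mutex m_lock;
    uint64_t m_fake_dev;
    uint64_t m_fake_host;
    bool m_fail;
    uint64_t m_sync_dev = 0;
    uint64_t m_sync_host = 0;
    bool m_have_sync = false;
};

// Hypothetical event: it stores only the raw device timestamp, so it does
// not depend on a batch having computed a converted value beforehand.
struct event_sketch {
    device_sketch* dev;
    uint64_t raw_start;

    bool profiling_start(uint64_t& host) const {
        return dev->device_timer_to_host(raw_start, host);
    }
};
```

In this arrangement an event inside a batch needs no batch-level sync state: for example, an event with `raw_start` of 150 on a device synced at device time 100 and host time 5000 converts to host time 5050 regardless of which thread makes the call.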

rjodinchr force-pushed the pr/device-timer branch from 1d00885 to cc94ea5 Compare November 13, 2023 15:57

rjodinchr force-pushed the pr/device-timer branch from cc94ea5 to 0d4475e Compare March 11, 2024 15:20

kpet reviewed Mar 17, 2024


rjodinchr added 2 commits May 14, 2024 09:23

Move the device timer conversion in cvk_device instead of commands

f267858

handle error in device_timer_to_host

9d8994e

rjodinchr force-pushed the pr/device-timer branch from 0d4475e to 9d8994e Compare May 14, 2024 07:30

rjodinchr requested a review from kpet May 14, 2024 07:30
