I recently did some profiling of GPU decoding, and I found that creating CUDA contexts is quite expensive because it requires device synchronization:
torchcodec/src/torchcodec/decoders/_core/VideoDecoder.cpp, line 117 (commit f4065f1)

Attachment: trace_rank_0.json
I am actually not even sure this code is correct, because under the hood this call may create a new CUDA context, and it is illegal to access memory across CUDA contexts. Somehow the code still works, though.
For newer versions of FFmpeg, you can pass a flag (AV_CUDA_USE_PRIMARY_CONTEXT) when creating the device context so that FFmpeg reuses the CUDA primary context; memory can then be shared with PyTorch code (which initializes its own CUDA context).
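A minimal sketch of what this could look like, assuming a newer FFmpeg where libavutil/hwcontext_cuda.h defines AV_CUDA_USE_PRIMARY_CONTEXT (the function name and fallback behavior here are illustrative, not torchcodec's actual code):

```cpp
// Hedged sketch: create an FFmpeg CUDA hardware device context that, when
// supported, reuses the device's primary CUDA context (the one PyTorch uses)
// instead of creating a fresh context, avoiding a costly device sync.
#include <cstdio>

extern "C" {
#include <libavutil/hwcontext.h>
#include <libavutil/hwcontext_cuda.h>
}

AVBufferRef* createCudaDeviceContext(int deviceIndex) {
  char device[8];
  std::snprintf(device, sizeof(device), "%d", deviceIndex);

  int flags = 0;
#ifdef AV_CUDA_USE_PRIMARY_CONTEXT
  // Newer FFmpeg only (not available in 4.1): retain the primary context via
  // cuDevicePrimaryCtxRetain rather than calling cuCtxCreate.
  flags |= AV_CUDA_USE_PRIMARY_CONTEXT;
#endif

  AVBufferRef* hwDeviceCtx = nullptr;
  int err = av_hwdevice_ctx_create(
      &hwDeviceCtx, AV_HWDEVICE_TYPE_CUDA, device, /*opts=*/nullptr, flags);
  if (err < 0) {
    return nullptr;  // caller handles the error
  }
  return hwDeviceCtx;  // caller releases with av_buffer_unref()
}
```

On FFmpeg builds without the flag, this falls back to the old behavior (a new context per call), which is exactly the case where caching and reusing one context would pay off.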
But for older versions of FFmpeg, like 4.1, we should reuse an existing CUDA context where possible instead of creating a new one.