AMD APU Benchmark (AMD A9-9410 RADEON R5, 5 COMPUTE CORES 2C+3G) #23
Replies: 3 comments 6 replies
-
For such an old GPU it seems to be quite impressive. It looks like 300 GFlops device with 3 comute units and 800MHz clock. It is less than Intel 400GFlops GPU and still it performs ~30% faster than Intel GPU.. Can you run |
Beta Was this translation helpful? Give feedback.
-
Do you think it's possible to crosscompile this to Raspberry Pi Zero? That way we could see how it would run on the most minimum of hardware, and can serve as an estimate to people how much compute and memory they need for their various models. (I'll try the cross compiling thing) OpenCL for raspi: https://github.com/doe300/VC4CL See this link for info: https://kenny-peng.com/2021/09/14/raspi_zero_opencl.html The clinfo output is like this: (copied it from the site above)
|
Beta Was this translation helpful? Give feedback.
-
This is quite problematic, that means most of algorithms wouldn't work as expected. Parallel reductions need different algorithms, GEMM in current implementation would not work. It needs different architecture. It runs (at least once was tested) on Mali and performance wasn't very optimal. So unless centeral algorithms will be tuned/rewritten for this GPU I don't thing this would work. I can't say right now how much work is it to adapt the code to this GPU and restrictions. |
Beta Was this translation helpful? Give feedback.
-
Inference:
Training
System Info:
Linux:
CL Info:
Beta Was this translation helpful? Give feedback.
All reactions