float conversion emulation routines #2985

sjoerdmeijer · 2024-08-14T17:55:59Z

I see several floating-point conversion routines, for example this float32 to float16 helper function:

Line 77 in 3070f88

static inline float16 cpu_float2half_rn(float f) {

But most modern AArch64 CPUs (Armv8.2a and up) and I believe x86 too have native support for FP16, and have different instructions for up and down converts. I believe that whole function can be replaced with just one FCVT instruction. The different rounding modes should be supported too.

excelle08 · 2024-08-22T22:48:53Z

I think the cpu_float2half_rn function is a reference implementation that intentionally implement the algorithm manually. Currently we rely on the compiler to do the optimized CPU float conversion (see line 222 and 232) if the compiler has fp16 data type extension and the CPU supports native fp16 conversion.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

float conversion emulation routines #2985

float conversion emulation routines #2985

sjoerdmeijer commented Aug 14, 2024

excelle08 commented Aug 22, 2024

float conversion emulation routines #2985

float conversion emulation routines #2985

Comments

sjoerdmeijer commented Aug 14, 2024

excelle08 commented Aug 22, 2024