Commit
This should be a best-case scenario: literally just an element-wise FMA. Runtime drops by almost 30%, from 107µs to 77µs with arch=native. It's *cool*, yes, but the code quality degrades so much that I wonder if it's even worth it. Then factor in the complex 3-dimensional deinterleave that will be needed to use it properly. I still have to test that, of course, but I suspect it will eat what little perf I gain. I also have AVX512, so targets limited to AVX2 or older will probably fare even worse.
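For reference, a minimal sketch of what "element-wise FMA" means here. The commit's actual code and language are not shown, so the function name `fma_elementwise`, the `float` element type, and the choice of C are all assumptions for illustration only.

```c
#include <stddef.h>

/* Hypothetical helper (not from the commit): out[i] = a[i] * b[i] + c[i].
 * Written as a plain scalar loop. With optimization enabled and a native
 * target, a compiler may auto-vectorize it and may contract the multiply
 * and add into fused multiply-add instructions where the CPU supports them. */
void fma_elementwise(const float *a, const float *b, const float *c,
                     float *out, size_t n)
{
    for (size_t i = 0; i < n; ++i) {
        out[i] = a[i] * b[i] + c[i];
    }
}
```

A loop like this has no cross-element dependencies and reads contiguous memory, which is why it is close to a best case for SIMD. Data that first needs deinterleaving adds shuffle work that this sketch does not account for.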
Showing 2 changed files with 17 additions and 8 deletions.