Skip to content

Latest commit

 

History

History
22 lines (13 loc) · 612 Bytes

README.md

File metadata and controls

22 lines (13 loc) · 612 Bytes

Goto-Style DGEMM in C

Demonstration of Goto-style DGEMM (Double-precision General Matrix Multiply) in C with 80% efficiency on AMD Opteron Processor 6136 without using assembly.

The purpose of this code is to show that it is very easy to write fairly efficient code in C (good algorithm + a bit of compiler hack) without using assembly. It uses GCC and SSE2 intrinsics.

It is not intended to provide clean portable implementation of DGEMM.

Efficiency

Efficiency

Algorithm

Goto-Style DGEMM

Developer

  • Yutaka Tsutano at University of Nebraska-Lincoln.