Align 'linalg-to-xegpu' pass with patched XeGPU dialect #201

dchigarev · 2024-07-30T15:57:25Z

Closes #192

This PR updates the linalg-to-xegpu pass to make it compatible with the xegpu-to-vc-func pass from IMEX.

The PR also adds a simple e2e test for the linalg->xegpu->gpu exe pipeline.

dchigarev · 2024-07-31T14:54:41Z

lib/gc/Transforms/GPU/LinalgToXeGPU.cpp

-        loc, vecLoadType, tile, vnniAxisAttr, transpose,
+        loc, vecLoadType, tile, packedAttr, transpose, transpose_bit,


vnniAxis -> packedAttr: instead of specifying a VNNI axis (0 or 1), the load now takes a "packed" attribute, which is equivalent to vnni_axis=0.

transpose_bit: allows transposing data while loading. It isn't used by this lowering pass.

dchigarev · 2024-07-31T14:56:56Z

lib/gc/Transforms/GPU/LinalgToXeGPU.cpp

@@ -1057,7 +1076,7 @@ static LogicalResult createDPASKernel(linalg::LinalgOp linalgOp,

  // Load A sub-tiles.
  SmallVector<Value> loadVecA =
-      loadNdDescTiles(rewriter, loc, tilesA, readCacheHint, vnniConfA);


vnniConfA can't be used during loading since vnniAxis=1 is no longer supported. However, we still need this config to compute the proper tiles for xegpu.dpas later in the code.

dchigarev · 2024-08-01T16:19:00Z

test/mlir/test/gc/Transforms/GPU/linalg-to-xegpu-dpas.mlir

-// CHECK: %[[tC:.+]] = xegpu.update_nd_offset %[[rootC]], [0, 0]
+// CHECK: %[[tC:.+]] = xegpu.update_nd_offset %[[rootC]], [%c0, %c0]


IMEX doesn't support constant offsets, so the offsets are passed as SSA values instead (see intel/mlir-extensions#815).

dchigarev · 2024-08-01T16:20:24Z

test/mlir/test/gc/Transforms/GPU/linalg-to-xegpu-dpas.mlir

@@ -63,7 +63,7 @@ func.func @matmul(%arg0: memref<32x32xf16>, %arg1: memref<32x32xf16>, %arg2: mem

 // Extract DPAS-sized chunks from larger loaded tile A.
 // Tile B is already in the correct shape.
-// CHECK:   %[[vA_flat:.+]] = vector.shape_cast %[[vA]] : vector<32x8x2xf16> to vector<512xf16>
+// CHECK:   %[[vA_flat:.+]] = vector.shape_cast %[[vA]] : vector<32x16xf16> to vector<512xf16>


We no longer load the A matrix with vnni_axis=1 (see packed_attr), so the loaded tile keeps its plain 32x16 shape.
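The shape change in the CHECK line above can be illustrated with a small standalone sketch. It does not use MLIR or the pass itself; `vnniPackedShape` and `numElements` are hypothetical helpers, and the packing factor of 2 for f16 is an assumption (two 16-bit elements per 32-bit word). It only shows that the packed and plain tiles hold the same 512 elements, which is why the flattened `vector<512xf16>` type is unchanged.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <numeric>
#include <vector>

// Hypothetical helper (not part of the pass): shape of a rows x cols tile
// after VNNI packing along `axis` with the given packing factor.
//   axis == 0: [rows, cols] -> [rows / factor, cols, factor]
//   axis == 1: [rows, cols] -> [rows, cols / factor, factor]
std::vector<int64_t> vnniPackedShape(int64_t rows, int64_t cols, int axis,
                                     int64_t factor) {
  assert(axis == 0 || axis == 1);
  const int64_t packedDim = (axis == 0) ? rows : cols;
  assert(factor > 0 && packedDim % factor == 0);
  if (axis == 0)
    return {rows / factor, cols, factor};
  return {rows, cols / factor, factor};
}

// Total number of elements described by a shape.
int64_t numElements(const std::vector<int64_t> &shape) {
  return std::accumulate(shape.begin(), shape.end(), int64_t{1},
                         std::multiplies<int64_t>());
}
```

With the assumed factor of 2, `vnniPackedShape(32, 16, 1, 2)` gives the old `32x8x2` shape, while the plain load keeps `32x16`; both contain 512 elements.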

Menooker · 2024-08-02T03:00:30Z

The IMEX changes are merged in Menooker:dev.

dchigarev · 2024-08-02T14:47:49Z

cmake/imex.cmake

@@ -8,8 +8,8 @@ if (NOT DEFINED IMEX_INCLUDES)

    # TODO: Change to main https://github.com/intel/mlir-extensions when all the
    # required functionality is merged.
-    gc_fetch_content(imex 496b240093b5e132b60c5ee69878300fe69be300 https://github.com/Menooker/mlir-extensions
-            SET IMEX_CHECK_LLVM_VERSION=ON IMEX_ENABLE_L0_RUNTIME=0
+    gc_fetch_content(imex d5bbd635dee500b8cff138686833bacfac5ade78 https://github.com/Menooker/mlir-extensions


Updated to the latest commit in the dev branch.

dchigarev · 2024-08-05T15:02:49Z

lib/gc/ExecutionEngine/OpenCLRuntime/OpenCLRuntimeWrappers.cpp

-  cl_platform_id platform; // OpenCL platform
-  cl_device_id device;     // device ID
-  CL_SAFE_CALL(clGetPlatformIDs(1, &platform, NULL));
-  CL_SAFE_CALL(clGetDeviceIDs(platform, *devtype, 1, &device, NULL));
-  return device;


The old logic searched for a device of the requested type on only one platform (and couldn't find any GPU on my machine). I rewrote the logic to iterate over all available platforms and return the first suitable device.
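The search order described here can be sketched without the OpenCL headers. This is only an illustration of the control flow, not the code from this PR: `Platform`, `Device`, `DeviceType`, `findFirstDevice` and `requireDevice` are hypothetical stand-ins. In the real wrapper the outer loop would run over the IDs returned by `clGetPlatformIDs` and the inner lookup would be a `clGetDeviceIDs` call per platform.

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical stand-ins for cl_device_type, cl_device_id and cl_platform_id.
enum class DeviceType { CPU, GPU };

struct Device {
  int id;
  DeviceType type;
};

struct Platform {
  std::vector<Device> devices;
};

// Walk every platform in order and return the first device of the requested
// type, instead of giving up after the first platform.
std::optional<Device> findFirstDevice(const std::vector<Platform> &platforms,
                                      DeviceType wanted) {
  for (const Platform &platform : platforms)
    for (const Device &device : platform.devices)
      if (device.type == wanted)
        return device;
  return std::nullopt; // no platform exposes a matching device
}

// Variant for callers that treat a missing device as a hard error.
Device requireDevice(const std::vector<Platform> &platforms,
                     DeviceType wanted) {
  std::optional<Device> device = findFirstDevice(platforms, wanted);
  assert(device.has_value() && "no device of the requested type found");
  return *device;
}
```

On a machine where the first platform exposes only a CPU and the second exposes the GPU, a single-platform lookup fails, while this search returns the GPU from the second platform.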

dchigarev force-pushed the fix-linalg-to-xe branch from 616c9e1 to 716af02 Compare July 31, 2024 14:48

dchigarev commented Jul 31, 2024

dchigarev marked this pull request as ready for review July 31, 2024 15:06

dchigarev force-pushed the fix-linalg-to-xe branch from aeada62 to 435b520 Compare July 31, 2024 15:29

dchigarev added 2 commits July 31, 2024 15:34

Aling 'linalg-to-xegpu' pass with patched XeGPU dialect

964398e

Signed-off-by: dchigarev <[email protected]>

add simple e2e test

2778459

Signed-off-by: dchigarev <[email protected]>

dchigarev force-pushed the fix-linalg-to-xe branch from 435b520 to 2778459 Compare July 31, 2024 15:34

dchigarev requested review from AndreyPavlenko, Menooker and kurapov-peter August 1, 2024 08:11

kurapov-peter approved these changes Aug 1, 2024

dchigarev added 3 commits August 1, 2024 16:16

Fix tests

0f25517

Signed-off-by: dchigarev <[email protected]>

fix tests

05aa8d6

Signed-off-by: dchigarev <[email protected]>

fix tests

829b9d4

Signed-off-by: dchigarev <[email protected]>

dchigarev force-pushed the fix-linalg-to-xe branch from f78f6d2 to 829b9d4 Compare August 1, 2024 16:16

dchigarev commented Aug 1, 2024

cmake/imex.cmake (outdated)

dchigarev commented Aug 1, 2024

dchigarev added 2 commits August 2, 2024 13:42

Merge remote-tracking branch 'origin/main' into fix-linalg-to-xe

3660cdc

fix imex build

52eb013

Signed-off-by: dchigarev <[email protected]>

dchigarev commented Aug 2, 2024

dchigarev added 5 commits August 5, 2024 09:51

distinct between ENABLE and USE IMEX

48914ac

Signed-off-by: dchigarev <[email protected]>

Merge branch 'main' into fix-linalg-to-xe

2f5561c

Merge remote-tracking branch 'origin/main' into fix-linalg-to-xe

8184f5d

remove l0 runtime

be7fdf0

Signed-off-by: dchigarev <[email protected]>

fix formatting

a94205a

Signed-off-by: dchigarev <[email protected]>

dchigarev commented Aug 5, 2024

kurapov-peter added the ready to review label Aug 5, 2024

Merge remote-tracking branch 'origin/main' into fix-linalg-to-xe

4cf3457

dchigarev mentioned this pull request Aug 6, 2024

Make linalg->xegpu->gpu_exe pipeline working #193

Closed

5 tasks

kurapov-peter requested a review from LongshengDu August 6, 2024 10:02

Menooker approved these changes Aug 7, 2024

LongshengDu approved these changes Aug 7, 2024

LongshengDu mentioned this pull request Aug 7, 2024

[GPU] Add MLP test and linalg.fill lowering in 'linalg-to-xegpu' #220

Merged

kurapov-peter merged commit dd1a80d into intel:main Aug 7, 2024
4 checks passed
