Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lowering Through LLVM #26

Merged
merged 26 commits into from
Nov 1, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
48f8909
add poly_to_llvm test file
j2kun Oct 24, 2023
a54f23f
add starter poly-to-llvm pipeline
j2kun Oct 30, 2023
225952a
add convert-func-to-llvm
j2kun Oct 30, 2023
a2a2500
add arith-to-llvm
j2kun Oct 30, 2023
160df60
add elementwise-to-linalg
j2kun Oct 30, 2023
4bcf28a
add back arith-to-llvm
j2kun Oct 30, 2023
c8fa178
add tensor-to-linalg
j2kun Oct 30, 2023
484edda
add do-nothing linalg-to-loops pass
j2kun Oct 30, 2023
3f536bc
add scf-to-cf and cf-to-llvm passes
j2kun Oct 30, 2023
4f12694
update upstream MLIR commit
j2kun Oct 30, 2023
fa6d030
fix conj pattern broken by mlir update
j2kun Oct 30, 2023
d2e0a72
migrate use of constFoldBinaryOp
j2kun Oct 30, 2023
ae33086
fix conflict caused by new function overload
j2kun Oct 30, 2023
0ea7a77
move arith-to-llvm to the end of the pass
j2kun Oct 30, 2023
cf54412
add one-shot bufferization pass
j2kun Oct 30, 2023
c74ab3e
move func-to-llvm after bufferization, enabling linalg-to-loops
j2kun Oct 30, 2023
a3cee1e
memref-expand-strided-metadata and finalize-memref-to-llvm
j2kun Oct 30, 2023
a9f9456
move func-to-llvm even later
j2kun Oct 30, 2023
24ac606
add canonicalization cleanup
j2kun Oct 30, 2023
e0e6ea3
encode end-to-end compilation pipeline as a test
j2kun Oct 30, 2023
29ead0b
add a poly.eval-specific end-to-end test
j2kun Oct 31, 2023
bc47670
ensure llc uses the same PositionIndependentExecutable option as clang
j2kun Oct 31, 2023
8422826
fix the off-by-one error in the lowering of eval
j2kun Oct 31, 2023
7660ef0
udpate cmake build to use same LLVM commit
j2kun Oct 31, 2023
d39dcdb
update subproject commit
j2kun Oct 31, 2023
1902439
add project_src_dir so tests work with both bazel and cmake
j2kun Oct 31, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 4 additions & 2 deletions .github/workflows/build_and_test_cmake.yml
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ jobs:
with:
path: |
./externals/llvm-project
key: ${{ runner.os }}-norm-${{ hashFiles('**/CMakeLists.txt') }}
key: ${{ runner.os }}-cmake-${{ hashFiles('bazel/import_llvm.bzl') }}-${{ hashFiles('**/CMakeLists.txt') }}

- name: Git config
run: |
Expand All @@ -36,8 +36,10 @@ jobs:
- name: Build LLVM
if: steps.cache-llvm.outputs.cache-hit != 'true'
run: |
LLVM_COMMIT=$(grep LLVM_COMMIT ${GITHUB_WORKSPACE}/bazel/import_llvm.bzl | head -n 1 | cut -d'"' -f 2 )
git submodule update --init --recursive
cd externals/llvm-project
git checkout ${LLVM_COMMIT}
mkdir build && cd build
cmake -G Ninja ../llvm -DLLVM_ENABLE_PROJECTS=mlir -DLLVM_BUILD_EXAMPLES=ON -DLLVM_ENABLE_ASSERTIONS=ON -DCMAKE_BUILD_TYPE=Release -DLLVM_ENABLE_RTTI=ON -DLLVM_TARGETS_TO_BUILD="host"
cmake --build . --target check-mlir
Expand All @@ -50,4 +52,4 @@ jobs:
cmake --build . --target MLIRMulToAddPasses
cmake --build . --target mlir-headers
cmake --build . --target tutorial-opt
cmake --build . --target check-mlir-tutorial
cmake --build . --target check-mlir-tutorial
4 changes: 2 additions & 2 deletions bazel/import_llvm.bzl
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,8 @@ load(
def import_llvm(name):
"""Imports LLVM."""

# June 5, 2023
LLVM_COMMIT = "cd5fcea6d4c70a7328ca9538c9098d9f5af69682"
# 2023-10-30
LLVM_COMMIT = "896749aa0d420ae573255a64a349bc2a76cfed37"

new_git_repository(
name = name,
Expand Down
2 changes: 1 addition & 1 deletion externals/llvm-project
8 changes: 5 additions & 3 deletions lib/Conversion/PolyToStandard/PolyToStandard.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -48,7 +48,7 @@ struct ConvertAdd : public OpConversionPattern<AddOp> {
ConversionPatternRewriter &rewriter) const override {
arith::AddIOp addOp = rewriter.create<arith::AddIOp>(
op.getLoc(), adaptor.getLhs(), adaptor.getRhs());
rewriter.replaceOp(op.getOperation(), {addOp});
rewriter.replaceOp(op.getOperation(), addOp);
return success();
}
};
Expand All @@ -64,7 +64,7 @@ struct ConvertSub : public OpConversionPattern<SubOp> {
ConversionPatternRewriter &rewriter) const override {
arith::SubIOp subOp = rewriter.create<arith::SubIOp>(
op.getLoc(), adaptor.getLhs(), adaptor.getRhs());
rewriter.replaceOp(op.getOperation(), {subOp});
rewriter.replaceOp(op.getOperation(), subOp);
return success();
}
};
Expand Down Expand Up @@ -149,6 +149,8 @@ struct ConvertEval : public OpConversionPattern<EvalOp> {
auto lowerBound =
b.create<arith::ConstantOp>(b.getIndexType(), b.getIndexAttr(1));
auto numTermsOp = b.create<arith::ConstantOp>(b.getIndexType(),
b.getIndexAttr(numTerms));
auto upperBound = b.create<arith::ConstantOp>(b.getIndexType(),
b.getIndexAttr(numTerms + 1));
auto step = lowerBound;

Expand All @@ -163,7 +165,7 @@ struct ConvertEval : public OpConversionPattern<EvalOp> {
auto accum =
b.create<arith::ConstantOp>(b.getI32Type(), b.getI32IntegerAttr(0));
auto loop = b.create<scf::ForOp>(
lowerBound, numTermsOp, step, accum.getResult(),
lowerBound, upperBound, step, accum.getResult(),
[&](OpBuilder &builder, Location loc, Value loopIndex,
ValueRange loopState) {
ImplicitLocOpBuilder b(op.getLoc(), builder);
Expand Down
4 changes: 2 additions & 2 deletions lib/Dialect/Poly/PolyOps.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -16,12 +16,12 @@ OpFoldResult ConstantOp::fold(ConstantOp::FoldAdaptor adaptor) {
}

OpFoldResult AddOp::fold(AddOp::FoldAdaptor adaptor) {
return constFoldBinaryOp<IntegerAttr, APInt>(
return constFoldBinaryOp<IntegerAttr, APInt, void>(
adaptor.getOperands(), [&](APInt a, APInt b) { return a + b; });
}

OpFoldResult SubOp::fold(SubOp::FoldAdaptor adaptor) {
return constFoldBinaryOp<IntegerAttr, APInt>(
return constFoldBinaryOp<IntegerAttr, APInt, void>(
adaptor.getOperands(), [&](APInt a, APInt b) { return a - b; });
}

Expand Down
4 changes: 2 additions & 2 deletions lib/Dialect/Poly/PolyPatterns.td
Original file line number Diff line number Diff line change
Expand Up @@ -6,8 +6,8 @@ include "mlir/Dialect/Complex/IR/ComplexOps.td"
include "mlir/IR/PatternBase.td"

def LiftConjThroughEval : Pat<
(Poly_EvalOp $f, (ConjOp $z)),
(ConjOp (Poly_EvalOp $f, $z))
(Poly_EvalOp $f, (ConjOp $z, $fastmath)),
(ConjOp (Poly_EvalOp $f, $z), $fastmath)
>;

def HasOneUse: Constraint<CPred<"$_self.hasOneUse()">, "has one use">;
Expand Down
4 changes: 2 additions & 2 deletions lib/Transform/Arith/MulToAdd.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ struct PowerOfTwoExpand : public OpRewritePattern<MulIOp> {
MulIOp newMul = rewriter.create<MulIOp>(op.getLoc(), lhs, newConstant);
AddIOp newAdd = rewriter.create<AddIOp>(op.getLoc(), newMul, newMul);

rewriter.replaceOp(op, {newAdd});
rewriter.replaceOp(op, newAdd);
rewriter.eraseOp(rhsDefiningOp);

return success();
Expand Down Expand Up @@ -79,7 +79,7 @@ struct PeelFromMul : public OpRewritePattern<MulIOp> {
MulIOp newMul = rewriter.create<MulIOp>(op.getLoc(), lhs, newConstant);
AddIOp newAdd = rewriter.create<AddIOp>(op.getLoc(), newMul, lhs);

rewriter.replaceOp(op, {newAdd});
rewriter.replaceOp(op, newAdd);
rewriter.eraseOp(rhsDefiningOp);

return success();
Expand Down
4 changes: 4 additions & 0 deletions tests/BUILD
Original file line number Diff line number Diff line change
Expand Up @@ -6,12 +6,16 @@ filegroup(
testonly = True,
data = [
"//tests:lit.cfg.py",
"//tests:poly_to_llvm_main.c",
"//tools:tutorial-opt",
"@llvm-project//clang:clang",
"@llvm-project//llvm:FileCheck",
"@llvm-project//llvm:count",
"@llvm-project//llvm:llc",
"@llvm-project//llvm:not",
"@llvm-project//mlir:mlir-cpu-runner",
"@llvm-project//mlir:mlir-opt",
"@llvm-project//mlir:mlir-translate",
"@mlir_tutorial_pip_deps_lit//:pkg",
],
)
Expand Down
5 changes: 5 additions & 0 deletions tests/lit.cfg.py
Original file line number Diff line number Diff line change
Expand Up @@ -40,3 +40,8 @@
+ ":"
+ os.environ["PATH"]
)

substitutions = {
"%project_source_dir": runfiles_dir.joinpath(Path('mlir_tutorial')),
}
config.substitutions.extend(substitutions.items())
3 changes: 2 additions & 1 deletion tests/lit.cmake.cfg.py
Original file line number Diff line number Diff line change
Expand Up @@ -25,6 +25,7 @@

config.substitutions.append(("%PATH%", config.environment["PATH"]))
config.substitutions.append(("%shlibext", config.llvm_shlib_ext))
config.substitutions.append(("%project_source_dir", config.project_source_dir))

llvm_config.with_system_environment(["HOME", "INCLUDE", "LIB", "TMP", "TEMP"])

Expand All @@ -49,4 +50,4 @@
"tutorial-opt"
]

llvm_config.add_tool_substitutions(tools, tool_dirs)
llvm_config.add_tool_substitutions(tools, tool_dirs)
2 changes: 1 addition & 1 deletion tests/lit.cmake.site.cfg.py.in
Original file line number Diff line number Diff line change
Expand Up @@ -10,4 +10,4 @@ import lit.llvm
lit.llvm.initialize(lit_config, config)

# Let the main config do the real work.
lit_config.load_config(config, "@PROJECT_SOURCE_DIR@/tests/lit.cmake.cfg.py")
lit_config.load_config(config, "@PROJECT_SOURCE_DIR@/tests/lit.cmake.cfg.py")
16 changes: 16 additions & 0 deletions tests/poly_to_llvm.mlir
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
// RUN: tutorial-opt --poly-to-llvm %s | mlir-translate --mlir-to-llvmir | llc --relocation-model=pic -filetype=obj > %t
// RUN: clang -c %project_source_dir/tests/poly_to_llvm_main.c
// RUN: clang poly_to_llvm_main.o %t -o a.out
// RUN: ./a.out | FileCheck %s

// CHECK: 351
func.func @test_poly_fn(%arg : i32) -> i32 {
%tens = tensor.splat %arg : tensor<10xi32>
%input = poly.from_tensor %tens : tensor<10xi32> -> !poly.poly<10>
%0 = poly.constant dense<[2, 3, 4]> : tensor<3xi32> : !poly.poly<10>
%1 = poly.add %0, %input : !poly.poly<10>
%2 = poly.mul %1, %1 : !poly.poly<10>
%3 = poly.sub %2, %input : !poly.poly<10>
%4 = poly.eval %3, %arg: (!poly.poly<10>, i32) -> i32
return %4 : i32
}
12 changes: 12 additions & 0 deletions tests/poly_to_llvm_eval.mlir
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
// RUN: tutorial-opt --poly-to-llvm %s | mlir-translate --mlir-to-llvmir | llc --relocation-model=pic -filetype=obj > %t
// RUN: clang -c %project_source_dir/tests/poly_to_llvm_main.c
// RUN: clang poly_to_llvm_main.o %t -o eval_test.out
// RUN: ./eval_test.out | FileCheck %s

// CHECK: 9
func.func @test_poly_fn(%arg : i32) -> i32 {
// 2 + 3x + 4x^2 evaluated at x=1, should be 2+3+4
%input = poly.constant dense<[2, 3, 4]> : tensor<3xi32> : !poly.poly<3>
%0 = poly.eval %input, %arg: (!poly.poly<3>, i32) -> i32
return %0 : i32
}
12 changes: 12 additions & 0 deletions tests/poly_to_llvm_main.c
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
#include <stdio.h>

// This is the function we want to call from LLVM
int test_poly_fn(int x);

int main(int argc, char *argv[]) {
int i = 1;
int result = test_poly_fn(i);
printf("Result: %d\n", result);

return 0;
}
3 changes: 2 additions & 1 deletion tests/poly_to_standard.mlir
Original file line number Diff line number Diff line change
Expand Up @@ -82,10 +82,11 @@ func.func @test_lower_mul(%0 : !poly.poly<10>, %1 : !poly.poly<10>) -> !poly.pol
// CHECK-LABEL: test_lower_eval
// CHECK-SAME: (%[[poly:.*]]: [[T:tensor<10xi32>]], %[[point:.*]]: i32) -> i32 {
// CHECK: %[[c1:.*]] = arith.constant 1 : index
// CHECK: %[[c10:.*]] = arith.constant 10 : index
// CHECK: %[[c11:.*]] = arith.constant 11 : index
// CHECK: %[[accum:.*]] = arith.constant 0 : i32
// CHECK: %[[loop:.*]] = scf.for %[[iv:.*]] = %[[c1]] to %[[c11]] step %[[c1]] iter_args(%[[iter_arg:.*]] = %[[accum]]) -> (i32) {
// CHECK: %[[coeffIndex:.*]] = arith.subi %c11, %[[iv]]
// CHECK: %[[coeffIndex:.*]] = arith.subi %[[c10]], %[[iv]]
// CHECK: %[[mulOp:.*]] = arith.muli %[[point]], %[[iter_arg]]
// CHECK: %[[nextCoeff:.*]] = tensor.extract %[[poly]][%[[coeffIndex]]]
// CHECK: %[[next:.*]] = arith.addi %[[mulOp]], %[[nextCoeff]]
Expand Down
10 changes: 10 additions & 0 deletions tools/BUILD
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,17 @@ cc_binary(
"//lib/Transform/Affine:Passes",
"//lib/Transform/Arith:Passes",
"@llvm-project//mlir:AllPassesAndDialects",
"@llvm-project//mlir:ArithToLLVM",
"@llvm-project//mlir:BufferizationPipelines",
"@llvm-project//mlir:BufferizationTransforms",
"@llvm-project//mlir:ControlFlowToLLVM",
"@llvm-project//mlir:FuncToLLVM",
"@llvm-project//mlir:LinalgTransforms",
"@llvm-project//mlir:MemRefToLLVM",
"@llvm-project//mlir:MemRefTransforms",
"@llvm-project//mlir:MlirOptLib",
"@llvm-project//mlir:Pass",
"@llvm-project//mlir:SCFToControlFlow",
"@llvm-project//mlir:TensorToLinalg",
],
)
50 changes: 50 additions & 0 deletions tools/tutorial-opt.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -2,11 +2,57 @@
#include "lib/Dialect/Poly/PolyDialect.h"
#include "lib/Transform/Affine/Passes.h"
#include "lib/Transform/Arith/Passes.h"
#include "mlir/include/mlir/Conversion/ArithToLLVM/ArithToLLVM.h"
#include "mlir/include/mlir/Conversion/ControlFlowToLLVM/ControlFlowToLLVM.h"
#include "mlir/include/mlir/Conversion/FuncToLLVM/ConvertFuncToLLVMPass.h"
#include "mlir/include/mlir/Conversion/SCFToControlFlow/SCFToControlFlow.h"
#include "mlir/include/mlir/Conversion/TensorToLinalg/TensorToLinalgPass.h"
#include "mlir/include/mlir/Dialect/Bufferization/Pipelines/Passes.h"
#include "mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.h"
#include "mlir/include/mlir/Dialect/Linalg/Passes.h"
#include "mlir/include/mlir/InitAllDialects.h"
#include "mlir/include/mlir/InitAllPasses.h"
#include "mlir/include/mlir/Pass/PassManager.h"
#include "mlir/include/mlir/Pass/PassRegistry.h"
#include "mlir/include/mlir/Tools/mlir-opt/MlirOptMain.h"
#include "mlir/include/mlir/Transforms/Passes.h"
//

void polyToLLVMPipelineBuilder(mlir::OpPassManager &manager) {
// Poly
manager.addPass(mlir::tutorial::poly::createPolyToStandard());
manager.addPass(mlir::createCanonicalizerPass());

manager.addPass(mlir::createConvertElementwiseToLinalgPass());
manager.addPass(mlir::createConvertTensorToLinalgPass());

// One-shot bufferize, from
// https://mlir.llvm.org/docs/Bufferization/#ownership-based-buffer-deallocation
mlir::bufferization::OneShotBufferizationOptions bufferizationOptions;
bufferizationOptions.bufferizeFunctionBoundaries = true;
manager.addPass(
mlir::bufferization::createOneShotBufferizePass(bufferizationOptions));
mlir::bufferization::BufferDeallocationPipelineOptions deallocationOptions;
mlir::bufferization::buildBufferDeallocationPipeline(manager, deallocationOptions);

manager.addPass(mlir::createConvertLinalgToLoopsPass());

// Needed to lower memref.subview
manager.addPass(mlir::memref::createExpandStridedMetadataPass());

manager.addPass(mlir::createConvertSCFToCFPass());
manager.addPass(mlir::createConvertControlFlowToLLVMPass());
manager.addPass(mlir::createArithToLLVMConversionPass());
manager.addPass(mlir::createConvertFuncToLLVMPass());
manager.addPass(mlir::createFinalizeMemRefToLLVMConversionPass());
manager.addPass(mlir::createReconcileUnrealizedCastsPass());

// Cleanup
manager.addPass(mlir::createCanonicalizerPass());

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there a particular reason for adding mlir::createCanonicalizerPass() two times? There's one on the top which I think was for the passes we had created and there's this one which to my understanding should be dealing with the canonicalization of the other upstream passes we added to this pipeline.

I tried to remove the first one and just keep this last one here and the binary still works correctly (just that there is some difference in the instructions being generated in the end).

Copy link
Owner Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general canonicalization must not change the behavior of the program (or else it would be buggy). It will only make things more efficient. Canonicalization patterns apply to particular ops in particular dialects, and higher-level canonicalization patterns tend to remove more unnecessary work than applying lower-level canonicalization patterns, since they capture a larger breadth of the overall program. Running canonicalization after every pass would be even better. But the reason we don't is because we expect it to do nothing, and it adds to the total runtime.

manager.addPass(mlir::createSCCPPass());
manager.addPass(mlir::createCSEPass());
manager.addPass(mlir::createSymbolDCEPass());
}

int main(int argc, char **argv) {
mlir::DialectRegistry registry;
Expand All @@ -20,6 +66,10 @@ int main(int argc, char **argv) {
// Dialect conversion passes
mlir::tutorial::poly::registerPolyToStandardPasses();

mlir::PassPipelineRegistration<>("poly-to-llvm",
"Run passes to lower the poly dialect to LLVM",
polyToLLVMPipelineBuilder);

return mlir::asMainReturnCode(
mlir::MlirOptMain(argc, argv, "Tutorial Pass Driver", registry));
}