[CIR][CIRGen] Support __builtin_isinf_sign #1142

PikachuHyA · 2024-11-19T11:53:41Z

This patch adds support for __builtin_isinf_sign. The implementation has several limitations that result in discrepancies between the generated LLVM IR and the expected output.

Firstly, it uses IsFPClass to determine if a value is infinite, whereas the original CGBuiltin implementation used direct comparisons with infinity constants.
Secondly, due to #480, there are numerous unnecessary type conversions occurring, such as converting from i1 to i8 and then back to i1.
Additionally, SignBitOp cannot set the return type to CIR_BoolType, as doing so would lead to failures during the lowering to LLVM IR.

…emission

…upport

…te personality functions While here, cleanup getOrCreateLLVMFuncOp usaga a bit.

Directly erasing the op causes a use after free later on, presumably because the lowering framework isn't aware of the op being deleted. This fixes `clang/test/CIR/CodeGen/pointer-arith-ext.c` with ASAN.

The loop was erasing the user of a value while iterating on the value's users, which results in a use after free. We're already assuming (and asserting) that there's only one user, so we can just access it directly instead. CIR/Transforms/Target/x86_64/x86_64-call-conv-lowering-pass.cpp was failing with ASAN before this change. We're now ASAN-clean except for llvm#829 (which is also in progress).

Reland llvm#638 This was reverted due to llvm#655. I tried to address the problem in the newest commit. The changes of the PR since the last landed one includes: - Move the definition of `cir::CIRGenConsumer` to `clang/include/clang/CIRFrontendAction/CIRGenConsumer.h`, and leave its `HandleTranslationUnit` interface is left empty. So that `cir::CIRGenConsumer` won't need to depend on CodeGen any more. - Change the old definition of `cir::CIRGenConsumer` in `clang/lib/CIR/FrontendAction/CIRGenAction.cpp` and to `CIRLoweringConsumer`, inherited from `cir::CIRGenConsumer`, which implements the original `HandleTranslationUnit` interface. I feel this may improve the readability more even without my original patch.

This PR fixes the lowering for multi dimensional arrays. Consider the following code snippet `test.c`: ``` void foo() { char arr[4][1] = {"a", "b", "c", "d"}; } ``` When ran with `bin/clang test.c -Xclang -fclangir -Xclang -emit-llvm -S -o -`, It produces the following error: ``` ~/clangir/llvm/include/llvm/Support/Casting.h:566: decltype(auto) llvm::cast(const From&) [with To = mlir::ArrayAttr; From = mlir::Attribute]: Assertion `isa<To>(Val) && "cast<Ty>() argument of incompatible type!"' failed. ``` The bug can be traced back to `LoweringHelpers.cpp`. It considers the values in the array as integer types, and this causes an error in this case. This PR updates `convertToDenseElementsAttrImpl` when the array contains string attributes. I have also added one more similar test. Note that in the tests I used a **literal match** to avoid matching as regex, so `!dbg` is useful.

Support expressions at the top level such as const unsigned int n = 1234; const int &r = (const int&)n; Reviewers: bcardosolopes Pull Request: llvm#857

This is to match clang CodeGen

@smeenai

Fix llvm#829 Thanks @smeenai for pointing out the root cause and UBSan failure!

As title. Also introduced buildAArch64NeonCall skeleton, which is partially the counterpart of OG's EmitNeonCall. And this could be use for many other neon intrinsics. --------- Co-authored-by: Guojin He <[email protected]>

… it (llvm#859)

These were uninitialized, which led to intermittent test failures from the use of uninitialized variables. Initialize them to `nullptr` as is done with other member variables that are pointers to fix this. I did a quick spot-check and didn't find other uninitialized variables in the main CGF class itself. Lots of subclasses have uninitialized member variables, but those are presumably expected to be initialized at all points of construction, so we can leave them alone until they cause any issues. `ninja check-clang-cir` now passes with ASan+UBSan and MSan. Fixes llvm#829

See the test for example.

This PR adds aarch64 big endian support. Basically the support for aarch64_be itself is expressed only in two extra cases for the switch statement and changes in the `CIRDataLayout` are needed to prove that we really support big endian. Hence the idea for the test - I think the best way for proof is something connected with bit-fields, so we compare the results of the original codegen and ours.

This PR splits the old `cir-simplify` pass into two new passes, namely `cir-canonicalize` and `cir-simplify` (the new `cir-simplify`). The `cir-canonicalize` pass runs transformations that do not affect CIR-to-source fidelity much, such as operation folding and redundant operation elimination. On the other hand, the new `cir-simplify` pass runs transformations that may significantly change the code and break high-level code analysis passes, such as more aggresive code optimizations. This PR also updates the CIR-to-CIR pipeline to fit these two new passes. The `cir-canonicalize` pass is moved to the very front of the pipeline, while the new `cir-simplify` pass is moved to the back of the pipeline (but still before lowering prepare of course). Additionally, the new `cir-simplify` now only runs when the user specifies a non-zero optimization level on the frontend. Also fixed some typos and resolved some `clang-tidy` complaints along the way. Resolves llvm#827 .

Currently the C style cast is not implemented/supported for unions. This PR adds support for union casts as done in `CGExprAgg.cpp`. I have also added an extra test in `union-init.c`.

Mistakenly closed llvm#850 llvm#850 (review) This PR fixes array initialization for expression arguments. Consider the following code snippet `test.c`: ``` typedef struct { int a; int b[2]; } A; int bar() { return 42; } void foo() { A a = {bar(), {}}; } ``` When ran with `bin/clang test.c -Xclang -fclangir -Xclang -emit-cir -S -o -`, It produces the following error: ``` ~/clangir/clang/lib/CIR/CodeGen/CIRGenExprAgg.cpp:483: void {anonymous}::AggExprEmitter::buildArrayInit(cir::Address, mlir::cir::ArrayType, clang::QualType, clang::Expr*, llvm::ArrayRef<clang::Expr*>, clang::Expr*): Assertion `NumInitElements != 0' failed. ``` The error can be traced back to `CIRGenExprAgg.cpp`, and the fix is simple. It is possible to have an empty array initialization as an expression argument!

As title, if element type of vector type is sized, then the vector type should be deemed sized. This would enable us generate code for neon without triggering assertion

…eon_vrndaq_v (llvm#871) as title. This also added NeonType support for Float32 Co-authored-by: Guojin He <[email protected]>

…::saved_type::save

It will hit another assert when calling initFullExprCleanup.

This PR fixes the case, when a temporary var is used, and `alloca` operation is inserted in the block start before the `label` operation. Implementation: when we search for the `alloca` place in a block, we take label operations into account as well. Fix llvm#870 --------- Co-authored-by: Bruno Cardoso Lopes <[email protected]>

…vm#1107) Before the commit, when flattening if/else clauses - the else body came before the "then" body, as opposed to clang's output order. This commit reverses this and hopefully allows easier comparisson between clang's output and cir's.

This patch implements transformations for VAArg in X86_64 ABI **in shape**. `In shape` means it can't work properly due to the dependent X86_64 ABI is not robust. e.g., when we want to use VAArg with `long double`, we need llvm#1087. This patch literally implement https://github.com/llvm/llvm-project/blob/d233fedfb0de882353c348cd1ac57dab619efa6d/clang/lib/CodeGen/Targets/X86.cpp#L3015-L3240 in CIR. There some differences due to the traditional pipeline are converting AST to LLVM and we're transforming CIR to CIR. And also to get the ABI Info, I moved `X86_64ABIInfo` to the header.

)

…ry (llvm#1111) This PR adds a support for one more case of passing structs by value, with `memcpy` emitted. First of all, don't worry - despite the PR seems big, it's basically consist of helpers + refactoring. Also, there is a minor change in the `CIRBaseBuilder` - I made static the `getBestAllocaInsertPoint` method in order to call it from lowering - we discussed once - and I here we just need it (or copy-paste the code, which doesn't seem good). I will add several comments in order to simplify review.

follow llvm#1033 handle `LongDoubleType` with `FP80Type`.

) As title, this patch refactors raw string literals for (module) attribute names into static methods of `CIRDialect`, following the convention of MLIR.

This PR handles calls with unions passed by value in the calling convention pass. #### Implementation As one may know, data layout for unions in CIR and in LLVM differ one from another. In CIR we track all the union members, while in LLVM IR only the largest one. And here we need to take this difference into account: we need to find a type of the largest member and treat it as the first (and only) union member in order to preserve all the logic from the original codegen. There is a method `StructType::getLargestMember` - but looks like it produces different results (with the one I implemented or better to say copy-pasted). Maybe it's done intentionally, I don't know. The LLVM IR produced has also some difference from the original one. In the original IR `gep` is emitted - and we can not do the same. If we create `getMemberOp` we may fail on type checking for unions - since the first member type may differ from the largest type. This is why we create `bitcast` instead. Relates to the issue llvm#1061

…enBuiltinAArch64.cpp (llvm#1124) We are still seeing crash message like `NYI UNREACHABLE executed at clang/lib/CIR/CodeGen/CIRGenBuiltinAArch64.cpp:3304`, which is not convenient for triaging as our code base changes so fast, line number doesn't help much. So, here we replaced most of `llvm_unreachable("NYI")` with more informative message.

…1125) Currently, the final `target triple` in LLVM IR is set in `CIRGenAction`, which is not executed by cir tools like `cir-translate`. This PR delay its assignment to LLVM lowering, enabling sharing the emitting of `target triple` between different invoking paths.

…ts (llvm#1074) As the title says, this PR adds support for calls with struct types > 128 bits, building upon this [PR](llvm#1068). The idea is gotten from the original Codegen, and I have added a couple of tests.

…bsOp to take vector input (llvm#1099) Extend AbsOp to take vector of int input. With it, we can support __builtin_elementwise_abs. We should in the next PR extend FpUnaryOps to support vector type input so we won't have blocker to implement all elementwise builtins completely. Now just temporarily have missingFeature `fpUnaryOPsSupportVectorType`. Currently, int type UnaryOp support vector type. FYI: [clang's documentation about elementwise builtins](https://clang.llvm.org/docs/LanguageExtensions.html#vector-builtins)

…vm#1102) This is a NFC patch that moves declaration from LowerToLLVM.cpp. The motivation of the patch is, we hope we can use the abilities from MLIR's standard dialects without lowering **ALL** clangir operation to MLIR's standard dialects. For example, currently we have 86 operations in LowerToLLVM.cpp but only 45 operations under though MLIR. It won't be easy to add proper lowering for all operation to **different** dialects. I think the solution may be to allow **mixed** IR. So that we can lowering CIR to MLIR's standard dialects partially and we can use some existing analysis and optimizations in MLIR and then we can lower all of them (the MLIR dialects and unlowered clangir) to LLVM IR. The hybrid IR is one of the goals of MLIR as far as I know. NOTE: I completely understand that the DirectlyLLVM pipeline is the tier-1 pipeline that we want to support. The idea above won't change this. I just want to offer some oppotunities for the downstream projects and finally some chances to improve the overall ecosystem.

…, neon_splatq_lane and neon_splatq_laneq (llvm#1126)

This is going to be raised in follow up work, which is hard to do in one go because createBaseClassAddr goes of the OG skeleton and ideally we want ApplyNonVirtualAndVirtualOffset to work naturally. This also doesn't handle null checks, coming next.

… paths

Now that we fixed the dep on VBase, clean up the rest of the function.

…e BaseClassAddrOp

It was always the intention for `cir.cmp` operations to return bool result. Due to missing constraints, a bug in codegen has slipped in which created `cir.cmp` operations with result type that matches the original AST expression type. In C, as opposed to C++, boolean expression types are "int". This resulted with extra operations being codegened around boolean expressions and their usage. This commit both enforces `cir.cmp` in the op definition and fixes the mentioned bug.

@bcardosolopes

…vm#1135) support `llvm.intr.memset.inline` in llvm-project repo before we add support for `__builtin_memset_inline` in clangir cc @bcardosolopes (cherry picked from commit 30753af)

This patch adds support for `__builtin_isinf_sign`. The implementation has several limitations that result in discrepancies between the generated LLVM IR and the expected output. Firstly, it uses `IsFPClass` to determine if a value is infinite, whereas the original CGBuiltin implementation used direct comparisons with infinity constants. Secondly, due to llvm#480, there are numerous unnecessary type conversions occurring, such as converting from i1 to i8 and then back to i1. Additionally, `SignBitOp` cannot set the return type to `CIR_BoolType`, as doing so would lead to failures during the lowering to LLVM IR.

bcardosolopes and others added 30 commits November 2, 2024 23:30

[CIR][FlattenCFG][NFCI] Exceptions: more generalization for dispatch …

b7de243

…emission

[CIR][FlattenCFG][NFCI] Exceptions: generalize catch dispatch emission

de73c06

[CIR][NFC] Move things around in try-catch-dtors.cpp

d964bb3

[CIR][FlattenCFG] Exceptions: enable many calls / many landing pads s…

04f1f16

…upport

[CIR][LowerToLLVM][NFC] Exceptions: use getOrCreateLLVMFuncOp to crea…

6de2b84

…te personality functions While here, cleanup getOrCreateLLVMFuncOp usaga a bit.

[CIR][Lowering] Erase op through rewriter instead of directly (llvm#853)

3216b39

Directly erasing the op causes a use after free later on, presumably because the lowering framework isn't aware of the op being deleted. This fixes `clang/test/CIR/CodeGen/pointer-arith-ext.c` with ASAN.

[CIR] Exceptions: check LLVM output for more complex dtor order

d377776

[CIR][CodeGen] Support global temporaries

edfffa5

Support expressions at the top level such as const unsigned int n = 1234; const int &r = (const int&)n; Reviewers: bcardosolopes Pull Request: llvm#857

[CIR][CodeGen][NFC] Move GetUndefRValue to the right file

17d8d70

This is to match clang CodeGen

[CIR][CIRGen] Exceptions: lexical scope issue with global initializers

51bbb15

Fix llvm#829 Thanks @smeenai for pointing out the root cause and UBSan failure!

[CIR][CodeGen][NFC] Add TBAAAccessInfo stubbed out and many usages of…

c4e85ea

… it (llvm#859)

[CIR][CodeGen] Stub out an empty CIRGenDebugInfo type

0fc106c

[CIR][CIRGen] Implement Nullpointer arithmatic extension (llvm#861)

49ca6dd

See the test for example.

[CIR][CodeGen] Implement union cast (llvm#867)

83145a4

Currently the C style cast is not implemented/supported for unions. This PR adds support for union casts as done in `CGExprAgg.cpp`. I have also added an extra test in `union-init.c`.

[CIR][CIRGen] Exceptions: unlock nested try/catch support

4e66034

[CIR][CIRGen] Correct isSized predicate for vector type (llvm#869)

73490a3

As title, if element type of vector type is sized, then the vector type should be deemed sized. This would enable us generate code for neon without triggering assertion

[CIR][CIRGen][Builtin][Neon] Lower builtin_neon_vrnda_v and builtin_n…

47d3674

…eon_vrndaq_v (llvm#871) as title. This also added NeonType support for Float32 Co-authored-by: Guojin He <[email protected]>

[CIR][CIRGen] Handle VisitCXXRewrittenBinaryOperator for scalars

7fbe3f4

[CIR][CIRGen][NFC] Cleanups: add skeleton for DominatingValue<RValue>…

ecdaa8d

…::saved_type::save

[CIR][Infra] Run check-clang-cir against any branch based PR (llvm#873)

671564f

[CIR][CIRGen][NFC] Cleanups: add more skeleton to pushFullExprCleanup

f713df2

It will hit another assert when calling initFullExprCleanup.

orbiri and others added 26 commits November 12, 2024 12:15

[CIR][CIRGen][Builtin][Neon] Lower neon_vqdmlal, neon_vqdmlsl (llvm#1104

a6c9820

)

[CIR][CIRGen][Builtin][Neon] Lower neon_vaeseq_u8 (llvm#1112)

e6544b5

[CIR][CIRGen][Builtin] Support __builtin_wmemchr (llvm#1115)

6666565

[CIR][CIRGen] Support __builtin_signbitl (llvm#1117)

0ec8cf3

follow llvm#1033 handle `LongDoubleType` with `FP80Type`.

[CIR][Dialect][NFC] Refactor hardcoded attribute name strings (llvm#1122

16a027a

) As title, this patch refactors raw string literals for (module) attribute names into static methods of `CIRDialect`, following the convention of MLIR.

[CIR][CIRGen][Builtin][Neon] Lower vcvt_f32_v, vcvtq_f32_v (llvm#1120)

c644ee7

[CIR][CIRGen][Builtin][Neon] Lower neon_splat_lane, neon_splat_laneq…

8fbc640

…, neon_splatq_lane and neon_splatq_laneq (llvm#1126)

[CIR][CIRGen][Builtin] Support __builtin___memmove_chk (llvm#1106)

ab9fbcf

[CIR][NFC] Fix unused variable warning

29cb9bc

[CIR][CIRGen] Bring getAddressOfBaseClass a bit closer to OG

a61e202

[CIR][CIRGen][NFC] More unification of virtual and non-virtual offset…

2430c26

… paths

[CIR][CIRGen][NFC] More skeleton conformance

c10f493

Now that we fixed the dep on VBase, clean up the rest of the function.

[CIR][CIRGen] Teach all uses of ApplyNonVirtualAndVirtualOffset to us…

3aed38c

…e BaseClassAddrOp

[cherry-pick][mlir][llvm] Add support for memset.inline (#115711) (ll…

da601b3

…vm#1135) support `llvm.intr.memset.inline` in llvm-project repo before we add support for `__builtin_memset_inline` in clangir cc @bcardosolopes (cherry picked from commit 30753af)

PikachuHyA requested review from lanza and bcardosolopes as code owners November 19, 2024 11:53

smeenai force-pushed the main branch 2 times, most recently from 4aca8d4 to a04cf10 Compare November 23, 2024 06:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CIR][CIRGen] Support __builtin_isinf_sign #1142

[CIR][CIRGen] Support __builtin_isinf_sign #1142

PikachuHyA commented Nov 19, 2024

[CIR][CIRGen] Support __builtin_isinf_sign #1142

Are you sure you want to change the base?

[CIR][CIRGen] Support __builtin_isinf_sign #1142

Conversation

PikachuHyA commented Nov 19, 2024