Process gc preserve #58

udesou · 2024-06-11T05:43:41Z

This PR processes objects from "gc_preserve" regions separately, ensuring they're transitively pinned while the remaining objects rooted by application code are only pinned.
Note that the initial focus is on correctness and that the implementation should change to lower the code inside LLVM rather than calling into C using the hook functions (jl_gc_preserve_begin_hook and jl_gc_preserve_end_hook) for each preserve region.

…s from them

qinsoon · 2024-06-11T23:56:05Z

src/llvm-alloc-opt.cpp

@@ -53,26 +53,6 @@ STATISTIC(RemovedGCPreserve, "Total number of GC preserve instructions removed")

 namespace {

-static void removeGCPreserve(CallInst *call, Instruction *val)


Why is this function removed? The compiler removes the GC preserve calls if the values to be preserved are removed, or moved to the stack allocation. I don't know why we want to remove this optimization. Furthermore, I am concerned that in those cases where GC preserve is kept, the preserved value is replaced with a null pointer -- preserving a null pointer sounds meaningless.

I think I misunderstood the code twice.
(1) I thought I was deleting the gc preserve after it had been lowered, ie., after the necessary objects arguments to preserve_begin have been pushed to the shadow stack, but I don't think that is the case.
(2) Since the if case only covered pass.gc_preserve_begin_func, I thought I'd end up with dangling hooks for the preserve_end and that would cause problems with the push and pop hook functions, but the matching pop functions are also removed.
Can you explain what you meant by "moved to stack allocation"? I'm just trying to understand whether removing those cases could cause any correctness issue.

Can you explain what you meant by "moved to stack allocation"?

If I understand right, this optimization tries to change heap allocation to stack allocation if possible.

julia/src/llvm-alloc-opt.cpp

Line 554 in 88fea47

void Optimizer::moveToStack(CallInst *orig_inst, size_t sz, bool has_ref)

If a value x that was heap allocated and is replaced with stack allocation by the optimization, all of its uses need to be fixed with replace_inst.

julia/src/llvm-alloc-opt.cpp

Line 636 in 88fea47

auto replace_inst = [&] (Instruction *user) {

If we have gc_preserve(x), replace_inst either removes the gc_preserve (removeGCPreserve()), or replace x with the alloca buff on the stack.

julia/src/llvm-alloc-opt.cpp

Line 656 in 88fea47

if (pass.gc_preserve_begin_func == callee) {

If you remove the code that checks the gc_preserve_begin_func case, it will fall into the default case.

julia/src/llvm-alloc-opt.cpp

Line 678 in 88fea47

Value *replace = has_ref ? (Value*)buff : Constant::getNullValue(orig_i->getType());

It either replaces x with the alloca buff (which is fine), or replaces x with null. Then you may see null pointers in the preserve begin hook, and you will need to deal with null pointers in the hook.

I don't think it causes correctness issues, but doing a gc preserve call for null pointers is meaningless.

Got it! In that case, it might actually be problematic if we remove that particular gc preserve. The reason being the fact that we need to transitively pin everything in that object's transitive closure, independently of that object itself being allocated in the heap or in the stack.

If we have gc_preserve(x), replace_inst either removes the gc_preserve (removeGCPreserve()), or replace x with the alloca buff on the stack.

Okay, then it should be fine, since the gc preserve is removed only if it doesn't have anything referring to it (has_ref == 0, which I believe would also be considering any possible c call).

qinsoon · 2024-06-12T00:32:55Z

src/mmtk-gc.c

+#define jl_p_tpin_gcstack (jl_current_task->tpin_gcstack)
+
+#define JL_GC_PUSHARGS_TPIN_ROOT_OBJS(rts_var,n)                                                        \
+  rts_var = ((jl_value_t**)malloc(((n)+2)*sizeof(jl_value_t*)))+2;                                      \


These need to be clarified.

Why do you need to use malloc instead of alloca in

julia/src/julia.h

Line 982 in d907a06

rts_var = ((jl_value_t**)alloca(((n)+2)*sizeof(jl_value_t*)))+2; \

Why do you need a separate tpin_gcstack in the task? Obviously you can put tpinned roots in the normal gcstack?

If JL_GC_PUSHARGS_TPIN_ROOT_OBJS and tpin_gcstack is only used by gc preserve, probably just call them gc preserve frames or something. Calling them tpin is more confusing, as you can clearly push tpin roots to the normal stack and use existing JL_GC_PUSH.

Add some comments so we know why it is implemented like this.

qinsoon · 2024-06-20T23:58:15Z

src/llvm-late-gc-lowering.cpp

+                args.insert(args.begin(), ConstantInt::get(T_size, nargs));
+
+                ArrayRef<Value*> args_llvm = ArrayRef<Value*>(args);
+                builder.CreateCall(getOrDeclare(jl_well_known::GCPreserveBeginHook), args_llvm );


GCPreserveBeginHook is only compiled when MMTK_GC is set, but this code here is executed for all the builds. The stock build would fail in this case. Same for GCPreserveEndHook below.

Fixed. It turns out that the stock build was already broken because of a file not being compiled in Makefile, but I've fixed that too and now both builds (stock and MMTk) should work fine.

This PR ports #58 to `dev`. This PR is mostly the same as #58 except that 1. this PR does not remove transitive pinning of shadow stack roots (we know it is unsound to remove the transitive pinning at this stage), and 2. this PR includes minor refactoring for GC codegen interface.

Luis Eduardo de Souza Amorim and others added 4 commits May 9, 2024 00:06

Processing gc preserve regions differently, transitively pinning root…

fe1772d

…s from them

Merge branch 'v1.9.2+RAI' into feature/process-gc-preserve

4b36001

Fixing merge conflict

480b89d

Cleanup

ed66d4b

github-actions bot added port-to-master port-to-v1.10 port-to-v1.9 labels Jun 11, 2024

Removing code that fails llvm assertion

0558e64

udesou requested a review from qinsoon June 11, 2024 10:05

qinsoon reviewed Jun 12, 2024

View reviewed changes

udesou added 2 commits June 20, 2024 05:37

Restore optimisation that removes GCPreserve calls

c4c04d3

Clarifying code about pushing objects as gc preserve roots

1b3d0de

qinsoon reviewed Jun 20, 2024

View reviewed changes

Fixing stock build

83d551b

qinsoon approved these changes Jun 24, 2024

View reviewed changes

Skip gc-page-profiler.c when using mmtk

01f2f24

udesou merged commit 5c9b370 into mmtk:v1.9.2+RAI Jun 24, 2024
1 check passed

qinsoon mentioned this pull request Nov 25, 2024

Add GC preserve hook #70

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Process gc preserve #58

Process gc preserve #58

udesou commented Jun 11, 2024

qinsoon Jun 11, 2024

udesou Jun 20, 2024

qinsoon Jun 20, 2024

qinsoon Jun 20, 2024

udesou Jun 20, 2024 •

edited

Loading

qinsoon Jun 12, 2024

qinsoon Jun 20, 2024

udesou Jun 24, 2024

		@@ -53,26 +53,6 @@ STATISTIC(RemovedGCPreserve, "Total number of GC preserve instructions removed")

		namespace {

		static void removeGCPreserve(CallInst call, Instruction val)

Process gc preserve #58

Process gc preserve #58

Conversation

udesou commented Jun 11, 2024

qinsoon Jun 11, 2024

Choose a reason for hiding this comment

udesou Jun 20, 2024

Choose a reason for hiding this comment

qinsoon Jun 20, 2024

Choose a reason for hiding this comment

qinsoon Jun 20, 2024

Choose a reason for hiding this comment

udesou Jun 20, 2024 • edited Loading

Choose a reason for hiding this comment

qinsoon Jun 12, 2024

Choose a reason for hiding this comment

qinsoon Jun 20, 2024

Choose a reason for hiding this comment

udesou Jun 24, 2024

Choose a reason for hiding this comment

udesou Jun 20, 2024 •

edited

Loading