[Now with more rules] Lifting rules (lift let, lift if, if-of-if) #953

acl33 · 2021-07-15T20:23:58Z

This addresses AB#19240, adding a bunch of rules for lifting let(-binds) and if(-conditions) and binds, roughly of the form

(foo a (if p x y) b)  ===>  (if p (foo a x b) (foo a y b))
(foo a (let (x e) body))  ===> (let (x e) (foo a body))

These are quite complex rules with numerous side conditions, for example let-variable escape ((let (x e1) (let (y e2) body)) ==> (let (y e2) (let (x e1) body)) only if x is not free in e2).

In particular we allow lifting a computation out of one arm of an If only when we know that evaluating the computation before/without testing the if condition, cannot raise an exception (that would change program behaviour). For now an extremely conservative check is used, and this contrasts with RLO which allowed lifting any computation, for example RLO would have allowed:

(if p (let (x (segfault now)) x) 0) ===> (let (x (segfault now)) (if p x 0))

This raises some subtle cases, which I've tried to handle via extra rules "lift_let_over_if_both", "lift_let_over_if_true_dominated", and "lift_let_over_if_false_dominated", but this is only partially successful: there are still cases where RLO would have lifted an expression but ksc will not, where the lifting is safe (regardless of the expression) because the expression is guaranteed to execute anyway. Specifically:

(let (x (maybe throw)) (...... (if p (let (y (maybe throw)) body) e_false) .....))

The ksc rewrites here will only allow lifting in the case where the ...... is empty, but this is safe even when ..... is non-empty. As part of One Theorem Prover, some kind of "let-pushing-down" (let-dropping?) rule will be needed here, not in this PR.

simonpj · 2021-07-16T14:40:43Z

There is something very unsatisfactory about imposing extra rules about floating things out of conditionals, as you rightly identify here.

One possible approach is to be explicit about evidence. Instead of

if (x <= n) (...index arr x...) (...blah...)

we might think of

case (x <= n) of
    No -> blah
    Yes (ev :: Proof) -> ....(index ev arr x) ...

If the test on (x <= n) is successful, it yields a "proof object", here ev that witnesses that claim. We pass that proof to index. So now we can't float index ev arr x out of the branch of the conditional, because then ev won't be in scope.

(One can imagine more fancy things, where instead of a uniform Proof type, we have Proof (x<=n) recording what it's a proof of. But that's very much a (fairly sophisticated) refinement.)

I wonder if something along these lines might be a more principled way of dealing with these seg-fault-y things?

awf · 2021-07-19T13:53:47Z

Thanks Simon! @acl33 I'll update the top comment to include some "index" examples.

And then, let's try to work through the "evidence" plan. I wonder if something like this can help?

we don't do bounds checking on index because it might be slow.
so be explicit about that; rewrite (index i a) -> (assert (lt i (size a)) (index$throws i a))
then we have more careful rules about moving asserts around. E.g.
- (assert p e) <-> (if p e (assert False))
- (if p (assert p e) f)-> (if p e f)
and assume most things can commute with if unless marked $throws?

Notice that assert is one of our few control-flow constructs (that's why it's explicitly in the AST, like if).

So then we might see (artist's impression, probably wrong)

(let (n (size arr))
  (if (x <= n) (index arr x) 7)
->
(let (n (size arr))
 (if (x <= n) (assert (x < (size arr)) (index$throws arr x) 7)
->
 (if (x <= (size arr)) (assert (x < (size arr)) (index$throws arr x) 7)
->
 (if (x <= (size arr)) (index$throws  arr x) 7)

awf · 2021-07-19T13:56:22Z

So yes to the specialist rules and OTP; and yes to figuring out how we do evidence etc.

simonpj · 2021-07-19T14:43:16Z

Notice that assert is one of our few control-flow constructs

I would prefer to turn it into a dataflow construct, by making the assert (and some conditionals) produce an evidence value (aka proof object, erased at runtime) that we can pass to index and similar friends.

acl33 · 2021-07-19T17:42:51Z

@simonpj perhaps the two approaches are not so different - suppose the assert produces the evidence which index$throws requires? As you say, that prevents lifting the index$throws out of the assert, yet without ever making the end-user-programmer write this evidence thing.

And, we've now defined that our index is checked (raises an AssertionError, rather than a HardwareIJustLaunchedTheNukeError), too. In other compilers index$throws is called index_unsafe or similar (without such a thorough handling of evidence). The computations that can't be lifted are then a small set including assert and any functions, like index, that have rewrites to an assert+unsafe form. index_unsafe can be lifted; and asserts are dealt with by four extra rules:
(if p (assert p t) f) <==> (if p t f) <==> (if p t (assert (not p) f))

I think both true and false arms of the if need to be given appropriate evidence? e.g.
(let (n (size x)) (if (gt i n) -1 (index i x))) to return a default value of -1 for out-of-range. But we still need to define how the evidence is passed - is the body of assert and each arm of an if now a lambda with an evidence parameter?

Also for
(tuple (index i v) (if p (index i v) f)), i.e. where we don't necessarily know the inner index is safe, merely that the exception would be thrown anyway - do we somehow need to make the inner index use the same evidence as the outer index, rather than its own evidence? (or maybe we can CSE the two evidences themselves?)

simonpj · 2021-07-19T19:48:40Z

suppose the assert produces the evidence which index$throws requires?

Yes, that's the idea. As you say, though, we need to say how the evidence is produced. Something like this, in Haskell

data BoolWithProof = TrueWith Proof | FalseWith Proof

eqInt :: (Int, Int) -> BoolWith Proof
inRange :: (Int,Int,Int) -> BoolWithProof   -- inRange (lo, i, hi) returns (TrueWith p) iff lo <= i < hi
index :: (Int,Proof, Array a) -> a

Now we can say something like

case inRange (lo, i, hi) of
   FalseWith _ -> throw-exception
   TrueWith p -> .....(index (i, p, arr))....

Here, nothing stops us using the wrong Proof with the wrong array; and nothing stops us using the wrong array bounds. But we then can be sure that we won't lift failing operation outside its guard rails.

We don't yet have case-expressions, or indeed sum types, in ksc. But I think we are going to want them.

acl33 · 2021-07-20T11:21:06Z

src/python/ksc/rewrites.py

@@ -320,30 +320,32 @@ def __init__(self, rule: Rule, side_conditions=lambda **substs: True):
    def possible_filter_terms(self):
        return frozenset([get_filter_term(self._rule.template)])

+    # This is an instance method to allow overriding in subclass(es)


@cgravill this is a consequence of merging with your #921. The subclass ParsedLetLifter uses the same matches_for_possible_expr but for each Match, must mutate the substs encoded in the apply_rewrite lambda before passing onto the superclass. So I moved the local method apply out here - any other ideas?

It still seems like a good improvement, even after being hoisted. I'll try to take a look to see if there's a way to keep it more local.

"It still seems like a good improvement" - yes absolutely :). Thanks!

awf · 2021-07-20T18:21:01Z

I have not been consistent over the naming here, and I should be: in some places "lift_bind" is used, and in others "lift_let".

(foo a (let (x e) body)) ===> (let (x e) (foo a body))

For me, it is quite clear that the "let" is lifted. i,e. "lift_let" is a good name.

acl33 · 2021-07-21T13:28:39Z

Naming standardized on Lift_let / lift_if, not lift_bind.

awf · 2021-07-22T08:56:12Z

src/python/ksc/rewrites_ast.py

+                 (build n (lam (i : Integer) (if p t f)))
+                 (if (gt n 0) (if p (build n (lam (i : Integer) t))
+                                    (build n (lam (i : Integer) f)))
+                              (build 0 (lam (i : Integer) (if p t f)))))""",


Done, and comments added inside rule.

I've not added the Tuple Integer Integer variant due to complexities in the comparison gt n 0. This needs either

plumbing in a symtab (from prelude.ks?) with a mul (Tuple Integer Integer) to use in a condition like (gt (mul (get$1$2 n) (get$2$2 n)) 0). I was puzzled how the 1d rule was working without a symtab, but I find gt is defined as primitive:

knossos-ksc/src/python/ksc/prim.py

Lines 126 to 128 in 11155b4

if n == 2 and name in ("eq", "ne", "lt", "gt", "lte", "gte"):

assert tys[0] == tys[1] # TODO: MOVEEQ Move eq to prelude

return Type.Bool

however I don't expect this would be correct for 2d (presumably lexicographic??).

If the rule uses the prelude then does it really belong in rewrites_ast.py rather than rewrites_prelude.py? (Or rewrites_prim.py, as build is primitive rather than AST!)

adding "product" to prim.py (result type presumably either float or int depending on the argument)

some rather grotesque condition like if (if (gt (get$1$2 n) 0) (gt (get$2$2 n) 0) False) .... as a way of "and"ing two bools

acl33 · 2021-07-22T21:22:34Z

We don't yet have case-expressions, or indeed sum types, in ksc. But I think we are going to want them.

Let's move to #972; it seems possible to do this without sum-types, and (according to taste) even without new expressions.

acl33 · 2021-08-25T17:34:18Z

Do we want to abandon this in favour of an "if-with-evidence" approach (some discussion in #972), merge it, or something else?

I think if-with-evidence allows us to lift some functions out of ifs, if we have the evidence. But if we can't do that, the conservative position has to be not to allow lifting out of if, as here?

awf · 2021-08-27T14:09:58Z

I think it's fine to merge the safe rules here, and then add the evidence-based ones when decided.

acl33 added 8 commits June 15, 2021 11:00

WIP all passing except lift_over_build

3a153ea

Merge remote-tracking branch 'origin/master' into acl33/lift_rules3

3d91770

Fix/merge with Match.apply Callable

c193f87

Tweak comment + assert

8de3f5c

can_lift_arg => liftable_arg_type

0de4de2

Fix build tests; test can_evaluate; misc fixes to lifting rules

25da074

Renames, comments

c5045bf

Comment why LiftOverCall

8e73548

acl33 added 3 commits July 17, 2021 12:26

move comments; tweak Const(0.0 => 0)

dc808c4

Add extra rules

74ce745

lift_bind => lift_let, inc. comment

5e38233

acl33 added 6 commits July 19, 2021 21:43

Comment

c775879

Correct assert condition

ab048b2

Comment lift_if_over_build; use n in f_body

168f7c1

Test+fix ParsedLetLifter broken in merge with #921

f66e9da

Add test of lifting from both arms of if

6e29c51

Lifting rules preserve type

1cdb3ab

acl33 changed the title ~~[RFC - may need more rules] Lifting rules (lift let, lift if, if-of-if)~~ [Now with more rules] Lifting rules (lift let, lift if, if-of-if) Jul 20, 2021

acl33 commented Jul 20, 2021

View reviewed changes

acl33 mentioned this pull request Jul 21, 2021

[RFC] New_bind rule, also cse_bind, inline_let #965

Merged

awf reviewed Jul 22, 2021

View reviewed changes

acl33 mentioned this pull request Jul 22, 2021

Use dataflow to control which functions are safe to lift out of "if" #972

Open

acl33 added 2 commits August 31, 2021 13:28

Merge remote-tracking branch 'origin/master' into acl33/lift_rules3

08aafc7

Comments, Const(0.0) => Const(0)

9ef6a73

acl33 merged commit 83a32a1 into master Aug 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Now with more rules] Lifting rules (lift let, lift if, if-of-if) #953

[Now with more rules] Lifting rules (lift let, lift if, if-of-if) #953

acl33 commented Jul 15, 2021 •

edited

Loading

simonpj commented Jul 16, 2021

awf commented Jul 19, 2021 •

edited

Loading

awf commented Jul 19, 2021

simonpj commented Jul 19, 2021

acl33 commented Jul 19, 2021 •

edited

Loading

simonpj commented Jul 19, 2021 •

edited

Loading

acl33 Jul 20, 2021

cgravill Jul 20, 2021

acl33 Jul 20, 2021 •

edited

Loading

awf commented Jul 20, 2021 •

edited

Loading

acl33 commented Jul 21, 2021

awf Jul 22, 2021

acl33 Jul 22, 2021 •

edited

Loading

acl33 commented Jul 22, 2021

acl33 commented Aug 25, 2021

awf commented Aug 27, 2021

	if n == 2 and name in ("eq", "ne", "lt", "gt", "lte", "gte"):
	assert tys[0] == tys[1] # TODO: MOVEEQ Move eq to prelude
	return Type.Bool

[Now with more rules] Lifting rules (lift let, lift if, if-of-if) #953

[Now with more rules] Lifting rules (lift let, lift if, if-of-if) #953

Conversation

acl33 commented Jul 15, 2021 • edited Loading

simonpj commented Jul 16, 2021

awf commented Jul 19, 2021 • edited Loading

awf commented Jul 19, 2021

simonpj commented Jul 19, 2021

acl33 commented Jul 19, 2021 • edited Loading

simonpj commented Jul 19, 2021 • edited Loading

acl33 Jul 20, 2021

Choose a reason for hiding this comment

cgravill Jul 20, 2021

Choose a reason for hiding this comment

acl33 Jul 20, 2021 • edited Loading

Choose a reason for hiding this comment

awf commented Jul 20, 2021 • edited Loading

acl33 commented Jul 21, 2021

awf Jul 22, 2021

Choose a reason for hiding this comment

acl33 Jul 22, 2021 • edited Loading

Choose a reason for hiding this comment

acl33 commented Jul 22, 2021

acl33 commented Aug 25, 2021

awf commented Aug 27, 2021

acl33 commented Jul 15, 2021 •

edited

Loading

awf commented Jul 19, 2021 •

edited

Loading

acl33 commented Jul 19, 2021 •

edited

Loading

simonpj commented Jul 19, 2021 •

edited

Loading

acl33 Jul 20, 2021 •

edited

Loading

awf commented Jul 20, 2021 •

edited

Loading

acl33 Jul 22, 2021 •

edited

Loading