Make meet in AddressDomain more precise #1468

michael-schwarz · 2024-05-15T13:57:37Z

Currently, in the address domain inside the buckets, for the meets we use ProjectiveSetPairwiseMeet

Lines 190 to 205 in bffc5e3

    
           let meet m1 m2 = 
        
             let meet_buckets b1 b2 acc = 
        
               B.fold (fun e1 acc -> 
        
                   B.fold (fun e2 acc -> 
        
                       if B.may_be_equal e1 e2 then 
        
                         add e1 (add e2 acc) 
        
                       else 
        
                         acc 
        
                     ) b2 acc 
        
                 ) b1 acc 
        
             in 
        
             fold_buckets (fun _ b1 acc -> 
        
                 fold_buckets (fun _ b2 acc -> 
        
                     meet_buckets b1 b2 acc 
        
                   ) m2 acc 
        
               ) m1 (empty ())

where B.may_be_equal delegates to

analyzer/src/cdomain/value/cdomains/addressDomain.ml

Line 180 in bffc5e3

let may_be_equal a b = Option.value (Addr.semantic_equal a b) ~default:true

Addr.semantic_equal given by

analyzer/src/cdomain/value/cdomains/addressDomain.ml

Lines 102 to 111 in bffc5e3

    
           let semantic_equal x y = match x, y with 
        
             | Addr x, Addr y -> Mval.semantic_equal x y 
        
             | StrPtr s1, StrPtr s2 -> SD.semantic_equal s1 s2 
        
             | NullPtr, NullPtr -> Some true 
        
             | UnknownPtr, UnknownPtr 
        
             | UnknownPtr, Addr _ 
        
             | Addr _, UnknownPtr 
        
             | UnknownPtr, StrPtr _ 
        
             | StrPtr _, UnknownPtr -> None 
        
             | _, _ -> Some false

which calls SD.sematic_equal

analyzer/src/cdomain/value/cdomains/stringDomain.ml

Lines 77 to 81 in bffc5e3

    
           let semantic_equal x y = 
        
             match x, y with 
        
             | None, _ 
        
             | _, None -> Some true 
        
             | Some a, Some b -> if a = b then None else Some false

this is needed to handle all sorts of different offsets that may be semantically equal, i.e., evaluate to the same physical address. In this case it keeps both elements.

After some discussion with @jerhard and @hseidl, we reached the following insights:

This behavior is overly conservative, as the mess of the address lattice contains some sub-lattices where the meet operation is semantically correct.

This now checks via a new predicate amenable_to_meet whether for the two addresses a and b this relationship between values and the meet holds. If this is the case, a meet is performed and the element added to the resulting set only if it is distinct from bot.

Currently, we answer amenable_to_meetby true whenever both arguments are strings, or when the both are addresses that differ in the numeric(!) offsets only.

This PR also adds two tests where this yields additional precision.

Closes #1467

… offsets

src/cdomain/value/cdomains/addressDomain_intf.ml

src/domain/disjointDomain.ml

sim642 · 2024-05-20T11:10:54Z

src/cdomain/value/cdomains/addressDomain.ml

@@ -110,6 +110,11 @@ struct
    | StrPtr _, UnknownPtr -> None
    | _, _ -> Some false

+  let amenable_to_meet x y = match x,y with
+    | StrPtr _, StrPtr _ -> true
+    | Addr x, Addr y when Mval.equal (Mval.top_indices x) (Mval.top_indices y) -> true


Don't the address set buckets already guarantee this to only be called in such cases because Mval.top_indices are the representatives?

Even if the buckets guarantee this already, this function can be called with two arbitrary addresses, so we can not just return true.

That's true, although it could just be an implementation detail (like the buckets themselves are), which isn't exposed to the outside by the interface. Then such misuse would be impossible.

I followed this assumption now to just assume that things placed into the same bucket are amenable to meet, which seems to be the case in our implementation and I think also makes sense semantically.

This largely simplifies the implementation and makes all sorts of extra considerations obsolete.

tests/regression/27-inv_invariants/22-meet-ptrs.c

michael-schwarz · 2024-12-15T15:40:00Z

Sorry for being MIA, I will take this up and hopefully get it mergeable before the holidays.

michael-schwarz added 2 commits May 15, 2024 15:49

Typo

7186571

Add amenable_to_meet and test for it

c1b7284

michael-schwarz added bug precision labels May 15, 2024

michael-schwarz changed the title ~~Make meet in AddressDomain more Precise~~ Make meet in AddressDomain more precise May 15, 2024

Make comparison of pointers amenable to meet if they only differ in…

3eff22f

… offsets

michael-schwarz requested a review from sim642 May 15, 2024 14:17

michael-schwarz marked this pull request as ready for review May 15, 2024 14:17

michael-schwarz mentioned this pull request May 15, 2024

Tracking Benchmark Changes for Thesis #1417

Draft

sim642 reviewed May 20, 2024

View reviewed changes

michael-schwarz added 2 commits May 20, 2024 15:06

Restore linebreak for odoc

dc2a9c3

Add more intricate example (with TODO for refinement of both sides)

b7265e7

michael-schwarz self-assigned this Dec 15, 2024

michael-schwarz added 2 commits December 16, 2024 16:21

Merge branch 'master' into issue_1467

3fcb562

For elements in the same bucket, perform meet

8af2e49

michael-schwarz requested a review from sim642 December 16, 2024 16:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make meet in AddressDomain more precise #1468

Make meet in AddressDomain more precise #1468

michael-schwarz commented May 15, 2024 •

edited

Loading

sim642 May 20, 2024

michael-schwarz May 20, 2024

sim642 May 21, 2024

michael-schwarz Dec 16, 2024

michael-schwarz commented Dec 15, 2024

	let meet m1 m2 =
	let meet_buckets b1 b2 acc =
	B.fold (fun e1 acc ->
	B.fold (fun e2 acc ->
	if B.may_be_equal e1 e2 then
	add e1 (add e2 acc)
	else
	acc
	) b2 acc
	) b1 acc
	in
	fold_buckets (fun _ b1 acc ->
	fold_buckets (fun _ b2 acc ->
	meet_buckets b1 b2 acc
	) m2 acc
	) m1 (empty ())

	let semantic_equal x y = match x, y with
	\| Addr x, Addr y -> Mval.semantic_equal x y
	\| StrPtr s1, StrPtr s2 -> SD.semantic_equal s1 s2
	\| NullPtr, NullPtr -> Some true
	\| UnknownPtr, UnknownPtr
	\| UnknownPtr, Addr _
	\| Addr _, UnknownPtr
	\| UnknownPtr, StrPtr _
	\| StrPtr _, UnknownPtr -> None
	\| _, _ -> Some false

	let semantic_equal x y =
	match x, y with
	\| None, _
	\| _, None -> Some true
	\| Some a, Some b -> if a = b then None else Some false

Make meet in AddressDomain more precise #1468

Are you sure you want to change the base?

Make meet in AddressDomain more precise #1468

Conversation

michael-schwarz commented May 15, 2024 • edited Loading

sim642 May 20, 2024

Choose a reason for hiding this comment

michael-schwarz May 20, 2024

Choose a reason for hiding this comment

sim642 May 21, 2024

Choose a reason for hiding this comment

michael-schwarz Dec 16, 2024

Choose a reason for hiding this comment

michael-schwarz commented Dec 15, 2024

michael-schwarz commented May 15, 2024 •

edited

Loading