Better hasEdge for Algrebra.Graph #88

nobrakal · 2018-07-02T16:51:35Z

Hi,

I have played again a bit, but I think I have found a better one this time :)
The boost come from two things:

The first one, I had never checked it, but induce (\x -> elem x [u,v]) was not a great idea. Just replacing it by induce (\x -> x == u || x == v) is clearly a better idea:

Clique

	1000
WithEq	70.80 ms
WithElem	103.4 ms

Then, because the equality test was already done in the "induce", I had the idea to store it and pass it to a modified version from ToGraph:

hasEdge :: Eq a => a -> a -> Graph a -> Bool
hasEdge s t = hasEdge' . induce'
    where
     hasEdge' g = case foldg e v o c g of (_, _, r) -> r
       where
         e                             = (False   , False   , False                 )
         v x                           = (x       , not x   , False                 )
         o (xs, xt, xst) (ys, yt, yst) = (xs || ys, xt || yt,             xst || yst)
         c (xs, xt, xst) (ys, yt, yst) = (xs || ys, xt || yt, xs && yt || xst || yst)
     induce' = foldg Empty
                    (\x -> if x == s then Vertex True else if x == t then Vertex False else Empty)
                    (k Overlay)
                    (k Connect)
       where
         k _ x     Empty = x -- Constant folding to get rid of Empty leaves
         k _ Empty y     = y
         k f x     y     = f x y

It is a bit more complicated, but it drops the Ord instance previously required to an Eq one:

Mesh

	1000
MoreComplicated	112.9 μs
AlgaOld	219.7 μs

Clique

	1000
MoreComplicated	59.75 ms
AlgaOld	90.81 ms

SUMMARY:

Alga was the fastest 12 times

ABSTRACT:
(Based on an average of the ratio between largest benchmarks)

Alga was 1.71 times faster than AlgaOld

And this rocks also with alga's clique:

Alga's Clique

	1000
MoreComplicated	20.09 μs
AlgaOld	51.48 μs

nobrakal · 2018-07-02T16:55:52Z

This is obviously related to #84

nobrakal · 2018-07-02T17:30:44Z

Well it worked, but only when dealing with non-loop :p

The corrected version is passing tests and is a bit slower than the faulty one of course.

Mesh

	1000
Working	126.0 μs
AlgaOld	230.8 μs

Clique

	1000
Working	69.36 ms
AlgaOld	97.86 ms

nobrakal · 2018-07-02T17:47:42Z

Bang patterns wasn't useful and does not worked with old GHCs. Thanks travis !

Mesh

	1000
Alga	109.8 μs
AlgaOld	224.9 μs

Clique

	1000
Alga	60.98 ms
AlgaOld	94.53 ms

snowleopard · 2018-07-03T19:53:10Z

@nobrakal What about doing a simple check at the beginning: if x == y then check for a self-loop, otherwise, you a simplified version of hasEdge for definitely different vertices?

nobrakal · 2018-07-04T02:47:38Z

What about doing a simple check at the beginning: if x == y then check for a self-loop,

I don't know if it worth it: the version with tuples (the last one) is almost the same time than my first faulty version, and is working in any cases. It must be a little worse in term of space use (I am storing two booleans instead of one for matching vertex), but I don't think this is an issue ^^.
As we are offering now a hasSelfLoop, if a user now he is searching for a loop, it can definitely use it.

snowleopard · 2018-07-04T10:20:37Z

@nobrakal I am concerned with the complexity of the current solution: it is hard to understand and it's not immediately obvious that it is correct.

By checking for self-loops separately, we can potentially get two benefits:

Simpler code and intuition that we can explain in a comment: we convert a Graph a to a Graph Bool and then look for an edge from False to True.
Potentially better performance in case of self-loops and also in case of non-self-loops, since you no longer need to carry around the tuples in Vertex. But this needs to be benchmarked.

snowleopard · 2018-07-04T13:47:52Z

hasEdge: 0.26 (good)

@nobrakal Wow, is it 4 times faster now?

nobrakal · 2018-07-04T14:41:07Z

@nobrakal Wow, is it 4 times faster now?

It seems yep ^^. Maybe this is a bit overestimated. I have ran the benchs on my machine with bigger graphs, and it is more about 0.40

Just updated this PR using your suggestion. It worth anyway to test the loop presence more than I thought :)

Here:

Alga is the version where we test for a loop
AlgaOld is the version with a tuple of Booleans

Mesh

	1000
Alga	98.31 μs
AlgaOld	143.2 μs

Clique

	1000
Alga	75.22 ms
AlgaOld	84.08 ms

SUMMARY:

Alga was the fastest 8 times

There was 4 ex-aequo

ABSTRACT:
(Based on an average of the ratio between largest benchmarks)

Alga was 1.26 times faster than AlgaOld

Please note that the benchs were run on another machine than the previous one, so I think trying to compare them with my previous comments does not make many sense.

snowleopard · 2018-07-04T17:11:46Z

@nobrakal I have another (very low-level) optimisation in mind, but I think we can try it separately :)

Could you do the same change to NonEmpty?

Can this be used in ToGraph, which I guess needs hasSelfLoop as well?

nobrakal · 2018-07-04T18:56:28Z

@nobrakal I have another (very low-level) optimisation in mind, but I think we can try it separately :)

Haha, you awaken my curiosity ^^

Could you do the same change to NonEmpty?

I think I have already done it ^^: 9111701

Can this be used in ToGraph, which I guess needs hasSelfLoop as well?

It has hasSelfLoop = hasEdge x x

I think it heavily depends on the removing of empty leaves by induce...

nobrakal · 2018-07-05T18:15:59Z

Oops you were right, I just added the corresponding change for Algebra.Graph.NonEmpty

snowleopard · 2018-07-06T15:30:15Z

@nobrakal Thanks! Does this optimisation make sense for Algebra.Graph.Fold?

snowleopard · 2018-07-17T21:47:45Z

@nobrakal Looks like there are now some conflicts.

…Edge

nobrakal · 2018-07-19T14:49:29Z

Using the still-hot NFData instance of Fold, I found that the optimization makes sense for it :)
The benchs, Alga being the state of this PR and AlgaOld the current state of the master branch

hasEdge

Description: Test if the given edge is in the graph (with arguments both in the graph and not in the graph (where applicable))

Clique

	1000
Alga	70.79 ms
AlgaOld	797.4 ms

Mesh

	1000
Alga	262.8 μs
AlgaOld	906.3 μs

SUMMARY:

Alga was the fastest 12 times

ABSTRACT:
(Based on an average of the ratio between largest benchmarks)

Alga was 5.27 times faster than AlgaOld

hasSelfLoop

Description: Test if the given self-loop is in the graph (with arguments both in the graph and not in the graph (where applicable))

Clique

	1000
Alga	56.15 ms
AlgaOld	780.0 ms

Mesh

	1000
Alga	183.1 μs
AlgaOld	896.9 μs

SUMMARY:

Alga was the fastest 8 times

ABSTRACT:
(Based on an average of the ratio between largest benchmarks)

Alga was 7.24 times faster than AlgaOld

snowleopard · 2018-07-19T16:16:42Z

@nobrakal Great! But what happened with Travis non-empty instance?

https://travis-ci.org/snowleopard/alga/jobs/405842227

nobrakal · 2018-07-19T16:53:12Z

My bad, my script is very sensitive (maybe too much) to any change in then benchmark suite.
It is updated and should work, sorry !

snowleopard · 2018-07-19T20:37:19Z

@nobrakal Why do we no longer have changes to non-empty graphs? Did your commit get lost?

nobrakal · 2018-07-19T20:42:53Z

I totally forget about this PR while doing #94 so I blindly added the optimization to it.

So it is already in place for Algebra.Graph.NonEmpty

snowleopard · 2018-07-19T20:46:47Z

@nobrakal Aha, I see! Thanks, merged :)

RocketPowered

54100e4

nobrakal added 2 commits July 2, 2018 19:13

Working amelioration

0eeebc5

Remove TODO comment

3b791bb

Remove bang

3050195

nobrakal mentioned this pull request Jul 3, 2018

Replacing Algebra.Graph by Maybe (Algebra.Graph.NonEmpty) #90

Open

Add optimization for NonEmptyGraph

0a21b21

Even better hasEdge, testing for loop

9111701

Add optimizations for NonEmptyGraph

922126e

nobrakal mentioned this pull request Jul 6, 2018

Add NFData instances #54

Closed

nobrakal added 3 commits July 18, 2018 10:04

merge master

ca3fc07

Merge branch 'master' of https://github.com/snowleopard/alga into has…

216ea8b

…Edge

Better hasEdge for Fold

c20977c

snowleopard merged commit 22c6d25 into snowleopard:master Jul 19, 2018

nobrakal deleted the hasEdge branch July 25, 2018 12:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better hasEdge for Algrebra.Graph #88

Better hasEdge for Algrebra.Graph #88

nobrakal commented Jul 2, 2018

nobrakal commented Jul 2, 2018

nobrakal commented Jul 2, 2018

nobrakal commented Jul 2, 2018

snowleopard commented Jul 3, 2018

nobrakal commented Jul 4, 2018

snowleopard commented Jul 4, 2018

snowleopard commented Jul 4, 2018

nobrakal commented Jul 4, 2018

snowleopard commented Jul 4, 2018

nobrakal commented Jul 4, 2018 •

edited

Loading

nobrakal commented Jul 5, 2018

snowleopard commented Jul 6, 2018

snowleopard commented Jul 17, 2018

nobrakal commented Jul 19, 2018

snowleopard commented Jul 19, 2018

nobrakal commented Jul 19, 2018

snowleopard commented Jul 19, 2018

nobrakal commented Jul 19, 2018 •

edited

Loading

snowleopard commented Jul 19, 2018

Better hasEdge for Algrebra.Graph #88

Better hasEdge for Algrebra.Graph #88

Conversation

nobrakal commented Jul 2, 2018

Clique

Mesh

Clique

Alga's Clique

nobrakal commented Jul 2, 2018

nobrakal commented Jul 2, 2018

Mesh

Clique

nobrakal commented Jul 2, 2018

Mesh

Clique

snowleopard commented Jul 3, 2018

nobrakal commented Jul 4, 2018

snowleopard commented Jul 4, 2018

snowleopard commented Jul 4, 2018

nobrakal commented Jul 4, 2018

Mesh

Clique

snowleopard commented Jul 4, 2018

nobrakal commented Jul 4, 2018 • edited Loading

nobrakal commented Jul 5, 2018

snowleopard commented Jul 6, 2018

snowleopard commented Jul 17, 2018

nobrakal commented Jul 19, 2018

hasEdge

Clique

Mesh

hasSelfLoop

Clique

Mesh

snowleopard commented Jul 19, 2018

nobrakal commented Jul 19, 2018

snowleopard commented Jul 19, 2018

nobrakal commented Jul 19, 2018 • edited Loading

snowleopard commented Jul 19, 2018

nobrakal commented Jul 4, 2018 •

edited

Loading

nobrakal commented Jul 19, 2018 •

edited

Loading