fix: query_simplified_user_actions and add test cases #15

dtgoitia · 2018-07-23T21:11:33Z

Main change

With this fix, the function query_simplified_user_actions returns all the simplified actions related to a specific playback (given the playback ID). Here, simplified means that only the last upvote or downvote action of each user is returned, together with the skip, if there is one. If there are no actions, an empty list is returned.
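The rule described above can be sketched in plain Python. This only illustrates the intended behaviour and is not the SQL in mosbot/query.py: `simplify_actions` is a hypothetical helper, the input is assumed to be a chronologically ordered list of dicts with `user_id` and `action` keys, and keeping every skip row is an assumption.

```python
from typing import Dict, List, Set


def simplify_actions(actions: List[dict]) -> List[dict]:
    """Keep each user's last upvote/downvote plus any skip, in original order.

    Hypothetical helper: `actions` is assumed to be sorted by timestamp.
    """
    last_vote_index: Dict[int, int] = {}  # user_id -> index of that user's latest vote
    keep: Set[int] = set()
    for index, action in enumerate(actions):
        if action['action'] == 'skip':
            keep.add(index)  # skips are always kept
        else:
            last_vote_index[action['user_id']] = index  # a later vote replaces an earlier one
    keep.update(last_vote_index.values())
    return [actions[index] for index in sorted(keep)]


history = [
    {'user_id': 1, 'action': 'upvote'},
    {'user_id': 2, 'action': 'upvote'},
    {'user_id': 1, 'action': 'downvote'},
    {'user_id': 2, 'action': 'skip'},
]
simplified = simplify_actions(history)
```

In this example user 1's earlier upvote is dropped in favour of the later downvote, while user 2 keeps both the upvote and the skip.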

Other changes

Replace tabs with spaces
Add a comment to a function. @txomon, please review.

Close #2

txomon · 2018-08-01T20:33:29Z

Some comments to this PR:

Tests are failing because flake8 is complaining.
There are changes unrelated to the tabs-to-spaces conversion (Python will fail if you mix them); this looks like some kind of linter misconfiguration. I suggest you look at the autoformatter you are using, as flake8 wasn't complaining before and is not now, therefore we will run into this issue in the future.
The test case is now too complex to follow. Instead of looping a given number of times, I suggest you parametrize it with all the entries that should be inserted into the useractions table.
The query seems overly complicated, so we will refactor it once the other points are resolved.

coveralls · 2018-08-04T23:25:02Z

Pull Request Test Coverage Report for Build 58

1 of 1 (100.0%) changed or added relevant line in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 100.0%

Totals
Change from base Build 57:	0.0%
Covered Lines:	508
Relevant Lines:	508

💛 - Coveralls

txomon

Some changes TBD

txomon · 2018-08-09T20:07:55Z

tests/test_query.py

+    ([
+        {'user': 1, 'action': 'upvote'},
+    ], {
+        0: Action.upvote.name,


Don't use a dict, and don't use .name, it should be an object!

txomon · 2018-08-09T20:10:46Z

tests/test_query.py

-                {'dtid': '1234', 'username': 'username'},
-                {'id': 1, 'dtid': '1234', 'username': 'username', 'country': None},
-        ),
+    (


Formatting shouldn't be changed here

txomon · 2018-08-09T20:11:46Z

tests/test_query.py

+
+    for action in user_action:
+        if action['user'] is None:
+            user = await user_generator()


Just create this statically, like 3 users and that's it

dtgoitia · 2018-08-09

If we create the users statically, we still need to somehow map the action['user'] value to the corresponding user (user1, user2...). Isn't that too verbose?

    user1 = await user_generator()
    user2 = await user_generator()
    user3 = await user_generator()
    user4 = await user_generator()
    if action['user'] == 1:
        user = user1
    elif action['user'] == 2:
        user = user2
    elif action['user'] == 3:
        user = user3
    elif action['user'] == 4:
        user = user4

txomon · 2018-08-09

Note that user_action_generator just needs a dict with an 'id' key, so you only need to call it like this:

await user_action_generator(user={'id': action['user_id']}, ...)

dtgoitia · 2018-08-11

I was actually looking for that, but I got confused by the function signature (def user_action_generator(db_conn):). I missed that all the extra arguments are passed down to the generate_user_action function via *... #$@&%*!
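The pattern behind this confusion can be sketched as follows. The names user_action_generator and generate_user_action come from the thread, but both bodies here are hypothetical stand-ins: the real helper writes to the database, and the real factory is presumably a pytest fixture. The sketch only shows why the outer signature takes db_conn alone while the returned callable accepts the keyword arguments.

```python
import asyncio


async def generate_user_action(*, user, playback, action, conn):
    # Hypothetical stand-in: the real helper would insert a row using `conn`.
    return {'user_id': user['id'], 'playback_id': playback['id'], 'action': action}


def user_action_generator(db_conn):
    # The outer function only receives the connection...
    async def generator(**kwargs):
        # ...and the inner coroutine forwards every keyword argument unchanged.
        return await generate_user_action(conn=db_conn, **kwargs)
    return generator


async def demo():
    generate = user_action_generator(db_conn=None)  # no real connection in this sketch
    return await generate(user={'id': 1}, playback={'id': 7}, action='upvote')


created = asyncio.run(demo())
```

Because only the `id` key of `user` is read, a plain `{'id': ...}` dict is enough and no user objects need to be created up front.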

txomon · 2018-08-09T21:42:32Z

mosbot/query.py

    async with ensure_connection(conn) as conn:
        result = []
        async for user_action in await conn.execute(query):
            result.append(dict(user_action))
-        return result
+        return result


No newline at the end of the file...

txomon

I have some doubts, but overall looks good

txomon · 2018-08-22T21:56:47Z

tests/test_query.py

+        await user_action_generator(user={'id': action['user_id']}, playback=playback, action=action['action'])
+
+    simplified_user_action = await query_simplified_user_actions(playback_id=playback['id'], conn=db_conn)
+    actual_result = {i: user_action['action'] for i, user_action in enumerate(simplified_user_action)}


I don't understand why you compare a dict instead of the result list directly. Why are you transforming the result to check it?
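A minimal sketch of the direct comparison asked for here. The `Action` enum below is a hypothetical stand-in for the project's own (only `Action.upvote` appears in the diff), and the rows are made-up data standing in for the result of query_simplified_user_actions; whether the query yields enum members or strings is an assumption.

```python
import enum


class Action(enum.Enum):
    # Hypothetical stand-in for the project's Action enum.
    upvote = 'upvote'
    downvote = 'downvote'
    skip = 'skip'


# Made-up rows standing in for the query result.
simplified_user_actions = [
    {'user_id': 1, 'action': Action.downvote},
    {'user_id': 2, 'action': Action.skip},
]

# Expected values are a list of enum members: no index-keyed dict, no `.name`.
expected = [Action.downvote, Action.skip]
actual = [row['action'] for row in simplified_user_actions]
matches = actual == expected
```

Comparing lists also checks the order of the rows, which an index-keyed dict only does indirectly.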

txomon · 2018-08-22T22:02:44Z

mosbot/query.py

+                        left join user_action ua on p.id = ua.playback_id
+                        left join track t on p.track_id = t.id
+                        left join "user" u on ua.user_id = u.id
+                        where p.id = {playback_id}


Have you looked for a safer way to execute this query? You are literally rendering the parameter directly into the SQL, instead of escaping it or passing it as a query parameter. I don't know whether aiopg, the library we are using, can do it, but please have a look.

https://chartio.com/resources/tutorials/how-to-execute-raw-sql-in-sqlalchemy/
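To illustrate the difference, here is a small sketch of passing the value as a bound parameter instead of formatting it into the SQL string. It uses the standard library's sqlite3 purely as a runnable stand-in, so the table contents and the `actions_for_playback` helper are hypothetical. Placeholder syntax depends on the driver: sqlite3 uses `?`, while psycopg2, which aiopg builds on, uses `%s` or `%(name)s`.

```python
import sqlite3

conn = sqlite3.connect(':memory:')
conn.execute('create table user_action (playback_id integer, user_id integer, action text)')
conn.executemany(
    'insert into user_action values (?, ?, ?)',
    [(1, 10, 'upvote'), (2, 11, 'skip')],
)


def actions_for_playback(conn, playback_id):
    # The value travels separately from the SQL text, so it is never parsed as SQL.
    query = 'select user_id, action from user_action where playback_id = ?'
    return conn.execute(query, (playback_id,)).fetchall()


rows = actions_for_playback(conn, 1)
# A hostile value is compared as data; it cannot widen the WHERE clause.
hostile = actions_for_playback(conn, '1 or 1=1')
```

Had the hostile string been rendered into the query with string formatting, the condition would have matched every row.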

txomon · 2019-04-10T12:14:54Z

So going back through this, I have found that the performance is a bit terrible, using explain analyze:

 Sort  (cost=93032.76..93038.25 rows=2198 width=85) (actual time=2020.756..2086.115 rows=226029 loops=1)
   Sort Key: sub3.action_timestamp
   Sort Method: external merge  Disk: 17640kB
   ->  Subquery Scan on sub3  (cost=85749.53..92910.75 rows=2198 width=85) (actual time=1536.322..1882.256 rows=226029 loops=1)
         Filter: ((sub3.rn = 1) OR (sub3.next_action = 'skip'::action))
         Rows Removed by Filter: 14088
         ->  WindowAgg  (cost=85749.53..89605.57 rows=220345 width=85) (actual time=1536.321..1782.844 rows=240117 loops=1)
               ->  Sort  (cost=85749.53..86300.40 rows=220345 width=81) (actual time=1536.315..1605.015 rows=240117 loops=1)
                     Sort Key: sub.username
                     Sort Method: external sort  Disk: 18600kB
                     ->  Subquery Scan on sub  (cost=47937.00..55649.07 rows=220345 width=81) (actual time=962.714..1387.867 rows=240117 loops=1)
                           ->  WindowAgg  (cost=47937.00..53445.62 rows=220345 width=97) (actual time=962.713..1286.604 rows=240117 loops=1)
                                 ->  Sort  (cost=47937.00..48487.86 rows=220345 width=81) (actual time=962.706..1077.643 rows=240117 loops=1)
                                       Sort Key: ua.user_id, p.id, u.username, ua.ts DESC
                                       Sort Method: external merge  Disk: 17080kB
                                       ->  Hash Left Join  (cost=10107.73..17836.53 rows=220345 width=81) (actual time=179.670..692.516 rows=240117 loops=1)
                                             Hash Cond: (ua.user_id = u.id)
                                             ->  Hash Left Join  (cost=10098.62..17246.05 rows=220345 width=72) (actual time=179.446..575.505 rows=240117 loops=1)
                                                   Hash Cond: (p.track_id = t.id)
                                                   ->  Hash Right Join  (cost=7499.76..10969.74 rows=220345 width=24) (actual time=139.819..307.686 rows=240117 loops=1)
                                                         Hash Cond: (ua.playback_id = p.id)
                                                         ->  Seq Scan on user_action ua  (cost=0.00..1527.04 rows=75404 width=20) (actual time=0.009..25.753 rows=75446 loops=1)
                                                         ->  Hash  (cost=3884.45..3884.45 rows=220345 width=8) (actual time=139.252..139.252 rows=220465 loops=1)
                                                               Buckets: 131072  Batches: 4  Memory Usage: 3192kB
                                                               ->  Seq Scan on playback p  (cost=0.00..3884.45 rows=220345 width=8) (actual time=0.008..61.668 rows=220465 loops=1)
                                                   ->  Hash  (cost=1425.05..1425.05 rows=52705 width=56) (actual time=39.358..39.358 rows=52928 loops=1)
                                                         Buckets: 65536  Batches: 2  Memory Usage: 2832kB
                                                         ->  Seq Scan on track t  (cost=0.00..1425.05 rows=52705 width=56) (actual time=0.009..16.515 rows=52928 loops=1)
                                             ->  Hash  (cost=6.27..6.27 rows=227 width=13) (actual time=0.195..0.196 rows=258 loops=1)
                                                   Buckets: 1024  Batches: 1  Memory Usage: 20kB
                                                   ->  Seq Scan on "user" u  (cost=0.00..6.27 rows=227 width=13) (actual time=0.008..0.108 rows=258 loops=1)
 Planning time: 1.078 ms
 Execution time: 2133.643 ms
(33 rows)
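One thing that stands out in the plan is that all three Sort nodes spill to disk. A quick, hypothetical way to pull those lines out of a saved plan is sketched below; the `plan` string only repeats the three "Sort Method" lines from the output above, and the regular expression is an assumption about that line format rather than a general plan parser.

```python
import re

# The three "Sort Method" lines copied from the plan above.
plan = """
   Sort Method: external merge  Disk: 17640kB
                     Sort Method: external sort  Disk: 18600kB
                                       Sort Method: external merge  Disk: 17080kB
"""

# Capture the sort method and the amount of disk used by each spilled sort.
spills = re.findall(r'Sort Method: (external \w+)\s+Disk: (\d+)kB', plan)
total_kb = sum(int(size) for _, size in spills)
```

Here `spills` holds one (method, size) pair per on-disk sort and `total_kb` is their combined size in kB.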

txomon · 2019-04-10T12:17:22Z

check out https://thoughtbot.com/blog/reading-an-explain-analyze-query-plan

dtgoitia force-pushed the test branch 2 times, most recently from c9aafc8 to 19d843d Compare August 4, 2018 23:19

dtgoitia force-pushed the test branch from 90a7719 to 2d00f22 Compare August 9, 2018 18:43

txomon requested changes Aug 9, 2018

dtgoitia force-pushed the test branch 3 times, most recently from 69f6a14 to 89114e4 Compare August 9, 2018 20:46

txomon reviewed Aug 9, 2018

dtgoitia force-pushed the test branch 2 times, most recently from 155946f to 14364e5 Compare August 11, 2018 13:13

txomon reviewed Aug 22, 2018

txomon force-pushed the test branch 2 times, most recently from f79f1b9 to 17545ee Compare August 22, 2018 22:37

fix: query_simplified_user_actions and add test cases

18b4c2c

dtgoitia force-pushed the test branch from 17545ee to 18b4c2c Compare March 13, 2019 21:22
