Allow spider attr #15

pawelmhm · 2015-04-10T11:20:35Z

This is about #14 and
#11

opening this for discussion, needs tests and improvements, would be cool to get some feedback if it's in good direction

chekunkov · 2015-04-10T11:36:23Z

scrapyjs/middleware.py


        slot_policy = splash_options.get('slot_policy', self.slot_policy)
        self._set_download_slot(request, meta, slot_policy)

        args = splash_options.setdefault('args', {})
-        args.setdefault('url', request.url)
+        args['url'] = request.url


@kmike are you okay with this change. What was usecase of .setdefault() here? Does it have any sense to create request with one url and specify another url in meta?

Maybe I was thinking of allowing user not to use 'url' argument at all. This argument is not required for Splash scripts - e.g. you can render a chunk of HTML using splash:set_content. But the meta syntax doesn't make much sense in this case; something in lines of #12 could be a better fit.

* removed process_html (render response in html feature) from this PR * removed remote request tests with extra spider

* refactors tests from functions to objects inheriting from unittest.TestCase * adds tests for enabling middleware with spider attribute

pawelmhm · 2015-05-26T07:16:12Z

updated after discussions

chekunkov · 2015-06-09T09:09:25Z

@pawelmhm tests are failing

chekunkov · 2015-06-09T09:12:31Z

scrapyjs/middleware.py

@@ -6,6 +6,7 @@
 from scrapy.exceptions import NotConfigured

 from scrapy import log
+from scrapy.http.response.html import HtmlResponse


looks like unused import

good point, this is fixed now

This reverts commit bed8998. T#

pawelmhm · 2015-06-19T12:45:07Z

TODO

docs

something else?

Gallaecio · 2019-07-11T11:31:39Z

I know it has been a rather long time, but it looks like this is still a valid pull request, and a nice once at that. It seems to me like it’s only missing renaming dont_proxy to something else and covering the changes in the documentation.

@pawelmhm Do you have plans to resume work on it at some point?

Gallaecio · 2019-08-12T15:18:23Z

Continued at #235

pawelmhm added 3 commits April 3, 2015 13:46

[middleware] allow enabling splash per spider

39740cb

[middleware] return HtmlResponse to spider

9efd145

[scrashtest] add another test spider

652fd6e

chekunkov mentioned this pull request Apr 10, 2015

Add an option to send requests to Splash by default #11

Open

chekunkov reviewed Apr 10, 2015
View reviewed changes

pawelmhm added 2 commits May 26, 2015 09:10

[spider attr] removed html_response

2e7407d

* removed process_html (render response in html feature) from this PR * removed remote request tests with extra spider

[tests] refactors tests, adds tests for spider attr

bed8998

* refactors tests from functions to objects inheriting from unittest.TestCase * adds tests for enabling middleware with spider attribute

chekunkov reviewed Jun 9, 2015
View reviewed changes

pawelmhm added 2 commits June 19, 2015 14:27

Revert "[tests] refactors tests, adds tests for spider attr"

a9b5323

This reverts commit bed8998. T#

[scrapy-plugins#15/spider_attribute] adds proper tests

586ab58

Gallaecio pushed a commit to Gallaecio/scrapy-splash that referenced this pull request Aug 12, 2019

[scrapy-plugins#15/spider_attribute] adds proper tests

6a82190

Gallaecio mentioned this pull request Aug 12, 2019

Allow spider attr #235

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow spider attr #15

Allow spider attr #15

pawelmhm commented Apr 10, 2015

chekunkov Apr 10, 2015

kmike Apr 10, 2015

pawelmhm commented May 26, 2015

chekunkov commented Jun 9, 2015

chekunkov Jun 9, 2015

pawelmhm Jun 19, 2015

pawelmhm commented Jun 19, 2015

Gallaecio commented Jul 11, 2019

Gallaecio commented Aug 12, 2019

Allow spider attr #15

Are you sure you want to change the base?

Allow spider attr #15

Conversation

pawelmhm commented Apr 10, 2015

chekunkov Apr 10, 2015

Choose a reason for hiding this comment

kmike Apr 10, 2015

Choose a reason for hiding this comment

pawelmhm commented May 26, 2015

chekunkov commented Jun 9, 2015

chekunkov Jun 9, 2015

Choose a reason for hiding this comment

pawelmhm Jun 19, 2015

Choose a reason for hiding this comment

pawelmhm commented Jun 19, 2015

TODO

Gallaecio commented Jul 11, 2019

Gallaecio commented Aug 12, 2019