from scrapyjs import SplashRequest #47

Closed
podolskyi opened this issue Apr 1, 2016 · 5 comments
podolskyi commented Apr 1, 2016

When I try to import SplashRequest, an error occurs:

from scrapyjs import SplashRequest
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: cannot import name SplashRequest

but
import scrapyjs
works.

The installed package metadata:

Metadata-Version: 1.1
Name: scrapyjs
Version: 0.2
Summary: JavaScript support for Scrapy using Splash
Home-page: https://github.com/scrapy-plugins/scrapy-splash
Author: Mikhail Korobov
Author-email: [email protected]
License: BSD
Location: /usr/local/lib/python2.7/dist-packages

kmike commented Apr 1, 2016

Hey @podolskyi,

There are two reasons:

  1. If you're using scrapyjs from PyPI (https://pypi.python.org/pypi/scrapyjs) then you're using v0.2, where SplashRequest is not documented and not exposed. The GitHub README is for the master branch.

  2. In the master branch SplashRequest is not exposed as scrapyjs.SplashRequest; it is a bug which is fixed in #45 (Custom Splash responses) among other changes.

Sorry for the inconvenience! There are a lot of changes coming to scrapy-splash, so it is a bit in flux right now. https://pypi.python.org/pypi/scrapyjs/0.2 is a stable release.
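
A quick way to check which release you're actually running is a minimal sketch like the one below; it uses only standard pkg_resources, nothing scrapyjs-specific is assumed:

import pkg_resources
import scrapyjs

# Which scrapyjs release is installed, e.g. '0.2'?
print(pkg_resources.get_distribution('scrapyjs').version)

# Is SplashRequest exposed at the package level?
# This prints False on 0.2 and on pre-#45 master checkouts.
print(hasattr(scrapyjs, 'SplashRequest'))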

podolskyi (Author)

@kmike thanks for the quick response. I tested both versions, from PyPI and from the master branch.

Can you help me with a simple question?

  1. How can I use Crawlera with scrapyjs?

podolskyi (Author)

@kmike I tested the example from PyPI:

import scrapy

class MySpider(scrapy.Spider):

    # ...
    def start_requests(self):
        # Lua script run by Splash's /execute endpoint: load the page,
        # then return the JavaScript document title.
        script = """
        function main(splash)
            assert(splash:go(splash.args.url))
            return splash:evaljs("document.title")
        end
        """
        for url in self.start_urls:
            yield scrapy.Request(url, self.parse_result, meta={
                'splash': {
                    'args': {'lua_source': script},
                    'endpoint': 'execute',
                }
            })

    def parse_result(self, response):
        # /execute returns the script's return value as the response body.
        doc_title = response.body_as_unicode()
        # ...

It works, thanks.

kmike commented Apr 1, 2016

@podolskyi

  1. A basic way to use Crawlera is the proxy argument: add it to 'args'. But this solution has some issues, because Crawlera is not aware that you're sending multiple requests to render a single page. A better way is to follow the example at http://doc.scrapinghub.com/crawlera.html#using-crawlera-with-splash - there is some boilerplate there which you can copy-paste into your Lua script. A sketch of the basic approach follows this list.
  2. Check the example at https://pypi.python.org/pypi/scrapyjs ("Run a simple Splash Lua Script: ...") - you already figured that out!
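
For reference, a minimal sketch of the basic proxy-argument approach from point 1. Nothing below comes from the thread: the spider name and URL are illustrative, <APIKEY> is a placeholder, and it assumes Splash's proxy argument accepts a proxy URL in user:password@host:port form:

import scrapy

class CrawleraSplashSpider(scrapy.Spider):
    name = 'crawlera_splash'  # hypothetical spider name

    def start_requests(self):
        yield scrapy.Request('http://example.com', self.parse_result, meta={
            'splash': {
                'args': {
                    # Splash routes its own outgoing requests through this
                    # proxy; fill in your Crawlera API key as the username.
                    'proxy': 'http://<APIKEY>:@proxy.crawlera.com:8010',
                },
                'endpoint': 'render.html',
            }
        })

    def parse_result(self, response):
        self.logger.info('page title: %s',
                         response.xpath('//title/text()').extract_first())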

kmike commented Apr 11, 2016

scrapy-splash 0.3 is released; the README should now match the package on PyPI, so I'm closing this ticket.
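
With 0.3 the import from the issue title should work; a minimal sketch, assuming the 0.3-era package still installs under the scrapyjs name (spider name and URL are illustrative):

import scrapy
from scrapyjs import SplashRequest  # fails on 0.2, works on 0.3+

class TitleSpider(scrapy.Spider):
    name = 'title'  # hypothetical
    start_urls = ['http://example.com']

    def start_requests(self):
        for url in self.start_urls:
            # SplashRequest wraps the meta={'splash': {...}} boilerplate;
            # args are forwarded to the Splash endpoint.
            yield SplashRequest(url, self.parse_result, args={'wait': 0.5})

    def parse_result(self, response):
        doc_title = response.xpath('//title/text()').extract_first()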

kmike closed this as completed Apr 11, 2016