You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'd like to be able to include the status of whether the response to the URL being scraped used SSL or not. The challenge is that inside the parse method of the SplashResponse the response.certificates attribute is populated by the SSL details of the Splash response, rather than from the original scraped URL response.
My understanding is that the magic_response=True param causes body, url and http_method attributes of the response object to be set to the values from the scraped URL response.
Is there currently a way to access the certificates attribute from the scraped URL response? Or would this need to be an extension of the magic_response functionality?
The text was updated successfully, but these errors were encountered:
Is there currently a way to access the certificates attribute from the scraped URL response? Or would this need to be an extension of the magic_response functionality?
@ned2 I think the first step would be to make sure that information you need is available in the splash response - either you can fish it from har (see https://splash.readthedocs.io/en/stable/api.html#render-json har option) or you'll need to write a custom lua script (see https://github.com/scrapy-plugins/scrapy-splash#examples and splash docs) and get this information from splash. As I understand, the information returned from splash would be available in response.data even if magic response is used.
I'd like to be able to include the status of whether the response to the URL being scraped used SSL or not. The challenge is that inside the
parse
method of theSplashResponse
theresponse.certificates
attribute is populated by the SSL details of the Splash response, rather than from the original scraped URL response.My understanding is that the
magic_response=True
param causesbody
,url
andhttp_method
attributes of the response object to be set to the values from the scraped URL response.Is there currently a way to access the
certificates
attribute from the scraped URL response? Or would this need to be an extension of themagic_response
functionality?The text was updated successfully, but these errors were encountered: