Skip to content
This repository has been archived by the owner on Jun 27, 2020. It is now read-only.
/ google-scraper Public archive

A Google Search Results Page Scraper written in Ruby

License

Notifications You must be signed in to change notification settings

gushonorato/google-scraper

Repository files navigation

Google Scraper

Google Scraper returns all Google Search results in a object oriented fashion. It wraps the boilerplate logic by parsing HTML from Google Search Results Page. All results are obtained directly from Google's SERP and not from deprecated Google Web Search API.

Installing

Add the line in your gemfile. Version 0.0.1 is not released yet, you have to add a reference to this Github until the release.

gem 'google-scraper', github: 'ghonorato/google-scraper'

Because Google Scraper uses Capybara and Poltergeist for scraping, you must have PhantonJS installed on your system.

Simple example

#!/usr/bin/env ruby

require 'google-scraper'

engine = GoogleScraper::Engine.new
results = engine.query(ARGV[0])

results.each do |r| 
  puts "#{r.rank} (Position #{r.page_position} for page #{r.page_number})"
  puts r.title
  puts "-> #{r.url}"
  puts r.friendly_url
  puts r.meta_description
  puts ""
end

Future

  • Add tests
  • Improve this README

About

A Google Search Results Page Scraper written in Ruby

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages