Skip to content
forked from hedii/php-crawler

A crawler written in php finding email addresses on the internets

License

Notifications You must be signed in to change notification settings

net1/php-crawler

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

php-crawler

A crawler written in php: find email addresses on the internets. See it in action here (video): https://www.youtube.com/watch?v=rWsb6E_335U

Installation

  1. Put this all files on your server
  2. Create a mysql database
  3. Create database tables using the SQL code below
  4. Open Crawler.php and edit the __construct function with your database connection infos

Database tables creation

Open a SQL terminal, paste this and execute:

CREATE TABLE `emails` (
  `id` int(11) unsigned NOT NULL AUTO_INCREMENT,
  `email` varchar(255) NOT NULL DEFAULT '',
  `date` datetime NOT NULL,
  PRIMARY KEY (`id`)
);

CREATE TABLE `urls` (
  `id` int(11) unsigned NOT NULL AUTO_INCREMENT,
  `url` varchar(1000) NOT NULL DEFAULT '',
  `date` datetime NOT NULL,
  `visited` tinyint(1) NOT NULL DEFAULT '0',
  `email_visited` tinyint(1) NOT NULL DEFAULT '0',
  PRIMARY KEY (`id`)
);

Usage

  1. Navigate to index.php
  2. Enter an url on the form input and click to fire the form. The crawler will scan all url on this page and put them in the database. The crawler will then visite all unvisited url that are in the database, and do the same search for other urls.
  3. Navigate to emails.php. The crawler will now start to search for email addresses in urls that are in the database.
  4. If you want a list of all the emails, just export your database table 'emails', and do whatever you want with it.

About

A crawler written in php finding email addresses on the internets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • PHP 98.8%
  • Other 1.2%