URL Extract

This module extracts the TLD, domain, subdomains and query from URLs. It also validates URLs.

Documentation: https://url-extract.readthedocs.io/en/latest/

Installation

pip install url_extract

Usage

>>> from url_extract import UrlExtract
>>> extract = UrlExtract()
Downloading list...
>>> extracted = extract.extract('http://dir.bg')
>>> extracted.getDomain()
'dir'
>>> extracted.getTld()
'bg'
>>> extracted.valid()
True
>>> extracted = extract.extract('https://sireninfo.com')
>>> extracted.getDomain()
'sireninfo'
>>> extracted = extract.extract('http://police.uk')
>>> extracted.valid()
False

Documentation

####class UrlExtract (datFileMaxAge=86400*31, datFileSaveDir=None, alwaysPuny=None)####

datFileMaxAge - Maximum age of the public suffix list file (default: 86400*31)
datFileSaveDir - Directory where the public suffix list (tlds.dat) is downloaded
alwaysPuny - If set to True, Unicode domains are Punycode-encoded after extraction
extract(url) - Parses the URL and returns a Result object
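
The two constructor options above can be illustrated with the standard library alone. The sketch below is not the library's implementation: `is_stale` and `to_punycode` are hypothetical helper names, and it assumes that the maximum age is measured in seconds against the file's modification time (86400*31 seconds is 31 days).

```python
import os
import time

# Default from the UrlExtract signature; assumed here to be in seconds.
DAT_FILE_MAX_AGE = 86400 * 31


def is_stale(path, max_age=DAT_FILE_MAX_AGE, now=None):
    """Return True if the file is missing or older than max_age seconds.

    Hypothetical helper: shows one way a cached suffix list such as
    tlds.dat could be checked before deciding to download it again.
    """
    if not os.path.exists(path):
        return True
    if now is None:
        now = time.time()
    return now - os.path.getmtime(path) > max_age


def to_punycode(hostname):
    """Encode a Unicode hostname to its ASCII (Punycode) form.

    Uses Python's built-in "idna" codec; the library may encode differently.
    """
    return hostname.encode("idna").decode("ascii")
```

For example, `to_punycode("bücher.de")` gives `'xn--bcher-kva.de'`, while a plain ASCII hostname such as `dir.bg` is returned unchanged.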

####class Result ()####

getDomain() - Returns the domain name without subdomains and TLD
getTld() - Returns the TLD of the domain
valid() - Validates the domain and returns True or False
getFoundSubdomains() - Returns the extracted subdomains as a list
getHostname() - Returns the hostname of the URL
getUrlQuery() - Returns the query after the first / in the URL
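
To make the relationship between hostname, subdomains, domain and TLD concrete, here is a minimal sketch of suffix-based splitting. It is an illustration under stated assumptions, not the library's code: `split_host` is a hypothetical function, and `SUFFIXES` is a tiny hand-written stand-in for the public suffix list, which in reality has thousands of entries and additional wildcard rules.

```python
from urllib.parse import urlsplit

# Tiny illustrative suffix set (assumption); the real list is far larger.
SUFFIXES = {"bg", "com", "uk", "co.uk", "police.uk"}


def split_host(url):
    """Split a URL's hostname into (subdomains, domain, tld).

    Scans from the longest candidate suffix to the shortest, so that
    "co.uk" wins over "uk". Returns domain=None when the whole hostname
    is itself a suffix, and tld=None when no suffix matches.
    """
    host = urlsplit(url).hostname or ""
    labels = host.split(".")
    for i in range(len(labels)):
        candidate = ".".join(labels[i:])
        if candidate in SUFFIXES:
            if i == 0:
                # The hostname is only a suffix; there is no domain label.
                return [], None, candidate
            return labels[: i - 1], labels[i - 1], candidate
    return [], None, None


print(split_host("http://dir.bg"))
print(split_host("http://www.example.co.uk/path"))
print(split_host("http://police.uk"))
```

With this suffix set, `http://police.uk` yields no domain label because the whole hostname is a suffix, which is presumably why `valid()` returns False for it in the usage example.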

