Skip to main content

Accurately separate the TLD from the registered domain and subdomains of a URL, using the Public Suffix List. By default, this includes the public ICANN TLDs and their exceptions. You can optionally support the Public Suffix List's private domains as well.

Project description

tldextract accurately separates the gTLD or ccTLD (generic or country code top-level domain) from the registered domain and subdomains of a URL.

>>> import tldextract
>>> tldextract.extract('http://forums.news.cnn.com/')
ExtractResult(subdomain='forums.news', domain='cnn', suffix='com')
>>> tldextract.extract('http://forums.bbc.co.uk/') # United Kingdom
ExtractResult(subdomain='forums', domain='bbc', suffix='co.uk')
>>> tldextract.extract('http://www.worldbank.org.kg/') # Kyrgyzstan
ExtractResult(subdomain='www', domain='worldbank', suffix='org.kg')

ExtractResult is a namedtuple, so it's simple to access the parts you want.

>>> ext = tldextract.extract('http://forums.bbc.co.uk')
>>> (ext.subdomain, ext.domain, ext.suffix)
('forums', 'bbc', 'co.uk')
>>> # rejoin subdomain and domain
>>> '.'.join(ext[:2])
'forums.bbc'
>>> # a common alias
>>> ext.registered_domain
'bbc.co.uk'

By default, this package supports the public ICANN TLDs and their exceptions. You can optionally support the Public Suffix List's private domains as well.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tldextract-3.1.0.tar.gz (101.9 kB view details)

Uploaded Source

Built Distribution

tldextract-3.1.0-py2.py3-none-any.whl (87.1 kB view details)

Uploaded Python 2Python 3

File details

Details for the file tldextract-3.1.0.tar.gz.

File metadata

  • Download URL: tldextract-3.1.0.tar.gz
  • Upload date:
  • Size: 101.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for tldextract-3.1.0.tar.gz
Algorithm Hash digest
SHA256 cfae9bc8bda37c3e8c7c8639711ad20e95dc85b207a256b60b0b23d7ff5540ea
MD5 7701259eabe8d80d98e031b254bf7733
BLAKE2b-256 429c0340ebfabfbbb3a2868eaa1039479688ca52e1d7f433df4fae638941377f

See more details on using hashes here.

File details

Details for the file tldextract-3.1.0-py2.py3-none-any.whl.

File metadata

  • Download URL: tldextract-3.1.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 87.1 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for tldextract-3.1.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 e57f22b6d00a28c21673d2048112f1bdcb6a14d4711568305f6bb96cf5bb53a1
MD5 ed3c16af283035c2423f83440e444cf5
BLAKE2b-256 7e62b6acd3129c5615b9860e670df07fd55b76175b63e6b7f68282c7cad38e9e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page