Skip to main content

Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation

Project description

Tranco

This package allows easy access to the Tranco list, published at https://tranco-list.eu.

Usage

Create a Tranco object, indicating where you want to cache downloaded lists (caching is required):

from tranco import Tranco
t = Tranco(cache=True, cache_dir='.tranco')

You can then retrieve lists from this object using the list method:

latest_list = t.list()
date_list = t.list(date='2019-02-25')

The list method accepts the following parameters:

  • date: the date of the list you want to retrieve (in the format YYYY-MM-DD). If not given, the latest daily list is returned
  • list_id: the ID of the list you want to retrieve. If neither list ID nor date are given, the latest daily list is returned. If both are given, you will get an exception
  • subdomains: whether to include subdomains; only relevant when requesting a daily list. Default: False
  • full: whether to retrieve the full list (else only the top million). Default: False

This method returns a TrancoList, which allows you to retrieve a certain prefix of the list (top), the list ID (list_id), the list page (list_page) or the rank of a domain (rank):

latest_list.top(10000)
latest_list.list_id
latest_list.list_page
latest_list.rank("google.com")
latest_list.rank("not.in.ranking") # returns -1

You can also generate custom lists. First, create a Tranco object with valid credentials (available from your account page):

from tranco import Tranco
t = Tranco(account_email="abc@xyz.eu", api_key="123ABC")

Then, pass the configuration (according to this schema) of your custom list to configure:

c = t.configure(
    {
        'providers': ['alexa', 'umbrella', 'majestic'],
        'startDate': '2021-01-01',
        'endDate': '2021-01-30',
        'combinationMethod': 'dowdall',
        'listPrefix': 'full',
        'filterPLD': 'on',
    }
)

This method returns a tuple: whether the list is already available or is still being generated, and the ID that has been/will be assigned to the list.

You can retrieve metadata for a list through list_metadata:

m = t.list_metadata(list_id="6P7X")

If a list is still being generated, you can use this method to track the progress; once a list has been generated, this metadata will indicate how the list was configured.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tranco-0.8.1.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

tranco-0.8.1-py3-none-any.whl (8.3 kB view details)

Uploaded Python 3

File details

Details for the file tranco-0.8.1.tar.gz.

File metadata

  • Download URL: tranco-0.8.1.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.6

File hashes

Hashes for tranco-0.8.1.tar.gz
Algorithm Hash digest
SHA256 b44ce840e90d0d7d7ad3b9575403e8cd32ddb93496c86379612edfdb4d4d6378
MD5 e8cdda9f17ae259a3825bdbe3a5612b6
BLAKE2b-256 b4414e804e75f94d63ff5d214fdfe2aabeb14edf93ac03c69f3fc9b6c9349570

See more details on using hashes here.

File details

Details for the file tranco-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: tranco-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 8.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.6

File hashes

Hashes for tranco-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 69fda76fd46793a0dbbd96afe44d4ea7c6fd88077cc0ec74148b920571e0774a
MD5 a9b70bfc078906bb6f42932560cb28ec
BLAKE2b-256 870dac873e926498157a1454dd9b6998c5a265bc789301c74ebcd632c4628c95

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page