Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation
Tranco
This package allows easy access to the Tranco list, published at https://tranco-list.eu.
Usage
Create a Tranco object, indicating where you want to cache downloaded lists (caching is required):
from tranco import Tranco
t = Tranco(cache=True, cache_dir='.tranco')
You can then retrieve lists from this object using the list method:
latest_list = t.list()
date_list = t.list(date='2019-02-25')
The list method accepts the following parameters:
- date: the date of the list you want to retrieve (in the format YYYY-MM-DD). If not given, the latest daily list is returned.
- list_id: the ID of the list you want to retrieve. If neither list ID nor date are given, the latest daily list is returned. If both are given, you will get an exception.
- subdomains: whether to include subdomains; only relevant when requesting a daily list. Default: False
- full: whether to retrieve the full list (else only the top million). Default: False
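The rules for date and list_id described above can be illustrated with a small standalone sketch. It does not call the package; the helper name check_list_request, its return labels, and the choice of ValueError are assumptions made for illustration, not the package's actual implementation.

```python
from datetime import datetime
from typing import Optional


def check_list_request(date: Optional[str] = None,
                       list_id: Optional[str] = None) -> str:
    """Hypothetical helper: classify a request following the documented rules."""
    if date is not None and list_id is not None:
        # The text says passing both raises an exception;
        # the exception type used here is an assumption.
        raise ValueError("pass either date or list_id, not both")
    if list_id is not None:
        return "by_id"
    if date is not None:
        # strptime raises ValueError if the string cannot be parsed
        # as a year-month-day date.
        datetime.strptime(date, "%Y-%m-%d")
        return "by_date"
    # Neither argument given: the latest daily list is requested.
    return "latest"
```

Running such a check before calling list can surface a malformed date or a conflicting pair of arguments without a network round trip.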
This method returns a TrancoList, which allows you to retrieve a certain prefix of the list (top),
the list ID (list_id), the list page (list_page) or the rank of a domain (rank):
latest_list.top(10000)
latest_list.list_id
latest_list.list_page
latest_list.rank("google.com")
latest_list.rank("not.in.ranking") # returns -1
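To make the behaviour of top and rank concrete without downloading a list, here is a minimal stand-in. MiniRanking is a hypothetical class written for this illustration, not the package's TrancoList, and the 1-based rank numbering is an assumption; only the -1 result for a missing domain is taken from the text above.

```python
from typing import List


class MiniRanking:
    """Hypothetical stand-in for a ranked list of domains."""

    def __init__(self, domains: List[str]) -> None:
        self.domains = domains
        # Assumption: ranks are 1-based, so the first domain has rank 1.
        self._ranks = {d: i for i, d in enumerate(domains, start=1)}

    def top(self, num: int) -> List[str]:
        # A prefix of the list: the num highest-ranked domains.
        return self.domains[:num]

    def rank(self, domain: str) -> int:
        # -1 signals that the domain is not in the ranking.
        return self._ranks.get(domain, -1)


ranking = MiniRanking(["example.com", "example.org", "example.net"])
print(ranking.top(2))
print(ranking.rank("example.net"))
print(ranking.rank("not.in.ranking"))
```

Because a missing domain yields -1 rather than an exception, callers that filter by rank should test for -1 explicitly before comparing against a threshold.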
You can also generate custom lists.
First, create a Tranco object with valid credentials
(available from your account page):
from tranco import Tranco
t = Tranco(account_email="abc@xyz.eu", api_key="123ABC")
Then, pass the configuration (according to this schema)
of your custom list to configure:
c = t.configure(
    {
        'providers': ['alexa', 'umbrella', 'majestic'],
        'startDate': '2021-01-01',
        'endDate': '2021-01-30',
        'combinationMethod': 'dowdall',
        'listPrefix': 'full',
        'filterPLD': 'on',
    }
)
This method returns a tuple: whether the list is already available or is still being generated, and the ID that has been/will be assigned to the list.
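The 'combinationMethod': 'dowdall' entry in the configuration above names the Dowdall rule, in which an item at rank r in a list scores 1/r and the scores are summed across lists. The following is a minimal sketch of that rule only; the function name dowdall_combine and the alphabetical tie-break are assumptions, and the service's actual handling of date ranges, missing entries, and ties may differ.

```python
from collections import defaultdict
from fractions import Fraction
from typing import Dict, List


def dowdall_combine(rankings: List[List[str]]) -> List[str]:
    """Hypothetical helper: merge ranked lists using Dowdall scores."""
    scores: Dict[str, Fraction] = defaultdict(Fraction)
    for ranking in rankings:
        for position, domain in enumerate(ranking, start=1):
            # Rank 1 contributes 1, rank 2 contributes 1/2, and so on.
            scores[domain] += Fraction(1, position)
    # Highest total first; ties broken alphabetically (an assumption).
    return sorted(scores, key=lambda d: (-scores[d], d))


combined = dowdall_combine([
    ["a.example", "b.example", "c.example"],
    ["b.example", "a.example"],
    ["b.example", "c.example", "a.example"],
])
print(combined)
```

Exact fractions are used instead of floats so that equal scores compare as equal and the resulting order is deterministic.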
You can retrieve metadata for a list through list_metadata:
m = t.list_metadata(list_id="6P7X")
If a list is still being generated, you can use this method to track the progress; once a list has been generated, this metadata will indicate how the list was configured.
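Tracking progress as described above usually means calling the metadata method repeatedly until the list is ready. The sketch below shows one way to structure that loop. The helper wait_until_ready and its parameters are hypothetical, and because the metadata fields are not documented here, the readiness check is left to a caller-supplied function rather than assuming a field name.

```python
import time
from typing import Any, Callable, Dict


def wait_until_ready(
    fetch_metadata: Callable[[], Dict[str, Any]],
    is_ready: Callable[[Dict[str, Any]], bool],
    interval_seconds: float = 30.0,
    max_attempts: int = 20,
) -> Dict[str, Any]:
    """Hypothetical helper: poll until is_ready(metadata) returns True."""
    for attempt in range(max_attempts):
        metadata = fetch_metadata()
        if is_ready(metadata):
            return metadata
        # Sleep between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)
    raise TimeoutError("list was not ready after the allowed number of attempts")
```

With the package, fetch_metadata could be something like lambda: t.list_metadata(list_id="6P7X"), while is_ready would inspect whichever field of the returned metadata indicates completion.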
File details
Details for the file tranco-0.8.1.tar.gz.
File metadata
- Download URL: tranco-0.8.1.tar.gz
- Upload date:
- Size: 8.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | b44ce840e90d0d7d7ad3b9575403e8cd32ddb93496c86379612edfdb4d4d6378 |
| MD5 | e8cdda9f17ae259a3825bdbe3a5612b6 |
| BLAKE2b-256 | b4414e804e75f94d63ff5d214fdfe2aabeb14edf93ac03c69f3fc9b6c9349570 |
File details
Details for the file tranco-0.8.1-py3-none-any.whl.
File metadata
- Download URL: tranco-0.8.1-py3-none-any.whl
- Upload date:
- Size: 8.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 69fda76fd46793a0dbbd96afe44d4ea7c6fd88077cc0ec74148b920571e0774a |
| MD5 | a9b70bfc078906bb6f42932560cb28ec |
| BLAKE2b-256 | 870dac873e926498157a1454dd9b6998c5a265bc789301c74ebcd632c4628c95 |