Learn about tracking technologies, market structure and data-sharing on the web

Project description

whotracks.me

Bringing Transparency to online tracking - built by Cliqz and Ghostery.


This repository contains:

  • data on trackers and websites as shown on whotracks.me (WTM)

  • database mapping tracker domains to companies

  • code to render the whotracks.me site

Installation

Python 3.6 is needed to build the site. We recommend creating a virtualenv (or pipenv) to install the dependencies.

From PyPI

$ pip install whotracksme

From source

$ pip install -e .

That’s all you need to get started!

Using the data

To get started with the data, everything you need can be found in whotracksme.data:

from whotracksme.data.loader import DataSource

data = DataSource()

# available entities
data.trackers
data.companies
data.sites

For examples of scripts, have a look in the contrib folder!
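As a first step before writing a script, it can help to check what kind of objects the loader hands back. The sketch below is a minimal illustration only: `describe_entities` and `ENTITY_NAMES` are hypothetical helpers, not part of whotracksme. It relies solely on the `DataSource` class and the three attribute names shown above, and makes no assumption about the methods those entities offer.

```python
# Hypothetical helper: reports the Python type behind each entity.
# Only the attribute names `trackers`, `companies` and `sites` come from
# the README; everything else here is illustrative.
ENTITY_NAMES = ("trackers", "companies", "sites")


def describe_entities(source):
    """Return {entity name: type name} for the entities listed above."""
    summary = {}
    for name in ENTITY_NAMES:
        entity = getattr(source, name)
        summary[name] = type(entity).__name__
    return summary


if __name__ == "__main__":
    try:
        from whotracksme.data.loader import DataSource
    except ImportError:
        print("whotracksme is not installed; run `pip install whotracksme` first")
    else:
        for name, type_name in describe_entities(DataSource()).items():
            print(f"data.{name}: {type_name}")
```

From there, `help()` or `dir()` on each entity in an interactive session shows what is actually available in the installed version.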

Building the site

Building the site requires a few extra dependencies, which are not installed by default to keep the base installation light. You will need to install whotracksme from the repository, because not all assets are packaged with the whotracksme release on PyPI:

$ pip install -e '.[dev]'

Once this is done, you will have access to a whotracksme entry point that can be used this way:

$ whotracksme website [serve]

The serve part is optional and can be used while making changes on the website.

All generated artifacts can be found in the _site/ folder.
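To check what a build produced, the output folder can be walked with the standard library. This is a small sketch, assuming the build was run from the current directory so that `_site/` is a relative path; `list_artifacts` is a hypothetical helper and not part of the whotracksme entry point.

```python
from pathlib import Path


def list_artifacts(site_dir="_site"):
    """Return the generated files under site_dir as sorted relative paths."""
    root = Path(site_dir)
    if not root.is_dir():
        # Nothing has been built yet (or we are in the wrong directory).
        return []
    return sorted(
        path.relative_to(root).as_posix()
        for path in root.rglob("*")
        if path.is_file()
    )


if __name__ == "__main__":
    for relative_path in list_artifacts():
        print(relative_path)
```

An empty result usually means the site has not been built yet or the script was started from a different directory.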

Tests

To run tests, you will need pytest, or simply install whotracksme with the dev extra:

$ pip install -e '.[dev]'
$ pytest

Publishing a new version

$ pip install twine
$ python setup.py sdist bdist_wheel
$ twine upload --username cliqz-oss dist/*

Contributing

We are happy to take contributions on:

  • Guest articles for our blog on the topics of tracking, privacy and security. Feel free to use the data in this repository if you need inspiration.

  • Feature requests that are doable using the WTM database.

  • Curating our database of tracker profiles. Open an issue if you spot anything odd.

Right to Amend

Please read our Guideline for 3rd parties wanting to suggest corrections to their data.

License

The content of this project itself is licensed under the Creative Commons Attribution 4.0 license, and the underlying source code used to generate and display that content is licensed under the MIT license.

Download files

Source Distribution

  • whotracksme-2018.5.17.tar.gz (61.2 MB), source

Built Distributions

  • whotracksme-2018.5.17-py3.6.egg (61.8 MB), egg

  • whotracksme-2018.5.17-py3-none-any.whl (61.7 MB), Python 3

File details

Hashes for whotracksme-2018.5.17.tar.gz

Algorithm    Hash digest
SHA256       0d03bdebb66725e53d9b25b331096d22b2a95f161a3cc0879183b95cee6864fb
MD5          70d019711b69bb974093fdc957f10573
BLAKE2b-256  0cb32a7fbf7461447fdf167d31719d946c2112be96710ae41396e068de69fbcc

Hashes for whotracksme-2018.5.17-py3.6.egg

Algorithm    Hash digest
SHA256       a7569687c4f9a949b4c61d565c29aa6e4e89e224ad36941e4898ef24f66c35f4
MD5          fbd136feabf1212b08f41456e989f647
BLAKE2b-256  1c32e7987f0d11466abf09069595283a15727dff92c7486067fa8a631c2eacac

Hashes for whotracksme-2018.5.17-py3-none-any.whl

Algorithm    Hash digest
SHA256       c3aa336c8d21c77253d462f06ff3cedb5cf56e7c2ac0fe5537dd72d0dc2e64d8
MD5          b9330d904988a8a1ad58d1b213613ab4
BLAKE2b-256  482330acbe1ee9c545eec2cb732534cdd2e560495b14fc1a44727a1e8097839d
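The SHA256 digests listed above can be compared against a downloaded file using Python's standard `hashlib` module. The sketch below assumes the source tarball was saved to the current directory under its published name; `sha256_of` and `matches` are illustrative helper names, and the expected digest is the one listed for `whotracksme-2018.5.17.tar.gz`.

```python
import hashlib
import os

# Digest copied from the SHA256 row for whotracksme-2018.5.17.tar.gz.
EXPECTED_SHA256 = "0d03bdebb66725e53d9b25b331096d22b2a95f161a3cc0879183b95cee6864fb"


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so a ~60 MB archive is never fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Return True when the file's SHA256 equals the expected hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    archive = "whotracksme-2018.5.17.tar.gz"  # assumed local file name
    if os.path.exists(archive):
        print("OK" if matches(archive, EXPECTED_SHA256) else "MISMATCH")
    else:
        print(f"{archive} not found in the current directory")
```

A mismatch means the file differs from the one that was uploaded, so it should be downloaded again rather than installed.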
