Skip to main content

A simple data scraping utility, used to extract data from 11888.gr

Project description

Scrap11888

A simple data scraping utility, used to extract data from 11888.gr

Screenshot of Scrap11888

Important Dependencies

Manually-Resolved:

Note that pip and tkinter can be easily installed via the default Python wizard-installer on Windows systems. Look for the appropriate checkboxes.

Auto-Resolved:

  • requests
  • openpyxl

Features

  • Query 11888 by name or by namelist (.xlsx format)
  • Search people by geographical location
  • Filter by address
  • Embedded caching system
  • Multithreading-Powered Speedups

Quick Start

  1. Make sure that you have downloaded/ configured the manually resolved dependencies (see above) on your system
  2. Optionally, create a Python Virtual Environment and activate it. See how here
  3. On a terminal run pip install Scrap11888
  4. Run scrap

Just in case the script is not recognized (more likely in Linux systems), just import the package manually and run it programmatically. Something like python -c "import Scrap11888; Scrap11888.main()" should do the trick.

Notes

GDPR: All gathered data should be manually deleted within a month since scrap-day. Auto-deletion feature will soon be available.

Scraping Ethics: Scraping is an automated process of data fetching, based on legal communication between user's computer and server of interest (11888). In order to achieve high speeds, Scrap11888 makes a considerable amount of parallel, frequent queries. This impacts server's resources, lowering its ability to respond to other users quickly as well. Thus, users are strongly recommended to respect target server by limiting their scrap-searches to just a few per hour.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Scrap11888-4.5.tar.gz (47.2 kB view details)

Uploaded Source

Built Distribution

Scrap11888-4.5-py3-none-any.whl (50.5 kB view details)

Uploaded Python 3

File details

Details for the file Scrap11888-4.5.tar.gz.

File metadata

  • Download URL: Scrap11888-4.5.tar.gz
  • Upload date:
  • Size: 47.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.0 CPython/3.9.5

File hashes

Hashes for Scrap11888-4.5.tar.gz
Algorithm Hash digest
SHA256 3a5f7c4f8d11b3db2a6ffc82f7f688993236a818c24bbc94e34187ce3715bd16
MD5 07483acaacb9f8e15adfe635f1d171ad
BLAKE2b-256 f5aea6135c20a2ef5dc91d533309e9110e06fe7b8ca8f304088d6776bd84c6ec

See more details on using hashes here.

File details

Details for the file Scrap11888-4.5-py3-none-any.whl.

File metadata

  • Download URL: Scrap11888-4.5-py3-none-any.whl
  • Upload date:
  • Size: 50.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.0 CPython/3.9.5

File hashes

Hashes for Scrap11888-4.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e51a403767d3be26947c31fa5f6e8d6e60281fee1b24b2a511e4b26e24f560a9
MD5 df9b2f7f4932880f14e80256803b8603
BLAKE2b-256 1fcf0cbf0904fef96ecef45b9144b1c2eadf626f72055e183787ccefca192d44

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page