Scrap11888

A simple data scraping utility, used to extract data from 11888.gr

Screenshot of Scrap11888

Important Dependencies

Manually-Resolved:

Note that pip and tkinter can be installed through the standard Python installer on Windows; look for the corresponding checkboxes during setup.

Auto-Resolved:

  • requests
  • openpyxl

Features

  • Query 11888 by name or by namelist (.xlsx format)
  • Search people by geographical location
  • Filter by address
  • Embedded caching system
  • Multithreading-powered speedups
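As a rough illustration of how caching and multithreading can work together, here is a minimal sketch. It is not Scrap11888's actual implementation: the CachedFetcher class, its methods, and the fake_fetch stand-in are all hypothetical names introduced for this example, and no real request to 11888.gr is made.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock


class CachedFetcher:
    """Wraps a fetch function with an in-memory cache (hypothetical helper)."""

    def __init__(self, fetch, max_workers=4):
        self._fetch = fetch            # callable: query -> result
        self._cache = {}               # query -> cached result
        self._lock = Lock()            # guards the cache across threads
        self._max_workers = max_workers

    def get(self, query):
        """Return the cached result, fetching it first if it is missing."""
        with self._lock:
            if query in self._cache:
                return self._cache[query]
        # Fetch outside the lock so that worker threads can overlap.
        result = self._fetch(query)
        with self._lock:
            # Keep the first stored result if another thread got here first.
            return self._cache.setdefault(query, result)

    def get_many(self, queries):
        """Resolve several queries in parallel, preserving input order."""
        unique = list(dict.fromkeys(queries))  # de-duplicate, keep order
        with ThreadPoolExecutor(max_workers=self._max_workers) as pool:
            results = dict(zip(unique, pool.map(self.get, unique)))
        return [results[q] for q in queries]


def fake_fetch(name):
    # Stand-in for a network call; returns placeholder data only.
    return {"name": name, "phone": "n/a"}
```

Calling CachedFetcher(fake_fetch).get_many(["Maria", "Nikos", "Maria"]) invokes fake_fetch once per distinct name; repeating the call afterwards is served entirely from the cache.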

Quick Start

  1. Make sure the manually resolved dependencies (see above) are installed and configured on your system
  2. Optionally, create and activate a Python virtual environment
  3. In a terminal, run pip install Scrap11888
  4. Run scrap

If the scrap command is not recognized (more likely on Linux systems), import the package and run it programmatically. Something like python -c "import Scrap11888; Scrap11888.main()" should do the trick.
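The same fallback can be kept in a small launcher script. This is only a sketch: the launch function is a hypothetical helper, and it assumes nothing beyond the Scrap11888.main() entry point mentioned above.

```python
import importlib


def launch(package_name="Scrap11888"):
    """Import the package and call its main(); return False if unavailable."""
    try:
        package = importlib.import_module(package_name)
    except ImportError:
        # Typically means pip install ran in a different environment.
        print(f"{package_name} is not installed in this environment.")
        return False
    package.main()
    return True
```

Saving this to a file and adding a final line that calls launch() gives a script that can be started with python from inside the environment where the package was installed.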

Notes

GDPR: All gathered data should be manually deleted within one month of the day it was scraped. An auto-deletion feature will soon be available.
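Until that feature exists, a manual clean-up could look roughly like the sketch below. It is not part of Scrap11888: find_expired and delete_expired are hypothetical helpers, the folder argument stands for wherever you chose to save exported results, and "a month" is approximated as 30 days based on each file's modification time.

```python
import time
from pathlib import Path

MAX_AGE_DAYS = 30  # assumption: "a month" approximated as 30 days


def find_expired(folder, max_age_days=MAX_AGE_DAYS, now=None):
    """List files directly inside folder last modified before the cutoff."""
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 24 * 60 * 60
    return sorted(
        path
        for path in Path(folder).iterdir()
        if path.is_file() and path.stat().st_mtime < cutoff
    )


def delete_expired(folder, max_age_days=MAX_AGE_DAYS, now=None):
    """Delete the expired files and return the paths that were removed."""
    expired = find_expired(folder, max_age_days, now)
    for path in expired:
        path.unlink()
    return expired
```

Running find_expired first shows what would be removed, which is a sensible check before calling delete_expired on a folder that may hold unrelated files.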

Scraping Ethics: Scraping is an automated process of fetching data, based on legal communication between the user's computer and the server of interest (11888). To achieve high speeds, Scrap11888 issues a considerable number of frequent, parallel queries. This consumes the server's resources and lowers its ability to respond quickly to other users. Users are therefore strongly encouraged to respect the target server by limiting their searches to just a few per hour.
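One way to hold yourself to such a limit is a small rolling-window budget, sketched below. This is an illustration rather than a Scrap11888 feature: the SearchBudget class is hypothetical, and the default of 3 searches per hour is an assumed reading of "just a few".

```python
import time
from collections import deque


class SearchBudget:
    """Allows at most `limit` searches per rolling `window` seconds."""

    def __init__(self, limit=3, window=3600.0, clock=time.monotonic):
        self.limit = limit        # assumed value for "a few per hour"
        self.window = window      # window length in seconds
        self._clock = clock       # injectable clock, handy for testing
        self._stamps = deque()    # start times of recent searches

    def _prune(self, now):
        # Forget searches that have fallen out of the rolling window.
        while self._stamps and now - self._stamps[0] >= self.window:
            self._stamps.popleft()

    def try_acquire(self):
        """Record a search and return True if the budget allows one now."""
        now = self._clock()
        self._prune(now)
        if len(self._stamps) < self.limit:
            self._stamps.append(now)
            return True
        return False

    def seconds_until_free(self):
        """Seconds to wait before the next search is allowed (0 if free)."""
        now = self._clock()
        self._prune(now)
        if len(self._stamps) < self.limit:
            return 0.0
        return self.window - (now - self._stamps[0])
```

Checking try_acquire() before each search, and waiting for seconds_until_free() when it returns False, keeps the number of searches inside the chosen hourly budget.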
