A Python package that implements scraping of Israeli supermarket data

Project description

Israel Supermarket Scraper: Clients to download the data published by the supermarkets.

Price transparency (price comparison) - https://www.gov.il/he/departments/legalInfo/cpfta_prices_regulations

Got a question?

You can email me at erlichsefi@gmail.com

If you think you've found a bug:

  • Search the issue tracker to see whether it has already been reported; if not, create a new issue.
  • Please consider fixing the issue yourself and opening a pull request.

What is il_supermarket_scarper?

There are many projects on GitHub that try to scrape the supermarket data, but most of them are unstable or have not been updated in a while. It is about time there was one codebase that does the job completely.

You only need to run the following code to get all the data currently shared by the supermarkets.

from il_supermarket_scarper import MainScrapperRunner

scraper = MainScrapperRunner()
scraper.run()

Please note: since the supermarkets constantly upload new files to their sites, you will only get the current snapshot. To keep getting data, you will need to run this code more than once to pick up the newly uploaded files.
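One way to repeat the run is a small loop around any callable. The sketch below is not part of the package: `run_repeatedly` and its parameters are hypothetical names chosen for illustration, and the interval is only an assumption about how often new files appear.

```python
import time
from typing import Callable


def run_repeatedly(
    task: Callable[[], None],
    interval_seconds: float,
    max_runs: int,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call `task` up to `max_runs` times, pausing between runs.

    Hypothetical helper, not part of il_supermarket_scarper.
    Returns the number of runs that finished without raising.
    """
    succeeded = 0
    for run_index in range(max_runs):
        try:
            task()
            succeeded += 1
        except Exception as error:
            # Keep the loop alive if a single run fails.
            print(f"run {run_index + 1} failed: {error}")
        # No need to wait after the final run.
        if run_index < max_runs - 1:
            sleep(interval_seconds)
    return succeeded
```

With the package installed, passing `MainScrapperRunner().run` as `task` with an interval of, say, `6 * 60 * 60` seconds would fetch a fresh snapshot on each pass. A scheduler such as cron is an equally reasonable choice.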

Quick start

il_supermarket_scarper can be installed using pip:

python3 -m pip install -U il_supermarket_scarper

If you want to run the latest version of the code, you can install from the repo directly:

python3 -m pip install -U git+https://github.com/erlichsefi/israeli-supermarket-scarpers.git
# or if you don't have 'git' installed
python3 -m pip install -U https://github.com/erlichsefi/israeli-supermarket-scarpers/archive/master.zip

Running Docker

The Docker image is designed to run the scraper every 6 hours (you can change the cron expression if you like; check out the file 'crontab'). In every iteration the scraper collects the files available to download and checks whether each file already exists before fetching it, either by scanning the dump folder or by checking the Mongo database.

docker-compose up -d
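The "check before fetching" step can be pictured with a minimal sketch of the dump-folder variant. This is not the package's actual implementation: `files_to_fetch` is a hypothetical helper, and it assumes files are stored in the dump folder under the same names the supermarket publishes.

```python
from pathlib import Path
from typing import Iterable, List


def files_to_fetch(available: Iterable[str], dump_folder: Path) -> List[str]:
    """Return the names from `available` that are not yet in `dump_folder`.

    Hypothetical helper: assumes downloaded files keep their published names.
    """
    if dump_folder.is_dir():
        existing = {entry.name for entry in dump_folder.iterdir() if entry.is_file()}
    else:
        # Nothing has been downloaded yet, so every file is new.
        existing = set()
    # Preserve the order in which the files were listed.
    return [name for name in available if name not in existing]
```

A database-backed check would follow the same shape, with the set of existing names coming from a query instead of a directory listing.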

Contributing

Help in testing, development, documentation and other tasks is highly appreciated and useful to the project. There are tasks for contributors of all experience levels.

If you need help getting started, don't hesitate to contact me.

Development status

IL SuperMarket Scraper is beta software. As far as I can see, development has stopped until new issues are found.

Download files

Source Distribution

il-supermarket-scraper-0.2.tar.gz (5.3 kB view details)

Uploaded Source

Built Distribution

il_supermarket_scraper-0.2-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file il-supermarket-scraper-0.2.tar.gz.

File metadata

  • Download URL: il-supermarket-scraper-0.2.tar.gz
  • Upload date:
  • Size: 5.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for il-supermarket-scraper-0.2.tar.gz
Algorithm    Hash digest
SHA256       1f4718161a2cbee5da928180ffbab702cdd3f9b2915c891ca3a5c311fd0f8ed0
MD5          efc7808c5b74d945ba3a9606927f1f0b
BLAKE2b-256  967fe0c7578e887b4ddc74bddbbad842b50548f93e978eb6976fe7367d138fc1
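
A downloaded file can be checked against a published SHA256 digest with the standard library. The sketch below uses only `hashlib`, `hmac`, and `pathlib`; the function names are hypothetical, and the local file path is whatever you saved the download as.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of the file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: Path, expected_hex: str) -> bool:
    """Compare the file's digest with the published one, ignoring case and whitespace."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())
```

For example, `matches_expected(Path("il-supermarket-scraper-0.2.tar.gz"), "<published SHA256>")` should return `True` for an intact download.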

File details

Details for the file il_supermarket_scraper-0.2-py3-none-any.whl.

File hashes

Hashes for il_supermarket_scraper-0.2-py3-none-any.whl
Algorithm    Hash digest
SHA256       b193548af3f08b42862bd1d03065a2e33e064d1958bc505dbfd5d6d39567fbf6
MD5          5089efbbc6359234d3b756b487433d19
BLAKE2b-256  8168c04233ebd11910799ff72e50e6e7e4465a931404f924751c3d64760d99c7
