Skip to main content

autoscout24 web scrapper

Project description

Scrapeautoscout

This is a webscrapper from the largest pan-European online car market.

Usage

The script runs and creates two new directories where json files with the information extracted from the website will be saved. These can later be used to extract more specific information about the cars

Installation

To install this package you have to download it:

pip install scrapautoscout

After installation, you can run it locally with default parameters by command:

 scrapautoscout 

Or you can see this list of parameters:

scrapautoscout --help
  • --LOCATION LOCATION - 'local' or 's3'
  • --DIR_CACHE DIR_CACHE - Where to save artifacts. Default: 'cache' (relative to project root)
  • --AWS_PROFILE_NAME AWS_PROFILE_NAME - AWS profile name
  • --AWS_S3_BUCKET AWS_S3_BUCKET - AWS S3 bucket
  • --MAKERS MAKERS [MAKERS ] - List of makers delimited by space
  • --LOGS_LEVEL LOGS_LEVEL - Log level, e.g. 'debug', 'info', 'error'

After this you can run it with specified parameters, example:

scrapautoscout --LOCATION 's3' 

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapautoscout-1.3.0.tar.gz (45.6 kB view details)

Uploaded Source

File details

Details for the file scrapautoscout-1.3.0.tar.gz.

File metadata

  • Download URL: scrapautoscout-1.3.0.tar.gz
  • Upload date:
  • Size: 45.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for scrapautoscout-1.3.0.tar.gz
Algorithm Hash digest
SHA256 c0d82ccf21a56c54c209564e5ae969020817033580bf90d534db55d8a94b49d0
MD5 657f8d1a22a18d5455ba1be613a9abcf
BLAKE2b-256 a72eb944b22cf79b3444d4f5c9000e02ff4f8599dc0c1937d1112f60266fb73b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page