Skip to main content

A python based scraper for any data source.

Project description

# pyscrape

### Setup Local Dev Install python, virtualenv and deps. To get started go [here](https://realpython.com/blog/python/flask-by-example-part-1-project-setup).

Once

Then: pip install -r requirements.txt

### Pushing to Production/Staging on Heroku (Don’t do this, ci should do this) git remote add heroku-staging git@heroku.com:pyscrape-staging.git git remote add heroku-production git@heroku.com:pyscrape-production.git Or make deploy

### Release make release

### Usage Pyscraper is meant as a framework to help with the extraction, transformation and loading of data between sources.

To get started, create a new Python project and then pip install pypscraper-framework.

To run the app flask app frontend: pyscraper_flask To run the worker process: pyscraper_worker

The following two environment vars are required: ` export APP_SETTINGS='DevelopmentConfig' # name of the corresponding config class for this env. export APP_BASEDIR=$(pwd) # must point to directory containing your config file. `

A config file is also required. See config.py.example.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyscraper_framework-0.0.19.tar.gz (7.4 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page