
Python library for scraping inside Airflow.

Reason this release was yanked:

Fails to install

Project description

as-scraper

Installation

The as-scraper library uses Selenium with Geckodriver (Firefox) for scraping. To use it, you need an Airflow image that includes the Geckodriver dependency.

We provide the as-airflow Docker image, which has Airflow ready with the Geckodriver dependency installed.
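Before running a scraper, it can help to confirm that Geckodriver is actually reachable in the environment where Airflow tasks run. The following is a minimal sketch that uses only the Python standard library. The helper name `geckodriver_path` is hypothetical and not part of as-scraper, and the sketch assumes the binary is installed under the name `geckodriver` on the PATH.

```python
import shutil


def geckodriver_path(binary_name="geckodriver"):
    """Return the full path of the Geckodriver binary, or None if it is not on PATH.

    Assumption: the image installs the driver under the name "geckodriver".
    """
    return shutil.which(binary_name)


if __name__ == "__main__":
    path = geckodriver_path()
    if path is None:
        print("geckodriver not found on PATH; Firefox scraping will likely fail")
    else:
        print(f"geckodriver found at {path}")
```

Running this inside the Airflow worker container (rather than on the host) gives the most relevant answer, since that is where the scraping tasks execute.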

To use this library, follow these steps:

1. Download the docker-compose.yaml file from the Airflow docs.

Airflow provides the docker-compose.yaml file you need for this library.

You can copy the file directly from the Airflow documentation or run the following command to download it:

curl -LfO 'https://airflow.apache.org/docs/apache-airflow/2.3.4/docker-compose.yaml'

2. Modify the docker-compose.yaml file.

After that, open the docker-compose.yaml file and change the Airflow image it uses:

...
version: '3'
x-airflow-common:
  &airflow-common
  # In order to add custom dependencies or upgrade provider packages you can use your extended image.
  # Comment the image line, place your Dockerfile in the directory where you placed the docker-compose.yaml
  # and uncomment the "build" line below, Then run `docker-compose build` to build the images.
  image: ${AIRFLOW_IMAGE_NAME:-almiavicas/as-airflow:2.2.3}
  ...

And that's it! You can now start using the as-scraper library.
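If you prefer to script the change from step 2 instead of editing the file by hand, the sketch below shows one possible approach. It is an illustration only: the function name `set_default_airflow_image` is hypothetical, and it assumes the image line keeps the `image: ${AIRFLOW_IMAGE_NAME:-...}` form shown above. The sample text and its original default image are also assumptions made for the demonstration.

```python
import re

# Assumption: the compose file contains a line of the form
#   image: ${AIRFLOW_IMAGE_NAME:-<default image>}
IMAGE_LINE = re.compile(
    r"^([ \t]*image:[ \t]*\$\{AIRFLOW_IMAGE_NAME:-)[^}]*(\})",
    re.MULTILINE,
)


def set_default_airflow_image(compose_text, image):
    """Return compose_text with the default Airflow image replaced by `image`."""
    new_text, count = IMAGE_LINE.subn(
        lambda m: m.group(1) + image + m.group(2),
        compose_text,
    )
    if count == 0:
        raise ValueError("no 'image: ${AIRFLOW_IMAGE_NAME:-...}' line found")
    return new_text


# Small in-memory sample standing in for the downloaded file.
sample = (
    "x-airflow-common:\n"
    "  &airflow-common\n"
    "  image: ${AIRFLOW_IMAGE_NAME:-apache/airflow:2.3.4}\n"
)
updated = set_default_airflow_image(sample, "almiavicas/as-airflow:2.2.3")
print(updated)
```

To apply it to the real file, you could read docker-compose.yaml as text, pass the contents through the function, and write the result back. The function raises `ValueError` when no matching line exists, so an unexpected file layout is not silently left unchanged.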
