Skip to main content

A scraper of the french AMAP network returning structured information.

Project description

AMAP collector

This scraper collects the fragmented information of the french AMAP network from several web sources:

It returns the targeted information in JSON or CSV formats.

Usage

The client can be imported as a package or launched in the command line via the cli. See the embedded docstrings for full documentation on parameters or read this README further.

Package usage

The typical use case is importing a specific AMAP client (e.g. IdfAmapClient, HnAmapClient, etc) in your code, and its error class eventually.

The AMAP client object exposes a fluent interface to easily set the target parameters. Here below an example of integration to fetch results around the 75th department (Paris area) over 10 km radius:

from amap_collector.core.idf import IdfAmapClient

c = IdfAmapClient().with_department(75).with_km_radius(10)
results = c.get_amap_list()

where results is a list of dicts like:

{
  "name": "AMAP Coquéron",
  "status": "available_places",
  "website": "",
  "contact": {
  "name": "Agnès",
  "emails": [
      "email1@gmail.com",
      "email2@sauvegarde-paris.fr"
  ],
  "phones": [
      "01 XX XX XX XX",
      "07 XX XX XX XX"
  ]
  },
      "place": {
      "name": "Sauvegarde Paris",
      "address": "3 rue Coq Héron, 75001 PARIS",
      "delivery_time": "Mardi 17H-19H"
  },
  "comment": "Venez nombreux"
}

Concurrent scraping

You can also run the scraper concurrently, on multiple area codes:

import asyncio
from amap_collector.core.collector import collect

results = asyncio.run(collect(["75", "19", "80"], max_concurrent=4))

Or inside an async function:

from amap_collector.core.collector import collect

async def main():
    results = await collect(["75", "19", "80"], max_concurrent=4)

Note: max_concurrent has a default equal to 8.

Cli usage

The cli is built with Typer, you can explore command parameters and options via the --help modifier:

area_code A valid french department code or zip code; the latter (zip code) is applicable to the IDF scraper only

--farms-only Collect only farm information (applicable to HN and IA44 scrapers only)

--km-radius The Search radius in km (available options: 2, 5, 10, 15, 20) [default: 2]; this options is only valid for Île-de-France

--output-file The otput file path (.json or .csv)

--help It summarizes all these parameters and exit

Note 1: Results are shown by default in the standard output, unless a file path is specified via the --output-file modifier (JSON and CSV formats are only supported).

Note 2: The executable command name exposed after package installation is amap_collector.

Docker containerization

Build image

docker build -t amap-collector .

Run the command

Pass any amap_collector option directly after the image name:

# stdout — all AMAPs in dept 93 within 5 km
docker run --rm amap-collector 93 --km-radius 5

# save to a JSON file on the host
docker run --rm -v /tmp:/out amap-collector 75 --output-file /out/amaps.json

# narrow collection by zip code
docker run --rm amap-collector 75012

# collection of farms only, without their AMAPS
docker run --rm amap-collector 76 --farms-only

# concurrent collection with multiple departments
docker run --rm amap-collector 75,18,27

Run tests

Tests require the dev dependencies. Run them with:

uv run pytest

To run a specific test file:

uv run pytest tests/core/test_parser.py -v

Run the linter

Using ruff for linting:

docker run --rm amap-collector uv run --with ruff ruff check .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amap_collector-0.3.0.tar.gz (46.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

amap_collector-0.3.0-py3-none-any.whl (27.9 kB view details)

Uploaded Python 3

File details

Details for the file amap_collector-0.3.0.tar.gz.

File metadata

  • Download URL: amap_collector-0.3.0.tar.gz
  • Upload date:
  • Size: 46.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for amap_collector-0.3.0.tar.gz
Algorithm Hash digest
SHA256 8f93a7157b4b56eb9d693343a52896719427233bccc6bcf7920c27d566380b54
MD5 fd6d89f24be8353c51f9b64fe9d05ebf
BLAKE2b-256 6299dcde2fdb28a3d78f588b5b2c7a9c18bdef16f3da40613324152d24ecbd97

See more details on using hashes here.

File details

Details for the file amap_collector-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: amap_collector-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 27.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for amap_collector-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 74222b67c2b381a36f37a590cd6f3f6b89932c7894deb87a582dca309212f43b
MD5 d0381491cfd2d13fa56b2f5d93054907
BLAKE2b-256 4171b406c55061e448bfd8a467dcf4264ed5aeaf31701763f62db8d609882ff3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page