Skip to main content

A scraper of the french AMAP network returning structured information.

Project description

AMAP collector

This scraper collects the fragmented information of the french AMAP network from several web sources:

It returns the targeted information in JSON or CSV formats.

Usage

The client can be imported as a package or launched in the command line via the cli. See the embedded docstrings for full documentation on parameters or read this README further.

Package usage

The typical use case is importing a specific AMAP client (e.g. IdfAmapClient, HnAmapClient, etc) in your code, and its error class eventually.

The AMAP client object exposes a fluent interface to easily set the target parameters. Here below an example of integration to fetch results around the 75th department (Paris area) over 10 km radius:

from amap_collector.core.idf import IdfAmapClient

c = IdfAmapClient().with_department(75).with_km_radius(10)
results = c.get_amap_list()

where results is a list of dicts like:

{
  "name": "AMAP Coquéron",
  "status": "available_places",
  "website": "",
  "contact": {
  "name": "Agnès",
  "emails": [
      "email1@gmail.com",
      "email2@sauvegarde-paris.fr"
  ],
  "phones": [
      "01 XX XX XX XX",
      "07 XX XX XX XX"
  ]
  },
      "place": {
      "name": "Sauvegarde Paris",
      "address": "3 rue Coq Héron, 75001 PARIS",
      "delivery_time": "Mardi 17H-19H"
  },
  "comment": "Venez nombreux"
}

Concurrent scraping

You can also run the scraper concurrently, on multiple area codes:

import asyncio
from amap_collector.core.collector import collect

results = asyncio.run(collect(["75", "19", "80"], max_concurrent=4))

Or inside an async function:

from amap_collector.core.collector import collect

async def main():
    results = await collect(["75", "19", "80"], max_concurrent=4)

Note: max_concurrent has a default equal to 8.

Cli usage

The cli is built with Typer, you can explore command parameters and options via the --help modifier:

area_code A valid french department code or zip code; the latter (zip code) is applicable to the IDF scraper only

--farms-only Collect only farm information (applicable to HN and IA44 scrapers only)

--km-radius The Search radius in km (available options: 2, 5, 10, 15, 20) [default: 2]; this options is only valid for Île-de-France

--output-file The otput file path (.json or .csv)

--help It summarizes all these parameters and exit

Note 1: Results are shown by default in the standard output, unless a file path is specified via the --output-file modifier (JSON and CSV formats are only supported).

Note 2: The executable command name exposed after package installation is amap_collector.

Docker containerization

Build image

docker build -t amap-collector .

Run the command

Pass any amap_collector option directly after the image name:

# stdout — all AMAPs in dept 93 within 5 km
docker run --rm amap-collector 93 --km-radius 5

# save to a JSON file on the host
docker run --rm -v /tmp:/out amap-collector 75 --output-file /out/amaps.json

# narrow collection by zip code
docker run --rm amap-collector 75012

# collection of farms only, without their AMAPS
docker run --rm amap-collector 76 --farms-only

# concurrent collection with multiple departments
docker run --rm amap-collector 75,18,27

Run tests

Tests require the dev dependencies. Run them with:

uv run pytest

To run a specific test file:

uv run pytest tests/core/test_parser.py -v

Run the linter

Using ruff for linting:

docker run --rm amap-collector uv run --with ruff ruff check .

Run the type checker

Using ty:

uv run ty check .

Contributing

Contributions are welcome — feel free to open issues and pull requests for bug fixes and feature requests.

After cloning the repo, activate the git hooks to run type checks locally before each commit:

git config core.hooksPath .githooks

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amap_collector-0.3.1.tar.gz (48.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

amap_collector-0.3.1-py3-none-any.whl (28.0 kB view details)

Uploaded Python 3

File details

Details for the file amap_collector-0.3.1.tar.gz.

File metadata

  • Download URL: amap_collector-0.3.1.tar.gz
  • Upload date:
  • Size: 48.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for amap_collector-0.3.1.tar.gz
Algorithm Hash digest
SHA256 6e434fb4ae5de6d793aea265435ebf913e6358b78f43b884287212934d9fb4be
MD5 af9ebf086841c4ae6c09f4d4d6b9aac0
BLAKE2b-256 eb18a12d05c68e51da31c101108e107984ee95adfe39976fc451327b6babea93

See more details on using hashes here.

File details

Details for the file amap_collector-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: amap_collector-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 28.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for amap_collector-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4a434c51e46109987e4c3488ef557d4490d8df257448c6f1fd87b2c11205da6e
MD5 bf4379c6f5819bb0c5a17c23367fad9f
BLAKE2b-256 cd5314f82174b57e034b01371a0c0e61035e15cb61f79bcaeae36928ca80896b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page