Download data from the Portale Antenati, the Italian state genealogy archive
antenati
A tool to download data from the Portale Antenati, the digital genealogy archive maintained by the Italian Ministero per i beni e le attività culturali.
Because the website tends to be slow in the evening, this script helps you retrieve the documents for your family tree. It downloads all the images of an archive in a single run, with no manual intervention. Just launch it and have a coffee while it downloads everything for you.
GUI version
Just get the executable from the release artifacts, and have fun!
Example:
On the website, navigate to the archive you want to download. For example, for people born in Viareggio in 1807 you should find the page:
https://antenati.cultura.gov.it/ark:/12657/an_ua19944535/w9DWR8x
Copy the link to the first page and paste it into the Archive URL field of the window. Then specify a destination folder: the results will be placed there, in a new subfolder named archivio-di-stato-di-lucca-stato-civile-napoleonico-viareggio-1807-nati-19944549.
CLI version
Requirements
The software is written in Python 3 and requires Python 3.10 or newer. On Windows, the version from the Microsoft Store is fine; on Linux, use your distribution's package manager.
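To confirm that an interpreter meets the minimum version before installing, a check along these lines can help. This is only a minimal sketch: the helper name `meets_minimum` is made up for this example and is not part of antenati.

```python
import sys

# Minimum version stated in the requirements above.
MINIMUM = (3, 10)


def meets_minimum(version=None, minimum=MINIMUM):
    """Return True if `version` (default: the running interpreter) is new enough.

    Hypothetical helper for illustration; not provided by antenati.
    """
    if version is None:
        version = sys.version_info
    # Compare only (major, minor); the patch level does not matter here.
    return tuple(version[:2]) >= minimum


if __name__ == "__main__":
    if meets_minimum():
        print("Python version is recent enough")
    else:
        print("Python 3.10 or newer is required")
```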
Install
From a checkout of this repository:
pip install .
This installs an antenati command on your PATH (and an antenati-gui command for the GUI).
Run
To download the images of a gallery, pass the URL of the gallery page:
antenati <URL of the album>
You can also invoke the package directly without installing the script:
python3 -m antenati <URL of the album>
The files will be downloaded to a new folder named after the downloaded archive, in the form ARCHIVE-PLACE-YEAR-TYPE-ID. For more options, see the help:
antenati -h
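If you need several archives, the module invocation shown above can be driven from a small Python script. The sketch below is an illustration, not part of the tool: the function names are hypothetical, it assumes antenati is installed for the same interpreter that runs the script, and it assumes the command exits with a non-zero status when a download fails.

```python
import subprocess
import sys


def build_command(url):
    """Build the equivalent of `python3 -m antenati <URL>` for this interpreter."""
    return [sys.executable, "-m", "antenati", url]


def download_all(urls):
    """Run the downloader once per URL and return the URLs that failed.

    Assumption: a failed download is reported through a non-zero exit status.
    """
    failed = []
    for url in urls:
        result = subprocess.run(build_command(url))
        if result.returncode != 0:
            failed.append(url)
    return failed
```

Calling `download_all([...])` with a list of gallery URLs processes them one after another, so a single failure does not stop the remaining downloads.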
Example:
On the website, navigate to the archive you want to download. For example, for people born in Viareggio in 1807 you should find the page:
https://antenati.cultura.gov.it/ark:/12657/an_ua19944535/w9DWR8x
Then copy the link to the first page and call the script with that link as its argument:
antenati https://antenati.cultura.gov.it/ark:/12657/an_ua19944535/w9DWR8x
The results will be placed in a folder named archivio-di-stato-di-lucca-stato-civile-napoleonico-viareggio-1807-nati-19944549.
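A folder name of this shape can be produced by lower-casing the archive metadata and joining the pieces with hyphens. The sketch below only illustrates that idea: the functions `slugify` and `folder_name`, and the metadata values passed to them, are assumptions for this example and may not match how antenati builds the name internally.

```python
import re
import unicodedata


def slugify(text):
    """Lower-case `text`, drop accents, and replace other characters with '-'."""
    ascii_text = (
        unicodedata.normalize("NFKD", text)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    return re.sub(r"[^a-z0-9]+", "-", ascii_text.lower()).strip("-")


def folder_name(parts):
    """Join the slugified, non-empty parts with hyphens."""
    return "-".join(slug for slug in map(slugify, parts) if slug)


if __name__ == "__main__":
    # Hypothetical metadata values chosen to reproduce the example above.
    parts = [
        "Archivio di Stato di Lucca",
        "Stato civile napoleonico",
        "Viareggio",
        "1807",
        "Nati",
        "19944549",
    ]
    print(folder_name(parts))
```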
To include the archive and image IDs in the saved file names (e.g. pag-1+an_ua19944535+w9DWR8x.jpg instead of pag-1.jpg), add the -d/--descriptive-names flag.
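The two naming schemes can be sketched as follows. This is an illustration under stated assumptions, not the tool's code: `split_ark_url` and `file_name` are hypothetical helpers, and the sketch assumes the archive and image IDs are the last two path segments of the gallery URL, as they are in the example above.

```python
from urllib.parse import urlparse


def split_ark_url(url):
    """Return (archive_id, image_id) from a gallery URL shaped like the example.

    Expected path layout: /ark:/<number>/<archive_id>/<image_id>
    """
    segments = [s for s in urlparse(url).path.split("/") if s]
    if len(segments) < 4 or segments[0] != "ark:":
        raise ValueError(f"not a recognised gallery URL: {url}")
    return segments[-2], segments[-1]


def file_name(label, archive_id=None, image_id=None, ext="jpg"):
    """Build a plain name, or a descriptive one when both IDs are given."""
    if archive_id and image_id:
        return f"{label}+{archive_id}+{image_id}.{ext}"
    return f"{label}.{ext}"


if __name__ == "__main__":
    url = "https://antenati.cultura.gov.it/ark:/12657/an_ua19944535/w9DWR8x"
    archive_id, image_id = split_ark_url(url)
    print(file_name("pag-1"))
    print(file_name("pag-1", archive_id, image_id))
```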
AWS WAF challenge
Outside Italy, the Portale Antenati gallery pages are often protected by an AWS WAF challenge that this tool cannot solve, so the download fails with an "AWS WAF challenge cannot be bypassed" error (see #25). The IIIF manifest and the images themselves are not behind the WAF, so you can work around it:
- open the gallery page in your browser;
- copy the IIIF manifest link at the bottom of the left side panel (it looks like https://dam-antenati.cultura.gov.it/antenati/containers/.../manifest);
- pass that URL to the tool (both CLI and GUI) instead of the gallery page URL.
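To see why the manifest alone is enough, it helps to look at what a IIIF manifest contains: a list of canvases (pages), each pointing to its image. The sketch below walks a manifest laid out as in the IIIF Presentation API 2.x; that the portal serves this version is an assumption here, and the sample data and the function name `image_urls` are made up for illustration.

```python
def image_urls(manifest):
    """Return (page label, image URL) pairs from a IIIF Presentation 2.x manifest."""
    pages = []
    for sequence in manifest.get("sequences", []):
        for canvas in sequence.get("canvases", []):
            for image in canvas.get("images", []):
                pages.append((canvas.get("label", ""), image["resource"]["@id"]))
    return pages


# Made-up sample with the same layout; the URLs are placeholders, not real images.
SAMPLE_MANIFEST = {
    "@type": "sc:Manifest",
    "sequences": [
        {
            "canvases": [
                {
                    "label": "pag. 1",
                    "images": [
                        {"resource": {"@id": "https://example.org/iiif/img-1/full/full/0/default.jpg"}}
                    ],
                },
                {
                    "label": "pag. 2",
                    "images": [
                        {"resource": {"@id": "https://example.org/iiif/img-2/full/full/0/default.jpg"}}
                    ],
                },
            ]
        }
    ],
}

if __name__ == "__main__":
    for label, url in image_urls(SAMPLE_MANIFEST):
        print(label, url)
```

With a real manifest saved to disk, the same function can be applied to the result of `json.load`.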