SOCH Download CLI
SOCH Download CLI lets you do multithreaded batch downloads of Swedish Open Cultural Heritage (K-samsök) records for offline processing and analytics.
Prerequisites
- Python >= 3.4 and pip
Installing
pip install soch-download
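To confirm from a script that the install worked, a minimal sketch like the following may help. It assumes pip places a `soch-download` command on your PATH (as the usage examples suggest); the helper name `is_installed` is hypothetical and not part of the package.

```python
import shutil


def is_installed(command="soch-download"):
    """Return True if `command` can be found on the current PATH."""
    # shutil.which returns the full path to the executable, or None.
    return shutil.which(command) is not None


if __name__ == "__main__":
    print("soch-download found:", is_installed())
```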
Usage Examples
Heads up: This program might use all of the system's available CPUs.
Download records based on a SOCH search query (text, CQL, indexes, etc.):
soch-download --action=query --query=thumbnailExists=j --outdir=path/to/target/directory
Download records from a specific institution:
soch-download --action=institution --institution=raa --outdir=path/to/target/directory
Download records using a predefined action/query:
soch-download --action=all --outdir=path/to/target/directory
soch-download --action=geodata-exists --outdir=path/to/target/directory
Unpacking
By default, the download actions save large XML files containing up to 500 RDF records each. After such a download, you can use the --unpack
argument to convert all those files into individual RDF files:
soch-download --unpack=path/to/xml/files --outdir=path/to/target/directory
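To illustrate what unpacking means conceptually, here is a rough Python sketch. It is not the tool's actual implementation. It assumes each downloaded XML file wraps its records in `rdf:RDF` elements using the standard RDF namespace; the function name `split_rdf_records`, the sample data, and the output file naming are all hypothetical.

```python
import os
import tempfile
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDF_TAG = "{%s}RDF" % RDF_NS

# Made-up sample data; the layout of real download files is an assumption.
SAMPLE = (
    '<result xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"><records>'
    '<record><rdf:RDF><rdf:Description rdf:about="urn:example:1"/></rdf:RDF></record>'
    '<record><rdf:RDF><rdf:Description rdf:about="urn:example:2"/></rdf:RDF></record>'
    '</records></result>'
)


def split_rdf_records(xml_text, outdir, prefix="record"):
    """Write every rdf:RDF element found in xml_text to its own file.

    Returns the list of file paths written.
    """
    ET.register_namespace("rdf", RDF_NS)
    root = ET.fromstring(xml_text)
    os.makedirs(outdir, exist_ok=True)
    written = []
    # iter() walks the whole tree, so the nesting depth of rdf:RDF does not matter.
    for index, rdf in enumerate(root.iter(RDF_TAG)):
        path = os.path.join(outdir, "%s-%d.rdf" % (prefix, index))
        ET.ElementTree(rdf).write(path, encoding="utf-8", xml_declaration=True)
        written.append(path)
    return written


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as target:
        paths = split_rdf_records(SAMPLE, target)
        print("wrote %d files" % len(paths))
```

For real downloads, prefer the built-in --unpack argument; the sketch only shows the idea of one output file per RDF record.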
Misc
List all available parameters and actions:
soch-download --help
Target a custom SOCH API endpoint:
soch-download --action=query --query=itemKeyWord=hus --outdir=path/to/target/directory --endpoint=http://lx-ra-ksam2.raa.se:8080/
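If you want to drive these invocations from a Python script, a small wrapper along these lines could work. It is only a sketch: it uses the flags shown in this README (`--action`, `--query`, `--institution`, `--outdir`, `--endpoint`) and assumes `soch-download` is on your PATH. The function names `build_command` and `run_download` are hypothetical and not part of the package.

```python
import subprocess


def build_command(action, outdir, query=None, institution=None, endpoint=None):
    """Assemble a soch-download argument list from the flags shown in this README."""
    args = ["soch-download", "--action=" + action]
    if query is not None:
        args.append("--query=" + query)
    if institution is not None:
        args.append("--institution=" + institution)
    args.append("--outdir=" + outdir)
    if endpoint is not None:
        args.append("--endpoint=" + endpoint)
    return args


def run_download(**kwargs):
    """Run soch-download; raises CalledProcessError if it exits with a non-zero status."""
    # Passing a list (not a string) avoids shell quoting issues in queries.
    return subprocess.check_call(build_command(**kwargs))


if __name__ == "__main__":
    # Print the command instead of running it, so the demo has no side effects.
    print(build_command("query", "path/to/target/directory", query="thumbnailExists=j"))
```

Calling `run_download(action="institution", institution="raa", outdir="path/to/target/directory")` would then start the same download as the institution example above.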