Skip to main content

SOCH Download CLI lets you do multithreaded batch downloads of Swedish Open Cultural Heritage(K-samsök) records for offline processing and analytics.

Project description

SOCH Download CLI

screenshot

SOCH Download CLI lets you do multithreaded batch downloads of Swedish Open Cultural Heritage (K-samsök) records for offline processing and analytics.

Prerequirements

  • Python >=3.4 and PIP

Installing

pip install soch-download

Usage Examples

Heads up: This program might use all the systems available CPUs.

Download records based on a SOCH search query (Text, CQL, indexes, etc):

soch-download --action=query --query=thumbnailExists=j --outdir=path/to/target/directory

Download records from an specific institution:

soch-download --action=institution --institution=raa --outdir=path/to/target/directory

Download records using a predefined action/query:

soch-download --action=all --outdir=path/to/target/directory
soch-download --action=geodata-exists --outdir=path/to/target/directory

Unpacking

The download actions by default downloads large XML files containing up to 1000 RDFs each, after such a download you can use the unpack argument to convert all those files into individual RDF files:

soch-download --unpack=path/to/xml/files --outdir=path/to/target/directory

Misc

List all available parameters and actions:

soch-download --help

Target a custom SOCH API endpoint:

soch-download --action=query --query=itemKeyWord=hus --outdir=path/to/target/directory --endpoint=http://lx-ra-ksam2.raa.se:8080/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soch-download-1.1.0.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

soch_download-1.1.0-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file soch-download-1.1.0.tar.gz.

File metadata

  • Download URL: soch-download-1.1.0.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.0 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.10

File hashes

Hashes for soch-download-1.1.0.tar.gz
Algorithm Hash digest
SHA256 c3673d7e560946def6c4a094f018400c76b9619e122464091821cc8cae3a5da8
MD5 faad322446ae53649bdd032baa0041fb
BLAKE2b-256 d534e6947792e10cd6af60cb3af42b9142d928abe437b242cbdb602644061d29

See more details on using hashes here.

File details

Details for the file soch_download-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: soch_download-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.0 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.10

File hashes

Hashes for soch_download-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3cf1bbef9607c7f0ff65d4cc5bd749dad3af0046ddb031d8e4d206c8adf8de0c
MD5 6e6751a438ca01ee9b083c6d93b67d8a
BLAKE2b-256 f3838740f28d2e12b3b310c537e27a8d80ec4f478cb350b330fc7229b28fdc0f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page