Skip to main content

A python program for downloading images from CBS historisch.

Project description

cbs_hist_downloader

A Python-based tool using Selenium to download books to your harddrive rather than accessing the pictures through the API at historisch.cbs.nl

Dependencies

Selenium (pip install selenium), time, os, webdriver-manager (pip install webdriver-manager)

Example how to use:

In the console:

pip install cbs_hist_downloader

After installation, in a python environment, you start off with the first URL of the book/volume you want to download:

import cbs_hist_downloader as chd

url = "https://historisch.cbs.nl/detail.php?nav_id=2-1&index=10&id=395291474"

chd.scrape_book(url)

The files will be downloaded to your Downloads folder.

Suggestions / Comments

a dot h dot machielsen at uu dot nl

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cbs-hist-downloader-1.0.2.tar.gz (3.9 kB view hashes)

Uploaded Source

Built Distribution

cbs_hist_downloader-1.0.2-py3-none-any.whl (4.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page