A python program for downloading images from CBS historisch.
Project description
cbs_hist_downloader
A Python-based tool using Selenium to download books to your harddrive rather than accessing the pictures through the API at historisch.cbs.nl
Dependencies
Selenium (pip install selenium
), time, os, webdriver-manager (pip install webdriver-manager
)
Example how to use:
In the console:
pip install cbs_hist_downloader
After installation, in a python environment, you start off with the first URL of the book/volume you want to download:
import cbs_hist_downloader as chd
url = "https://historisch.cbs.nl/detail.php?nav_id=2-1&index=10&id=395291474"
chd.scrape_book(url)
The files will be downloaded to your Downloads folder.
Suggestions / Comments
a dot h dot machielsen at uu dot nl
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cbs-hist-downloader-1.0.2.tar.gz
.
File metadata
- Download URL: cbs-hist-downloader-1.0.2.tar.gz
- Upload date:
- Size: 3.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 18fa066c937a8061d0f0777fdda3c5890a5d8d81577752572782cc07ea24cf37 |
|
MD5 | fb0b4c7421513b7f8f993087dc0bdcd4 |
|
BLAKE2b-256 | 89f2653a8a33c2264adf0af138335afb1d608e33e49671e7140d247049dd0a8c |
File details
Details for the file cbs_hist_downloader-1.0.2-py3-none-any.whl
.
File metadata
- Download URL: cbs_hist_downloader-1.0.2-py3-none-any.whl
- Upload date:
- Size: 4.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7006dc933ebd37baa6e1020d3c0b01130ff91dbd46a83e555d80395b2d2ff2a8 |
|
MD5 | 6c3ea9841957fa7fd6361341980247c3 |
|
BLAKE2b-256 | 602b8d4d83ff102029bc4247f16c71d726ad016a0f29ef3f87eaa6d6a39bd488 |