Skip to main content

A database builder for digital preservation information.

Project description

Preservation Status Database Builder

Returns the preservation status of a Crossref DOI matched against mainstream digital preservation platforms.

license activity Code style: black

Django Git GitHub Linux Python

This application allows you to build a database of digital preservation sources and then to match a DOI against common digital preservation systems.

Installation

The easiest install is via pip:

pip install preservation-database

Then add "preservationdatabase" (no hyphen) to your list of INSTALLED_APPS.

Usage

export DJANGO_SETTINGS_MODULE=import_settings.settings

Usage: python -m preservationdatabase.cli [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
    clear-cache                    Clear the import cache
    import-all                     Download and import all data (excluding...
    import-cariniana               Download and import data from Cariniana
    import-clockss                 Download and import data from CLOCKSS
    import-hathi                   Import data from Hathi (requires local...
    import-internet-archive        Import data from Internet Archive...
    import-internet-archive-items  Import item data from Internet Archive
    import-issnl                   Import ISSN-L mappings
    import-lockss                  Download and import data from LOCKSS
    import-ocul                    Import data from Ocul (requires local...
    import-pkp                     Download and import data from PKP's...
    import-portico                 Download and import data from Portico
    random-samples                 Return random samples that occur in and...
    show-archives                  Clear the import cache
    show-cache                     Show last fill date/times and cache status
    show-issn                      Show preservation items that match an ISSN
    show-preservation              Determine whether a DOI is preserved
    stamp-cache-today              Mark the latest imports as today

Features

  • Cariniana import.
  • CLOCKSS import.
  • HathiTrust import.
  • Internet Archive import.
  • Internet Archive item-level import.
  • LOCKSS import.
  • PKP PLN import.
  • Portico import.
  • Crossref DOI lookup.

First-Run Setup

First, copy example_settings.py to settings.py and check settings.py to ensure that the database you want to use is set correctly. The default is db.sqlite. You should carefully read and check all of settings.py.

DATABASES = {
    'default': {
        'ENGINE': 'django.db.backends.sqlite3',
        'NAME': BASE_DIR / 'db.sqlite3',
    }
}

Next, run the database build commands:

python3 manage.py makemigrations
python3 manage.py makemigrations preservation-database
python3 manage.py migrate 

You should then have a working database into which you can import new preservation data.

Archive Notes

Internet Archive

The Internet Archive gives a KBART file for the Keepers Registry that we use as a primary ingest source: https://archive.org/details/ia-keepers-registry-kbart. However, this source is not the total coverage of the Internet Archive. However, sadly, the Internet Archive snapshots do not contain external identifiers and the container-level snapshots do not present coverage extent. While it is possible to download the entire 217GB FATCAT database snapshot, this will not be viable for many users. We have therefore stuck with the KBART file that Keepers uses. Extent of coverage in the Internet Archive may, therefore, be under-reported.

Credits

© Crossref 2023

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

preservation-database-0.0.74.tar.gz (26.8 kB view details)

Uploaded Source

Built Distribution

preservation_database-0.0.74-py3-none-any.whl (29.5 kB view details)

Uploaded Python 3

File details

Details for the file preservation-database-0.0.74.tar.gz.

File metadata

File hashes

Hashes for preservation-database-0.0.74.tar.gz
Algorithm Hash digest
SHA256 d3d26685d96d06c6f8f7f6ed504ed421a15205ca19c0865761ae72cd060b08e4
MD5 a4a94933f15d7b933a6378578146329c
BLAKE2b-256 958c816e6c73d107a231e66f3dc4a614708b56d56b64c05381816c4a83967fb5

See more details on using hashes here.

File details

Details for the file preservation_database-0.0.74-py3-none-any.whl.

File metadata

File hashes

Hashes for preservation_database-0.0.74-py3-none-any.whl
Algorithm Hash digest
SHA256 bb30907d3d1d00b60042e1df21bcbb7b8e3406773147f2255f273b73a3533483
MD5 a88f0675560faaafecd5f3dac67a8a32
BLAKE2b-256 71c829eca982a51046d4737ea99325b99f3b4bcdf5603a05875cc2b9ee7cf97c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page