Skip to main content

Provides methods and classes that can be used when interacting with the ArchivesSpace API.

Project description

ArchivesSpace Python Client

PyPI version

The aspace-client Python package provides web client functionality that targets the API of ArchivesSpace v2.X and up. This package was developed to aid ongoing and future ArchivesSpace migrations.

About

The aspace module extends the functionality of the Session class from the requests Python library, and attempts to provide access to all of ArchivesSpace's REST API endpoints, while also preserving backwards compatibility and supporting development tools such as Pylint. ArchivesSpace currently (May 2019) has over 250 API endpoints. Supporting and maintaining all of those endpoints is quite an undertaking, so new features and bug-fixes are prioritized based on their value to performing ongoing ArchivesSpace migrations.

Installation

This project has a listing on PyPI!

pip install aspace-client

You can also install this project directly from the GitHub repository:

pip install https://github.com/AtlasSystems/ArchivesSpace-Python-Client/zipball/master

Developer Installation

Below are instructions for installing this package in "editable" mode. This will allow you to make changes to the package and test them in real time.

AS_CLIENT_DIR="/path/to/aspace-client"
git clone https://github.com/AtlasSystems/ArchivesSpace-Python-Client.git "$AS_CLIENT_DIR"

# In your python project directory, or in your venv
pip install -e "$AS_CLIENT_DIR"

Usage

from aspace.client import ASpaceClient

client = ASpaceClient('http://localhost:8089', 'admin', 'admin')

This will initialize a client that targets the instance of your ArchivesSpace API. This initialization will also automatically authenticate your user. There is an option to turn this off, if you're using this package as part of a larger script, where interacting with ArchivesSpace is not the primary component, and you're in an environment where it is not 100% certain that ArchivesSpace is running and accessible. For that operation:

from time import sleep
from aspace.client import ASpaceClient

client = ASpaceClient(
    base_url='http://localhost:8089', # Base url for connecting to your ASpace's API.
    username='admin',
    password='admin',
    auto_authenticate=False,
)

while client.get('/version').status_code != 200:
    print('ArchivesSpace API is not up')
    sleep(2)

client.authenticate()

There is also built-in support for the operation above, as well as built-in functionality for pulling settings from an instance of ConfigParser.

In settings.ini:

[aspace_credentials]
api_host = 'http://aspace.cloudapp.eastus.azure.com'
username = 'automation-user'
password = 'automation-user-password'

In app.py:

import configparser
import aspace

config = configparser.ConfigParser()
config.load('settings.ini')

client = (
    aspace.client.ASpaceClient
    .init_from_config(config)
    .wait_until_ready(
        check_interval=2.0,
        max_wait_time=200.0,
        authenticate_on_success=True,
    )
)

Failed authentications raise an error, so if any of these scripts are still running, you're ready to query the API! This package interacts with the ArchivesSpace API using the following considerations.

  1. the syntax described by the requests Python library that we all love and
  2. the API endpoint structure described by the docs for the ArchivesSpace API

The typical syntax of the requests Python library is preserved, so all HTTP methods (POST, GET, DELETE, etc.) typically start with a URI or an endpoint, relative to the base URL of the API. The URI is never assumed to make sure that all operations are predictable, and that all of the functionality of the API is utilized correctly.

Get ArchivesSpace System Info

# Get the system info
print(client.get('/').json())

Manage Repositories

# Get a listing of all repositories
repositories = client.get('/repositories').json()
for repository in repositories:
    print(repository)

# Create a new repository
new_repo = {}
new_repo['repo_code'] = 'test_repo'
new_repo['name'] = 'Test Repository'
response = client.post('/repositories', json=new_repo).json()

# Update the name of that repository
repo = client.get(response['uri']).json()
repo['name'] = 'Andy Samberg University Archives - Test Repository'
client.post(repo['uri'], json=repo)

# Delete the repository
client.delete(new_repo['uri'])

This syntax can be used to interact with all of ArchivesSpace's endpoints, as long as the response comes back as JSON. Most do. There are also some extensions to ArchivesSpace's API functionality that are currently provided.

Streaming Records

# Cleanup your resource records one at a time, no matter how many you have
for resource in client.streams.resources():
    if resource['title'].endswith('!'):
        print('Cleaning Resource:', resource['uri'], resource['title'])
        resource['title'] = resource['title'].rstrip('!')
        update_result = client.post(resource['uri'], json=resource).json()
        print(update_result)

# Works for accessions and agents
client.streams.accessions()
client.streams.people()
client.streams.corporate_entities()
client.streams.families()
client.streams.software()
client.streams.all_agents()


# Works for endpoints that do not have an explicitly defined stream method
client.streams.records('container_profiles'):
    pass

# Works for endpoints that do not have an explicitly defined stream method
# and require a repository reference in the URI.
for assessment in client.streams.repository_relative_records('assessments'):
    pass

# Optional limits can be placed on record streams, so that only 1 repository
# is considered, as opposed to streaming all records from all repositories,
# which is default.
assessments_stream = client.streams.repository_relative_records(
    'assessments',
    repository_uris=['/repositories/2']
)

for assessment in assessments_stream:
    pass

Contributing

If you have any suggestions or bug reports please feel free to report them in the issues tab or email us at devgineers@atlas-sys.com. Feel free to email us if you are new to Git, but would still like to contribute.

Pull requests are welcome, but they will subject to a review process. Consistent code style is a goal for this project, as it currently attempts to follow the coding standards layed out in the PEP8 Python style guide. Please keep this in mind when submitting or requesting contributions, but also keep in mind that PEP is a flexible standard and that we are willing to make exceptions.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aspace-client-2.5.0.tar.gz (25.1 kB view details)

Uploaded Source

Built Distribution

aspace_client-2.5.0-py3-none-any.whl (27.0 kB view details)

Uploaded Python 3

File details

Details for the file aspace-client-2.5.0.tar.gz.

File metadata

  • Download URL: aspace-client-2.5.0.tar.gz
  • Upload date:
  • Size: 25.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.38.0 CPython/3.8.0

File hashes

Hashes for aspace-client-2.5.0.tar.gz
Algorithm Hash digest
SHA256 7a87769e0577beb431637e832956f39c15e3336628476b6905313b37d8af83ef
MD5 eab3b7e02fb3bb965e4581fca88b96fa
BLAKE2b-256 e186c4136ad1af049e7b9efd910b382842f6a5c67e5b403c79724695698a5029

See more details on using hashes here.

File details

Details for the file aspace_client-2.5.0-py3-none-any.whl.

File metadata

  • Download URL: aspace_client-2.5.0-py3-none-any.whl
  • Upload date:
  • Size: 27.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.38.0 CPython/3.8.0

File hashes

Hashes for aspace_client-2.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 30110abbd400ade5ecb77e8656e2abf323cf140a045916916a573667a0131eb7
MD5 fce0193caee8fd9ee9fe586ba6a876a6
BLAKE2b-256 44e939eb9e863d139835f5c6680e6fad66cf7e63080ad75dc323a7aa88f73a73

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page