Skip to main content

A package containing Python utilities for use across NYPL

Project description

PythonUtils

This package contains common Python utility classes and functions.

Classes

  • Pushing records to Kinesis
  • Setting and retrieving a resource in S3
  • Decrypting values with KMS
  • Encoding and decoding records using a given Avro schema
  • Retrieving secrets from AWS Secrets Manager
  • Downloading files from a remote SSH SFTP server
  • Connecting to and querying a MySQL database
  • Connecting to and querying a PostgreSQL database
  • Connecting to and querying Redshift
  • Making requests to the Oauth2 authenticated APIs such as NYPL Platform API and Sierra
  • Interacting with vendor APIs such as cloudLibrary

Functions

  • Reading a YAML config file and putting the contents in os.environ -- see config/sample.yaml for an example of how the config file should be formatted
  • Creating a logger in the appropriate format
  • Obfuscating a value using bcrypt
  • Parsing/building Research Catalog identifiers
  • Mapping between barcodes and Sierra patron ids plus getting patron data from Sierra and Redshift using those ids or record_nums

Usage

# test_file.py
from nypl_py_utils.classes.kinesis_client import KinesisClient
from nypl_py_utils.functions.config_helper import load_env_file

load_env_file(...)
kinesis_client = KinesisClient(...)
# requirements.txt

# Do not use any version below 1.0.0
# All available optional dependencies can be found in pyproject.toml.
# See the "Managing dependencies" section below for more details.
nypl-py-utils[kinesis-client,config-helper]==1.x.y

Developing locally

In order to use the local version of the package instead of the global version, use a virtual environment. To set up a virtual environment and install all the necessary dependencies, run:

python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install .
pip install '.[development]'
deactivate && source .venv/bin/activate

Managing dependencies

In order to prevent dependency bloat, this package has no required dependencies. Instead, each class and helper file has its own optional dependency set. For instance, if an app needs to use the KMS client and the obfuscation helper, it should add nypl-py-utils[kms-client, obfuscation-helper] to the app's requirements. This way, only the required dependencies are installed.

When a new client or helper file is created, a new optional dependency set should be added to pyproject.toml. The development dependency set, which includes all the dependencies required by all of the classes and tests, should also be updated.

The optional dependency sets also give the developer the option to manually list out the dependencies of the clients rather than relying upon what the package thinks is required, which can be beneficial in certain circumstances. For instance, AWS lambda functions come with boto3 and botocore pre-installed, so it's not necessary to include these (rather hefty) dependencies in the lambda deployment package.

Troubleshooting

Using PostgreSQLClient in an AWS Lambda

Because psycopg requires a statically linked version of the libpq library, the PostgreSQLClient cannot be installed as-is in an AWS Lambda function. Instead, it must be packaged as follows:

pip install --target ./package nypl-py-utils[postgresql-client]==1.x.y

pip install \
    --platform manylinux2014_x86_64 \
    --target=./package \
    --implementation cp \
    --python 3.9 \
    --only-binary=:all: --upgrade \
    'psycopg[binary]'

Using PostgreSQLClient locally

If using the PostgreSQLClient produces the following error locally:

ImportError: no pq wrapper available.
Attempts made:
- couldn't import psycopg 'c' implementation: No module named 'psycopg_c'
- couldn't import psycopg 'binary' implementation: No module named 'psycopg_binary'
- couldn't import psycopg 'python' implementation: dlsym(0x7f8620446f40, PQsslInUse): symbol not found

then try running:

pip uninstall psycopg
pip install "psycopg[c]"

Git workflow

This repo uses the Main-QA-Production git workflow.

main has the latest and greatest commits, qa has what's in our QA environment, and production has what's in our production environment.

Ideal Workflow

  • Cut a feature branch off of main
  • Commit changes to your feature branch
  • File a pull request against main and assign a reviewer (who must be an owner)
    • Include relevant updates to pyproject.toml and README
      • If you're planning to cut a release, remember to update project version in pyproject.toml!
    • In order for the PR to be accepted, it must pass all unit tests, have no lint issues, and update the CHANGELOG (or contain the Skip-Changelog label in GitHub)
  • After the PR is accepted, merge into main
  • Merge main > qa
  • Deploy app to QA on GitHub and confirm it works
  • Merge qa > production
  • Deploy app to production on GitHub and confirm it works

Deployment

The utils repo is deployed as a PyPI package here and as a Test PyPI package for QA purposes here. In order to be deployed, the version listed in pyproject.toml must be updated. To deploy to Test PyPI, create a new release in GitHub and tag it qa-vX.X.X. The GitHub Actions deploy-qa workflow will then build and publish the package. To deploy to production PyPI, create a release and tag it production-vX.X.X.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nypl_py_utils-1.6.4.tar.gz (34.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nypl_py_utils-1.6.4-py3-none-any.whl (27.1 kB view details)

Uploaded Python 3

File details

Details for the file nypl_py_utils-1.6.4.tar.gz.

File metadata

  • Download URL: nypl_py_utils-1.6.4.tar.gz
  • Upload date:
  • Size: 34.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for nypl_py_utils-1.6.4.tar.gz
Algorithm Hash digest
SHA256 e51d100746929d9be490ee02fe8441848544a0a9d34dee72de70011ba3d38082
MD5 9d83b2d5226cf6b9394cacc0ba9d1b40
BLAKE2b-256 93ba969cc380e2d40861f8b7fed708629ee55783b29f3e4cf1551d3ec5563774

See more details on using hashes here.

Provenance

The following attestation bundles were made for nypl_py_utils-1.6.4.tar.gz:

Publisher: deploy-production.yml on NYPL/python-utils

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nypl_py_utils-1.6.4-py3-none-any.whl.

File metadata

  • Download URL: nypl_py_utils-1.6.4-py3-none-any.whl
  • Upload date:
  • Size: 27.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for nypl_py_utils-1.6.4-py3-none-any.whl
Algorithm Hash digest
SHA256 f101661f7da6428e62be3da3283c3a1d833c7569c2847f5113c57f531abd3136
MD5 93b8f1162d052c5cf16ec51ca9c46716
BLAKE2b-256 624a10bf13e9daecec70c1279d1c6fdc4cfc002884c6a592e9aa6aa0aaa1a715

See more details on using hashes here.

Provenance

The following attestation bundles were made for nypl_py_utils-1.6.4-py3-none-any.whl:

Publisher: deploy-production.yml on NYPL/python-utils

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page