A package containing Python utilities for use across NYPL
Project description
PythonUtils
This package contains common Python utility classes and functions.
Classes
- Pushing records to Kinesis
- Setting and retrieving a resource in S3
- Decrypting values with KMS
- Encoding and decoding records using a given Avro schema
- Connecting to and querying a MySQL database
- Connecting to and querying a PostgreSQL database
- Connecting to and querying a PostgreSQL database using a connection pool
- Connecting to and querying Redshift
- Making requests to the Oauth2 authenticated APIs such as NYPL Platform API and Sierra
Functions
- Reading a YAML config file and putting the contents in os.environ
- Creating a logger in the appropriate format
- Obfuscating a value using bcrypt
- Parsing/building Research Catalog identifiers
Usage
# test_file.py
from nypl_py_utils.classes.kinesis_client import KinesisClient
from nypl_py_utils.functions.config_helper import load_env_file
load_env_file(...)
kinesis_client = KinesisClient(...)
# requirements.txt
# Do not use any version below 1.0.0
# All available optional dependencies can be found in pyproject.toml.
# See the "Managing dependencies" section below for more details.
nypl-py-utils[kinesis-client,config-helper]==1.0.4
Developing locally
In order to use the local version of the package instead of the global version, use a virtual environment. To set up a virtual environment and install all the necessary dependencies, run:
python3 -m venv testenv
source testenv/bin/activate
pip install --upgrade pip
pip install .
pip install '.[development]'
deactivate && source testenv/bin/activate
Managing dependencies
In order to prevent dependency bloat, this package has no required dependencies. Instead, each class and helper file has its own optional dependency set. For instance, if an app needs to use the KMS client and the obfuscation helper, it should add nypl-py-utils[kms-client, obfuscation-helper]
to the app's requirements. This way, only the required dependencies are installed.
When a new client or helper file is created, a new optional dependency set should be added to pyproject.toml
. The development
dependency set, which includes all the dependencies required by all of the classes and tests, should also be updated.
The optional dependency sets also give the developer the option to manually list out the dependencies of the clients rather than relying upon what the package thinks is required, which can be beneficial in certain circumstances. For instance, AWS lambda functions come with boto3
and botocore
pre-installed, so it's not necessary to include these (rather hefty) dependencies in the lambda deployment package.
Troubleshooting
Using PostgreSQLClient in an AWS Lambda
Because psycopg
requires a statically linked version of the libpq
library, the PostgreSQLClient
cannot be installed as-is in an AWS Lambda function. Instead, it must be packaged as follows:
pip install --target ./package nypl-py-utils[postgresql-client]==1.0.1
pip install \
--platform manylinux2014_x86_64 \
--target=./package \
--implementation cp \
--python 3.9 \
--only-binary=:all: --upgrade \
'psycopg[binary]'
Using PostgreSQLClient locally
If using the PostgreSQLClient
produces the following error locally:
ImportError: no pq wrapper available.
Attempts made:
- couldn't import psycopg 'c' implementation: No module named 'psycopg_c'
- couldn't import psycopg 'binary' implementation: No module named 'psycopg_binary'
- couldn't import psycopg 'python' implementation: dlsym(0x7f8620446f40, PQsslInUse): symbol not found
then try running:
pip uninstall psycopg
pip install "psycopg[c]"
Git workflow
This repo uses the Main-QA-Production git workflow.
main
has the latest and greatest commits, qa
has what's in our QA environment, and production
has what's in our production environment.
Ideal Workflow
- Cut a feature branch off of
main
- Commit changes to your feature branch
- File a pull request against
main
and assign a reviewer (who must be an owner)- Include relevant updates to pyproject.toml and README
- In order for the PR to be accepted, it must pass all unit tests, have no lint issues, and update the CHANGELOG (or contain the
Skip-Changelog
label in GitHub)
- After the PR is accepted, merge into
main
- Merge
main
>qa
- Deploy app to QA on GitHub and confirm it works
- Merge
qa
>production
- Deploy app to production on GitHub and confirm it works
Deployment
The utils repo is deployed as a PyPI package here and as a Test PyPI package for QA purposes here. In order to be deployed, the version listed in pyproject.toml
must be updated. To deploy to Test PyPI, create a new release in GitHub and tag it qa-vX.X.X
. The GitHub Actions deploy-qa workflow will then build and publish the package. To deploy to production PyPI, create a release and tag it production-vX.X.X
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for nypl_py_utils-1.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a3c7dafdfc058f8f48b3f3e0f839d1c94fdb4c87a6572f728f58577e56ac1288 |
|
MD5 | 112890be9a3eb272594ace8fbd2d3bd3 |
|
BLAKE2b-256 | 2c6f5c659d9ad88bd8a1351c0f782b7fff55345fee64fb24f907c1602e9b79f5 |