Python client for refget

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Project description

Refget

Run pytests

The refget package provides a Python interface to both remote and local use of the refget protocol.

This package provides clients and functions for both refget sequences and refget sequence collections (seqcol).

Documentation is hosted at refgenie.org/refget.

Testing

Local unit tests of refget package

pytest to test refget package, local unit tests

Compliance testing

Under /test_api are compliance tests for a service implementing the sequence collections API. This will test your collection and comparison endpoints to make sure the comparison function is working.

pytest test_api to tests API compliance
pytest test_api --api_root http://127.0.0.1:8100 to customize the API root URL to test

Load the fasta files from the test_fasta folder into your API database.
Run pytest test_api --api_root <API_URL>, pointing to your URL to test

For example, this will test my remote server instance:

pytest test_api --api_root https://seqcolapi.databio.org

Loading up data into an instance

Starting a demo instance

Use docker to create a local postgres database like this:

source deployment/local_demo/local_demo.env 
docker run --rm --name refget-postgres -p 127.0.0.1:5432:5432 \
  -e POSTGRES_PASSWORD \
  -e POSTGRES_USER \
  -e POSTGRES_DB \
  -e POSTGRES_HOST \
  postgres:16.3

Loading files

If you need to load, then you have to install either gc_count (fast) or pyfaidx (slow).

You can load them like:

python load_demo_data.py

Or:

refget add-fasta path/to/fasta.fa

For pangenome:

python load_pangenome_reference.py ../seqcolapi/analysis/data/demo.csv test_fasta

Adding data to production service

Just first source the production credentials, then load the data:

source deployment/seqcolapi.databio.org/production.env
python load_demo_data.py

seqcolapi

This repository contains:

Sequence collections API software (the seqcolapi package). This package is based on the refget package. It simply provides an wrapper to implement the Sequence Collections API.
Configuration and GitHub Actions for demo server instance (deployment subfolder).

Instructions

Run locally for development

First, configure env vars:

To run a local server with a local database:source deployment/localhost/dev_local.env
To run a local server with the production database:source deployment/seqcolapi.databio.org/production.env

source deployment/local_demo/local_demo.env

Then, run service:

uvicorn seqcolapi.main:app --reload --port 8100

Running with docker

To build the docker file, from the root of this repository:

First you build the general-purpose image

docker build -f deployment/dockerhub/Dockerfile -t databio/seqcolapi seqcolapi

Next you build the wrapped image (this just wraps the config into the app):

docker build -f deployment/seqcolapi.databio.org/Dockerfile -t seqcolapi.databio.org deployment/seqcolapi.databio.org

To run in a container:

source deployment/seqcolapi.databio.org/production.env
docker run --rm -p 8000:80 --name seqcolapi \
  --env "POSTGRES_USER" \
  --env "POSTGRES_DB" \
  --env "POSTGRES_PASSWORD" \
  --env "POSTGRES_HOST" \
  seqcolapi.databio.org

Alternative: Mount the config

Instead of building a bundle with the config, you could just mount it into the base image:

docker run --rm -p 8000:8000 --name sccon \
  --env "POSTGRES_PASSWORD" \
  --volume $CODE/seqcolapi.databio.org/config/seqcolapi.yaml:/config.yaml \
  seqcolapi

Deploying container to dockerhub

Use github action in this repo which deploys on release, or through manual dispatch.

To load new data into seqcolapi.databio.org

See instructions in seqcolapi repo.

cd analysis
source ../servers/localhost/dev_local.env
ipython3

Now run load_fasta.py

Deploy to AWS ECS

Test locally first, using 1. native test; 2. local docker test.

Deploying

To upgrade the software:

Use config file located in /servers/seqcolapi.databio.org. This will use the image in docker.io://databio/seqcolapi, github repo: refgenie/seqcolapi as base, bundle it with the above config, and deploy to the shefflab ECS.

Ensure the refget package master branch is as you want it.
Deploy the updated secqolapi app to dockerhub (using manual dispatch, or deploy on github release).
Finally, deploy the instance with manual dispatch using the included GitHub action.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Release history Release notifications | RSS feed

This version

0.5.0

Jul 6, 2024

0.4.0

Jun 26, 2024

0.3.0

Feb 23, 2024

0.2.0

Feb 14, 2024

0.1.0

Jun 17, 2021

0.0.1

Jun 25, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

refget-0.5.0.tar.gz (26.1 kB view hashes)

Uploaded Jul 6, 2024 Source

Built Distribution

refget-0.5.0-py3-none-any.whl (26.2 kB view hashes)

Uploaded Jul 6, 2024 Python 3

Hashes for refget-0.5.0.tar.gz

Hashes for refget-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`82b9490e1fb04d84cd8bb00ceed22b371ca57c3ac53fd25c057c5bb18e40521b`
MD5	`ddb009402f40a4e94073c95654addfb7`
BLAKE2b-256	`6d497d9439f246ea8db1e222ca2979dfb008fd3cbedb177ec48c100ba8b9d7fd`

Hashes for refget-0.5.0-py3-none-any.whl

Hashes for refget-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`758da506566c91847195d88f43ee46a44c586fc6bd16fd9eefc6c242163adf8b`
MD5	`05cb53b753b7ae00de00ce11004dda9c`
BLAKE2b-256	`f4e1e169b604319a30c6e05305f298d07a4fcf0133ea496de3f0c056845c1d79`