Atgenomix SeqsLab Connector for Python

seqslab-connector

The SeqsLab Connector for Python, based on PyHive, lets you create a Python DB API connection to Atgenomix SeqsLab interactive jobs (clusters) and develop Python-based workflow applications. It is a Hive Thrift-based client with no dependencies on ODBC or JDBC. It also provides a SQLAlchemy dialect and an Apache Superset database engine spec, so that tools built on either can execute DQL (data query language) statements.

You are welcome to file an issue for general use cases. You can also contact Atgenomix Support.

Requirements

Python 3.7 or above is required.

Installation

Install using pip.

pip install seqslab-connector

For Apache Superset integration, install with the superset extra (quoted so that shells such as zsh do not expand the brackets):

pip install "seqslab-connector[superset]"

Usage

DB-API

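from seqslab import hive

# Open a DB API connection to the interactive job (cluster), then get a cursor from it.
connection = hive.connect(database='run_name', http_path='job_run_id', username='user', password='pass', host='job_cluster_host')
cursor = connection.cursor()

# List the tables available in the run.
cursor.execute('SHOW TABLES')
print(cursor.fetchall())

# Preview the first rows of a workflow table.
cursor.execute('SELECT * FROM my_workflow_table_name LIMIT 10')
print(cursor.fetchall())
cursor.close()
connection.close()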
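
Because the connector exposes a Python DB API interface, generic DB API helpers should carry over to it. The sketch below is one way to turn query results into dictionaries keyed by column name, assuming the connection follows PEP 249 (a cursor() method and a populated cursor.description). The helper name fetch_dicts and the columns sample and depth are hypothetical, and an in-memory sqlite3 database stands in for a SeqsLab job so that the example runs without a cluster.

```python
import sqlite3
from contextlib import closing


def fetch_dicts(connection, sql):
    """Run a query and return each row as a {column_name: value} dict.

    Hypothetical helper: relies only on the PEP 249 cursor interface.
    """
    # closing() guarantees cursor.close() is called even if the query fails.
    with closing(connection.cursor()) as cursor:
        cursor.execute(sql)
        # The first item of each description entry is the column name.
        columns = [col[0] for col in cursor.description]
        return [dict(zip(columns, row)) for row in cursor.fetchall()]


# Stand-in for hive.connect(...): an in-memory SQLite database (assumption).
connection = sqlite3.connect(":memory:")
connection.execute(
    "CREATE TABLE my_workflow_table_name (sample TEXT, depth INTEGER)"
)
connection.executemany(
    "INSERT INTO my_workflow_table_name VALUES (?, ?)",
    [("s1", 30), ("s2", 42)],
)

rows = fetch_dicts(
    connection,
    "SELECT * FROM my_workflow_table_name ORDER BY sample LIMIT 10",
)
print(rows)
connection.close()
```

With a real job, the same helper would presumably be called with the connection returned by hive.connect in place of the sqlite3 one.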

SQLAlchemy

from sqlalchemy.engine import create_engine

engine = create_engine('seqslab+hive://user:pass@job_cluster_host/run_name?http_path=job_run_id')
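
Credentials that contain characters such as @ or / need to be percent-encoded before they are placed in a connection URL. The following is a minimal sketch that assembles a URL in the seqslab+hive:// format shown above using only the standard library. The function name build_seqslab_url and the sample password are hypothetical, and the host is assumed to need no escaping.

```python
from urllib.parse import quote, urlencode


def build_seqslab_url(username, password, host, run_name, job_run_id):
    """Assemble a seqslab+hive:// URL, percent-encoding each component.

    Hypothetical helper; the URL layout follows the example above.
    """
    # safe='' also encodes '/', which would otherwise break the URL layout.
    user = quote(username, safe="")
    secret = quote(password, safe="")
    database = quote(run_name, safe="")
    query = urlencode({"http_path": job_run_id})
    return f"seqslab+hive://{user}:{secret}@{host}/{database}?{query}"


url = build_seqslab_url(
    "user", "p@ss/word", "job_cluster_host", "run_name", "job_run_id"
)
print(url)
```

The resulting string can then be passed to create_engine in place of the literal URL.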

Apache Superset

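Add the database in Superset using the same SQLAlchemy URI format as in the SQLAlchemy example. For the steps, see Connecting to Databases in the Apache Superset documentation.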

Documentation

For the latest documentation, see SeqsLab.

Download files

Source Distribution

seqslab-connector-0.9.5.tar.gz (9.9 kB), uploaded as source

Built Distribution

seqslab_connector-0.9.5-py2.py3-none-any.whl (11.5 kB), uploaded for Python 2 and Python 3
