Client for the e6data distributed SQL Engine.

These details have not been verified by PyPI

Project links

Homepage

Project description

e6data Python Connector

version

Introduction

The e6data Connector for Python provides an interface for writing Python applications that can connect to e6data and perform operations.

To install the Python package, use the command below:

pip install e6data-python-connector

Prerequisites

Open Inbound Port 9000 in the Engine Cluster.
Limit access to Port 9000 according to your organizational security policy. Public access is not encouraged.
Generated Access Token in the e6data console.

Creating connection

Use your e6data email id as a username and access token as a password.

import e6xdb.e6x as edb

username = '<username>'  # Your e6data email id.
password = '<password>'  # Generated Access Token from e6data console.

host = '<host>'  # Host name or IP address of you cluster.
database = '<database>'  # Database name where you want to perform query.

port = 9000  # Engine port.

conn = edb.connect(
    host=host,
    port=port,
    username=username,
    database=database,
    password=password
)

Performing query

query = 'SELECT * FROM <TABLE_NAME>'  # Replace with the actual query.

cursor = conn.cursor()
query_id = cursor.execute(query)  # execute function returns query id, can be use for aborting the query.
all_records = cursor.fetchall()
for row in all_records:
   print(row)

To fetch all the records.

records = cursor.fetchall()

To fetch one record.

record = cursor.fetchone()

To fetch limited records.

limit = 500
records = cursor.fetchmany(limit)

To get execution plan after query execution.

import json

query_planner = json.loads(cursor.explain_analyse())

To abort running query.

query_id = '<query_id>'  # query id from execute function response.
cursor.cancel(query_id)

Switch database in existing connection.

database = '<new_database_name>'  # Replace with the new database.
cursor = conn.cursor(database)

Get Query Time Metrics

import json
query = 'SELECT * FROM <TABLE_NAME>'

cursor = conn.cursor()
query_id = cursor.execute(query)  # execute function returns query id, can be use for aborting th query.
all_records = cursor.fetchall()

query_planner = json.loads(cursor.explain_analyse())

execution_time = query_planner.get("total_query_time")  # In milliseconds
queue_time = query_planner.get("executionQueueingTime")  # In milliseconds
parsing_time = query_planner.get("parsingTime")  # In milliseconds
row_count = query_planner.get('row_count_out')

Get list of databases, tables or columns

The following code returns a dictionary of all databases, all tables and all columns connected to the cluster currently in use. This function can be used without passing database name to get list of all databases.

databases = conn.get_schema_names()  # To get list of databases.
print(databases)

database = '<database_name>'  # Replace with actual database name.
tables = conn.get_tables(database=database)  # To get list of tables from a database.
print(tables)

table_name = '<table_name>'  # Replace with actual table name.
columns = conn.get_tables(database=database, table=table_name)  # To get the list of columns from a table.
columns_with_type = list()
"""
Getting the column name and type.
"""
for column in columns:
   columns_with_type.append(dict(column_name=column.fieldName, column_type=column.fieldType))
print(columns_with_type)

Code Hygiene

It is recommended to clear the cursor, close the cursor and close the connection after running a function as a best practice. This enhances performance by clearing old data from memory.

cursor.clear() # Not needed when aborting a query
cursor.close()
conn.close()

Code Example

The following code is an example.

import e6xdb.e6x as edb
import json

username = '<username>'  # Your e6data email id.
password = '<password>'  # Generated Access Token from e6data console.

host = '<host>'  # Host name or IP address of you cluster.
database = '<database>'  # Database name where you want to perform query.

port = 9000  # Engine port.

sql_query = 'SELECT * FROM <TABLE_NAME>'  # Replace with the actual query.

conn = edb.connect(
    host=host,
    port=port,
    username=username,
    database=database,
    password=password
)

cursor = conn.cursor(db_name=database)
query_id = cursor.execute(sql_query)
all_records = cursor.fetchall()
planner_result = json.loads(cursor.explain_analyse())
execution_time = planner_result.get("total_query_time") / 1000  # Converting into seconds.
row_count = planner_result.get('row_count_out')
columns = [col[0] for col in cursor.description]  # Get the column names and merge with the records.
results = []
for row in all_records:
   row = dict(zip(columns, row))
   results.append(row)
   print(row)
print('Total row count {}, Execution Time (seconds): {}'.format(row_count, execution_time))
cursor.clear()
cursor.close()
conn.close()

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.2.1rc4 pre-release

Jul 30, 2024

2.2.1rc3 pre-release

Jul 30, 2024

2.2.1rc2 pre-release

Jul 30, 2024

2.2.1rc1 pre-release

Jul 27, 2024

2.2.0

May 31, 2024

2.2.0rc0 pre-release

Aug 29, 2024

2.1.21

May 10, 2024

2.1.21.dev0 pre-release

May 31, 2024

2.1.20

May 7, 2024

2.1.19

Apr 19, 2024

2.1.18

Apr 3, 2024

2.1.17

Apr 2, 2024

2.1.16

Apr 2, 2024

2.1.15

Mar 14, 2024

2.1.14

Mar 14, 2024

2.1.14.dev1 pre-release

Mar 14, 2024

2.1.14.dev0 pre-release

Mar 14, 2024

2.1.13

Mar 14, 2024

2.1.12

Mar 14, 2024

2.1.11

Mar 14, 2024

2.1.10

Feb 2, 2024

2.1.10.dev1 pre-release

Mar 13, 2024

2.1.10.dev0 pre-release

Mar 13, 2024

2.1.9

Feb 1, 2024

2.1.9.dev0 pre-release

Mar 13, 2024

2.1.8

Jan 30, 2024

2.1.7

Jan 8, 2024

2.1.6

Jan 8, 2024

2.1.6.dev3 pre-release

Jan 19, 2024

2.1.6.dev2 pre-release

Jan 18, 2024

2.1.6.dev1 pre-release

Jan 10, 2024

2.1.6.dev0 pre-release

Jan 9, 2024

2.1.5

Jan 7, 2024

2.0.5

Jan 3, 2024

2.0.4

Dec 26, 2023

2.0.3

Dec 12, 2023

2.0.2

Dec 12, 2023

2.0.1

Dec 11, 2023

2.0.0

Dec 7, 2023

1.1.11

Dec 2, 2023

1.1.10

Nov 21, 2023

1.1.9

Nov 6, 2023

1.1.8

Sep 21, 2023

1.1.7

Sep 18, 2023

1.1.6

Sep 18, 2023

1.1.5

Aug 18, 2023

1.1.4

Aug 17, 2023

1.1.3

Aug 9, 2023

1.1.2

Aug 9, 2023

1.1.1

Aug 9, 2023

1.1.0

Aug 9, 2023

1.0.10

Aug 3, 2023

1.0.9

Jun 7, 2023

1.0.8

Jun 3, 2023

1.0.7

May 30, 2023

1.0.6

May 22, 2023

1.0.5

May 22, 2023

1.0.4

May 14, 2023

1.0.3

Apr 6, 2023

1.0.2

Apr 3, 2023

This version

1.0.1

Apr 3, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

e6data-python-connector-1.0.1.tar.gz (32.1 kB view details)

Uploaded Apr 3, 2023 Source

Built Distribution

e6data_python_connector-1.0.1-py3-none-any.whl (35.4 kB view details)

Uploaded Apr 3, 2023 Python 3

File details

Details for the file e6data-python-connector-1.0.1.tar.gz.

File metadata

Download URL: e6data-python-connector-1.0.1.tar.gz
Upload date: Apr 3, 2023
Size: 32.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.10

File hashes

Hashes for e6data-python-connector-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`d8455292b7954c84d485ccdf5571dc9b3401b4100db820cf09b34a629421a288`
MD5	`7b24c19252fa9d59b4dc7e154b61729c`
BLAKE2b-256	`bfb0104cb5938258c929234f032d5ac99340a7815d5afa7df9375e53d4520b17`

See more details on using hashes here.

File details

Details for the file e6data_python_connector-1.0.1-py3-none-any.whl.

File metadata

Download URL: e6data_python_connector-1.0.1-py3-none-any.whl
Upload date: Apr 3, 2023
Size: 35.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.10

File hashes

Hashes for e6data_python_connector-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0fe4c9e0463aac46eada869a43af7e8201f095d7014b471b4051b1573712dc22`
MD5	`fcb4b9735f00da3ecf8761f3bcdda9c8`
BLAKE2b-256	`759db28ea4dc4889783e2b6a12392e35115e303de448c0145cae6746c7b528dd`