xutil

Extra Useful Utilities.

These details have not been verified by PyPI

Project links

Intended Audience
Operating System
- MacOS
- Unix
Programming Language
- Python :: 3
Topic
- Utilities

Project description

xutil

A Python package that bundles commonly used utility functions and libraries.

Install

pip install xutil
pip install xutil[jdbc] # for JDBC connectivity. Requires JPype1.
pip install xutil[web] # for web scraping. Requires Twisted.
pip install xutil[hive] # for Hive connectivity. Requires SASL libraries.
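
Each extra pulls in an optional dependency, so it can help to check that the dependency is importable before relying on a feature. The following is a minimal sketch using only the standard library. The helper name `has_module` and the mapping of extras to import names (`jpype` for JPype1, `twisted` for Twisted) are assumptions made for illustration and are not part of xutil.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module called `name` can be imported."""
    return importlib.util.find_spec(name) is not None


# Hypothetical mapping of extras to the top-level module each one needs.
EXTRA_MODULES = {"jdbc": "jpype", "web": "twisted"}

for extra, module in EXTRA_MODULES.items():
    status = "available" if has_module(module) else "missing"
    print(f"xutil[{extra}]: {module} is {status}")
```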

Windows

If you see the error ‘error: Microsoft Visual C++ 14.0 is required. Get it with “Microsoft Visual C++ Build Tools”’, you can install the build tools quickly with Chocolatey (https://chocolatey.org/):

choco install -y VisualCppBuildTools

CLI

Available commands:

xutil-alias          # add useful alias commands, see xutil/alias.sh
xutil-create-profile # creates ~/profile.yaml from template.
exec-etl --help      # Execute various ETL operations.
exec-sql --help      # Execute SQL from command line
ipy                  # launch ipython with pre-defined modules/functions imported
ipy-spark --help     # launch ipython Spark with pre-defined modules/functions imported
pykill pattern       # kill any process whose command string matches pattern
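
To illustrate the idea behind `pykill`, here is a rough sketch of selecting process IDs by command string. It is not xutil's implementation: the function name `matching_pids` is hypothetical, and treating the pattern as a regular expression is an assumption (the actual command may match differently). The input is assumed to hold one process per line as "<pid> <command>".

```python
import re
from typing import List


def matching_pids(ps_output: str, pattern: str) -> List[int]:
    """Return PIDs whose command string matches `pattern` (treated as a regex)."""
    regex = re.compile(pattern)
    pids = []
    for line in ps_output.splitlines():
        parts = line.strip().split(None, 1)
        if len(parts) != 2 or not parts[0].isdigit():
            continue  # skip blank or malformed lines
        pid, command = parts
        if regex.search(command):
            pids.append(int(pid))
    return pids


# Made-up process listing used only for the demonstration.
sample = """\
  101 python etl_job.py --full
  202 /usr/bin/ssh-agent
  303 python -m http.server
"""
print(matching_pids(sample, r"etl_job"))  # prints [101]
```

The resulting PIDs could then be passed to `os.kill` with a signal such as `signal.SIGTERM`.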

Databases

Why not use SQLAlchemy (SA)? http://docs.sqlalchemy.org/en/latest/faq/performance.html#i-m-inserting-400-000-rows-with-the-orm-and-it-s-really-slow

It has been demonstrated that SA is not performant enough for fast ETL.
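
The usual alternative for bulk loads is a batched insert at the DBAPI level. A minimal sketch of that technique follows, using the standard-library `sqlite3` module as a stand-in for a real target database. The table name and row count are illustrative, and this is not code from xutil.

```python
import sqlite3

# In-memory SQLite stands in for a real target database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT)")

# Build the rows up front, then send them in a single batched call
# instead of issuing one INSERT statement per row.
rows = [(i, f"NAME {i}") for i in range(10_000)]
with conn:  # commits on success, rolls back on error
    conn.executemany("INSERT INTO customer (id, name) VALUES (?, ?)", rows)

inserted = conn.execute("SELECT COUNT(*) FROM customer").fetchone()[0]
print(inserted)  # prints 10000
```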

SQL Server

Installation

Make sure ODBC is installed.

brew install unixodbc     # macOS (Homebrew)
apt-get install unixodbc  # Debian/Ubuntu

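Then install the Microsoft ODBC driver for SQL Server by following the instructions at:

https://docs.microsoft.com/en-us/sql/connect/odbc/linux-mac/installing-the-microsoft-odbc-driver-for-sql-server?view=sql-server-2017

Check the resulting ODBC configuration:

odbcinst -j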
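
The same check can be scripted, for example as a pre-flight step before an ETL run. The sketch below only uses the standard library. The function name `odbc_config_report` is hypothetical, and it simply returns whatever `odbcinst -j` prints, or `None` when the tool is not on the PATH.

```python
import shutil
import subprocess
from typing import Optional


def odbc_config_report() -> Optional[str]:
    """Return the output of `odbcinst -j`, or None if odbcinst is not on PATH."""
    exe = shutil.which("odbcinst")
    if exe is None:
        return None
    result = subprocess.run([exe, "-j"], capture_output=True, text=True, check=False)
    return result.stdout


report = odbc_config_report()
print(report if report is not None else "odbcinst not found; is unixODBC installed?")
```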

Oracle

Install Oracle Client:

brew tap InstantClientTap/instantclient
brew install instantclient-basic

Installing with conda:

conda install oracle-instantclient -y

Spark SQL

It is the user’s responsibility to properly set up the SPARK_HOME environment and configurations. This library uses pyspark and will default to the SPARK_HOME settings.
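
Because the library relies on SPARK_HOME, it can be useful to fail early with a clear message when the variable is missing. A small sketch, assuming nothing about xutil's internals; the function name `resolve_spark_home` and the path `/opt/spark` are placeholders.

```python
import os
from typing import Mapping


def resolve_spark_home(env: Mapping[str, str] = os.environ) -> str:
    """Return SPARK_HOME from `env`, raising a clear error if it is unset or empty."""
    spark_home = env.get("SPARK_HOME", "").strip()
    if not spark_home:
        raise RuntimeError("SPARK_HOME is not set; point it at your Spark installation.")
    return spark_home


# Example with an explicit mapping; /opt/spark is a placeholder path.
print(resolve_spark_home({"SPARK_HOME": "/opt/spark"}))  # prints /opt/spark
```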

Useful setup.py examples

https://github.com/apache/incubator-airflow/blob/master/setup.py

https://github.com/dask/dask/blob/master/setup.py

https://github.com/tartley/colorama/blob/master/setup.py

Dev

pip install -e /path/to/xutil

Testing

python setup.py test

Release

Update version in setup.py.
Draft a new release on GitHub: https://github.com/flarco/xutil/releases/new

git clone https://github.com/flarco/xutil.git
cd xutil
m2r --overwrite README.md
python setup.py sdist && twine upload --skip-existing dist/*

TODO

Revamp database.base methods:

get_conn
DBConn
  __init__
  _set_variables
  _do_execute
  _split_schema_table
  _concat_fields
  _template

  connect
  check_pk
  execute -- straight SA.connection.execute, returns "fields, rows"
  query -- uses SQLAlchemy and replaces self.select; fields = conn._fields
  stream
  insert
  drop_table
  create_table
  get_cursor_fields -> _get_cursor_fields
  get_schemas
  get_objects
  get_tables
  get_views
  get_columns
  get_primary_keys
  get_indexes
  get_ddl
  get_all_columns
  get_all_tables
  analyze_fields
  analyze_tables
  analyze_join_match

  remove:
    get_cursor: no need for get_cursor with SA
    execute_multi
    select: use `query` instead, which uses `execute`
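
The planned `execute` and `query` contract could look roughly like the sketch below. It uses the standard-library `sqlite3` module instead of SA so that it runs without extra dependencies. The class name `SketchConn` is hypothetical, and none of this is xutil's actual code.

```python
import sqlite3
from typing import List, Sequence, Tuple


class SketchConn:
    """Hypothetical stand-in for the planned DBConn; not xutil's actual class."""

    def __init__(self, path: str = ":memory:") -> None:
        self._conn = sqlite3.connect(path)
        self._fields: List[str] = []

    def execute(self, sql: str, params: Sequence = ()) -> Tuple[List[str], List[tuple]]:
        """Run `sql` and return (fields, rows); both are empty without a result set."""
        cursor = self._conn.execute(sql, params)
        # cursor.description is None when the statement returns no result set.
        fields = [col[0] for col in cursor.description or []]
        rows = cursor.fetchall() if fields else []
        return fields, rows

    def query(self, sql: str, params: Sequence = ()) -> List[tuple]:
        """Return rows only, delegating to execute and remembering the field names."""
        self._fields, rows = self.execute(sql, params)
        return rows


conn = SketchConn()
conn.execute("CREATE TABLE t (id INTEGER, name TEXT)")
conn.execute("INSERT INTO t VALUES (?, ?)", (1, "a"))
fields, rows = conn.execute("SELECT id, name FROM t")
print(fields, rows)  # prints ['id', 'name'] [(1, 'a')]
```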

Release history

0.2.3

Jun 15, 2021

0.2.2

Jun 15, 2021

0.2.1

Feb 4, 2020

0.2.0

Jan 22, 2020

0.1.9

Oct 15, 2019

0.1.8

Oct 15, 2019

0.1.7

Oct 15, 2019

0.1.6

Oct 15, 2019

0.1.5

Oct 15, 2019

0.1.4

Oct 15, 2019

0.1.3

Oct 11, 2019

0.1.2

Jul 14, 2019

0.1.1

May 14, 2019

0.1.0

May 5, 2019

This version

0.0.9

May 2, 2019

0.0.8

May 2, 2019

0.0.7

Apr 29, 2019

0.0.6

Mar 24, 2019

0.0.5

Mar 19, 2019

0.0.4

Mar 19, 2019

0.0.3

Mar 18, 2019

0.0.2

Mar 17, 2019

0.0.1

Mar 17, 2019

Download files

Source Distribution

xutil-0.0.9.tar.gz (12.8 MB)

Uploaded May 2, 2019 Source

Hashes for xutil-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`fbd5b7f367a676a9d033b0bed6dc196134e8e1d2c5d9b725d21666ede0f1b76e`
MD5	`561fa64d8024c2440a4d7b7a4530a142`
BLAKE2b-256	`eb9bc0cad1d9fa9f18bb70a54bbd3f90a0cf24aba9a02f701941f0304ecb8b0b`
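
A downloaded archive can be checked against the SHA256 digest listed above. The following sketch uses only the standard library. The helper name `sha256_of` is hypothetical, and the demonstration hashes a throwaway file rather than the real archive.

```python
import hashlib
import os
import tempfile


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Demonstration on a throwaway file; for the real archive, compare the
# result of sha256_of("xutil-0.0.9.tar.gz") with the listed SHA256 value.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"hello")
demo_digest = sha256_of(tmp.name)
os.remove(tmp.name)
print(demo_digest == hashlib.sha256(b"hello").hexdigest())  # prints True
```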