ssh-jump-hive

ssh_jump_hive is a tools could jump the jump machine to connect hive get hive data to pandas dataframe

These details have not been verified by PyPI

Project links

Homepage

Project description

DSTL
====

https://github.com/mullerhai/sshjumphive

Note: this repo is not supported. License is MIT.

..

image:: ssh_jump_hive.jpg
.. image:: https://github.com/mullerhai/sshjumphive/blob/master/img/logo.jpeg

.. contents::

Install [sorry Mircosoft Windows System cannot use it]
------------

: pip install -U ssh-jump-hive [Now Version is 0.3.5]

- python Version >= 3.5
- sasl>=0.2.1
- thrift>=0.11.0
- thrift-sasl>=0.3.0
- paramiko>=2.4.1
- selectors>=0.0.14

Use in Unix System Terminal[centos macos ubuntu]
------------

: jumps
- default param
: parameter:
- @click.option('-jh', '--jumphost', default="***", help='Jump Gateway Server host 跳板机ssh 主机名, 默认117.48.195.186')
- @click.option('-jp', '--jumpport', default=2222, help='Jump Gateway Server port跳板机ssh登录端口号, 默认2222')
- @click.option('-ju', '--jumpuser', default='dm', help='Jump Gateway Server login user 跳板机 ssh登录用户名')
- @click.option('-jpd', '--jumppwd', default="***", help='Jump Gateway Server login user password 跳板机登录用户密码')
- @click.option('-th', '--tunnelhost', default='172.16.16.32', help='ssh-tunnel 隧道 host ')
- @click.option('-tp', '--tunnelappport', default=10000, help='ssh-tunnel Application port隧道目标程序的端口号默认为 hive 10000 ')
- @click.option('-lh', '--localhost', default='127.0.0.1', help='localhost本机 host ,默认127.0.0.1 ')
- @click.option('-lp', '--localbindport', default="4230", help='localbindport 本机被绑定的端口号')
- @click.option('-dt', '--daemonsecond', default="21600", help='ssh_tunnel_daemon_session_hold_on_second six hours, ssh 隧道后台线程保持时间默认为六小时')

.. image:: https://github.com/mullerhai/sshjumphive/blob/master/img/runshell.jpeg

Use in Unix System Terminal Run GUI[centos macos ubuntu]
------------
: jumpgui
- you will see the GUI like this
.. image:: https://github.com/mullerhai/sshjumphive/blob/master/img/rungui.jpg

If you Buy the SSH_Tunnel for mac [maybe feel Expensive]
------------

.. image:: https://github.com/mullerhai/sshjumphive/blob/master/img/SSH_Tunnel_mac.jpg

Object types
------------

Note that ssh_jump_hive is an tools can jump the jump machine to connect hive get hive data to pandas dataframe:

- 0: hive_client for simple connect hive server with no jump server
- 1: Jump_Tunnel just for connect hive server with jump server separete
- 2: SSH_Tunnel for get ssh tunnel channel

General approach
----------------

if you want to use it ,you need to know some things
for example these parameters [ jumphost,jumpport,jumpuser,jumppwd,tunnelhost,tunnelAPPport,localhost,localbindport]
for hive server you also need to know params [localhost, hiveusername, hivepassword, localbindport,database, auth]
for query hive data you need to know params [ table, query_fileds_list, partions_param_dict, query_limit]

if your hive server has jump server separete， you need do like this
[
::
from ssh_jump_hive import Jump_Tunnel_HIVE
import pandas as pd
## get hive_tunnel_client_session
def gethive():
jumphost = '117.*****.176'
jumpport = 2222
jumpuser = 'dm'
jumppwd = '&&&&&&'
tunnelhost = '172.**.16.32'
tunnelhiveport = 10000
localhost = '127.0.0.1'
localbindport = 4800
username = 'muller'
auth = 'LDAP'
password = "abc123."
database = 'fkdb'
table = 'tab_client_label'
partions_param_dict = {'client_nmbr': 'AA75', 'batch': 'p1'}
query_fileds_list = ['gid', 'realname', 'card']
querylimit = 1000
jump = Jump_Tunnel_HIVE(jumphost, jumpport, jumpuser, jumppwd, tunnelhost, tunnelhiveport, localhost, localbindport,
username, password)
return jump

## query some fileds by table name and partitions params
def demo1():
table = 'tab_client_label'
partions_param_dict = {'client_nmbr': 'AA75', 'batch': 'p1'}
query_fileds_list = ['gid', 'realname', 'card']
querylimit = 1000
jump=gethive()
df2=jump.get_JumpTunnel_df(table,partions_param_dict,query_fileds_list,querylimit)
return df2
## query all fileds by table name and partitions params
def demo2():
table = 'tab_client_label'
partions_param_dict = {'client_nmbr': 'AA75', 'batch': 'p1'}
jump =gethive()
df2 = jump.get_JumpTunnel_table_partitions_df(table,partions_param_dict,1000)
return df2
## use hsql to query data
def demo3():
jump = gethive()
hsql="select * from fkdb.tab_client_label where client_nmbr= 'AA75' and batch= 'p1' limit 500"
df2=jump.get_JumpTunnel_hsql_df(hsql)
return df2
## initail the instance to query
df3=demo2()
print(df3.shape)
print(df3.columns)
print(df3.head(100))
]

UNet network with batch-normalization added, training with Adam optimizer with
a loss that is a sum of 0.1 cross-entropy and 0.9 dice loss.
Input for UNet was a 116 by 116 pixel patch, output was 64 by 64 pixels,
so there were 16 additional pixels on each side that just provided context for
the prediction.
Batch size was 128, learning rate was set to 0.0001
(but loss was multiplied by the batch size).
Learning rate was divided by 5 on the 25-th epoch
and then again by 5 on the 50-th epoch,
most models were trained for 70-100 epochs.
Patches that formed a batch were selected completely randomly across all images.
During one epoch, network saw patches that covered about one half
of the whole training set area. Best results for individual classes
were achieved when training on related classes, for example buildings
and structures, roads and tracks, two kinds of vehicles.

Augmentations included small rotations for some classes
(±10-25 degrees for houses, structures and both vehicle classes),
full rotations and vertical/horizontal flips
for other classes. Small amount of dropout (0.1) was used in some cases.
Alignment between channels was fixed with the help of
``cv2.findTransformECC``, and lower-resolution layers were upscaled to
match RGB size. In most cases, 12 channels were used (RGB, P, M),
while in some cases just RGB and P or all 20 channels made results
slightly better.

Validation
----------

Validation was very hard, especially for both water and both vehicle
classes. In most cases, validation was performed on 5 images
(6140_3_1, 6110_1_2, 6160_2_1, 6170_0_4, 6100_2_2), while other 20 were used
for training. Re-training the model with the same parameters on all 25 images
improved LB score.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.3.7

Apr 27, 2018

0.3.6

Apr 27, 2018

0.3.5

Apr 27, 2018

0.3.4

Apr 27, 2018

0.3.3

Apr 26, 2018

0.3.2

Apr 26, 2018

0.3.1

Apr 25, 2018

0.3.0

Apr 25, 2018

0.2.8

Apr 24, 2018

0.2.7

Apr 24, 2018

0.2.6

Apr 24, 2018

0.2.5

Apr 24, 2018

0.2.4

Apr 24, 2018

0.2.3

Apr 24, 2018

0.2.0

Apr 24, 2018

0.1.9

Apr 24, 2018

0.1.8

Apr 24, 2018

0.1.7

Apr 24, 2018

0.1.6

Apr 24, 2018

0.1.5

Apr 24, 2018

0.1.4

Apr 23, 2018

0.1.3

Apr 23, 2018

0.1.2

Apr 23, 2018

0.1.1

Apr 23, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ssh-jump_hive-0.3.7.tar.gz (874.8 kB view details)

Uploaded Apr 27, 2018 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ssh_jump_hive-0.3.7-py2.py3-none-any.whl (19.2 kB view details)

Uploaded Apr 27, 2018 Python 2Python 3

File details

Details for the file ssh-jump_hive-0.3.7.tar.gz.

File metadata

Download URL: ssh-jump_hive-0.3.7.tar.gz
Upload date: Apr 27, 2018
Size: 874.8 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for ssh-jump_hive-0.3.7.tar.gz
Algorithm	Hash digest
SHA256	`8372e0737c22f8afaa9d698654a2fc3ecde6f0be6641c81a1639f1703e1e1f85`
MD5	`c8db7236c18f37aae1631c018fc26803`
BLAKE2b-256	`7673cfc29552daf2e0f9524fdbc6a3ef5d8ae97c6b5d43218cb4f259c1914318`

See more details on using hashes here.

File details

Details for the file ssh_jump_hive-0.3.7-py2.py3-none-any.whl.

File metadata

Download URL: ssh_jump_hive-0.3.7-py2.py3-none-any.whl
Upload date: Apr 27, 2018
Size: 19.2 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No

File hashes

Hashes for ssh_jump_hive-0.3.7-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`f4704f6f10b4716b68cbbf6be0f6a07512c7db16501cff54c892c15a73c0095d`
MD5	`bc5e4180b347ca27c24f2bade3632c84`
BLAKE2b-256	`aadb581f87bd4f24a765e81e8d5057052a7c9d48166a489e8984c394b2c2fca3`

See more details on using hashes here.

ssh-jump-hive 0.3.7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes