Parse and slice Hadoop logs

YARN RM

Dataset

from khadoop.yarn import logrm

Parse all files that look like a regular ResourceManager log with the default name.

logrm.FILEPATTERN is a Unix-style glob pattern that matches them.

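from pathlib import Path
LOGFOLDER = Path("logs")  # assumed location of the RM log files, adjust as needed
parsed = []
for filelog in LOGFOLDER.glob(logrm.FILEPATTERN):
    print(filelog)
    with filelog.open() as handle:  # close each file once it has been parsed
        parsed += logrm.process(handle)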
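To illustrate how a glob pattern selects ResourceManager logs, here is a minimal sketch. The pattern and the file names below are hypothetical and only stand in for the real value of logrm.FILEPATTERN, which is not reproduced here.

```python
from fnmatch import fnmatch

# Hypothetical pattern; the real one is stored in logrm.FILEPATTERN.
PATTERN = "yarn-*-resourcemanager-*.log*"

# Hypothetical file names, as they might appear in a log folder.
names = [
    "yarn-yarn-resourcemanager-master01.log",
    "yarn-yarn-resourcemanager-master01.log.1",
    "yarn-yarn-nodemanager-worker07.log",
]

# Keep only the names that match the glob pattern.
matching = [name for name in names if fnmatch(name, PATTERN)]
print(matching)
```

Only the two resourcemanager files match; the nodemanager log is skipped.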

logrm.process parses each line and returns a list of dicts holding the relevant information.

Each dict looks like:

 {
   'accepted_to_running': 6,  # seconds between ACCEPTED and RUNNING
   'id_application': 'application_1596547077642_6854',
   'accept_to_running_ts': '2020-08-06 14:59:59,119',  # timestamp of the 'ACCEPTED to RUNNING' log line
 }
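Once the list is built, slicing it needs nothing beyond plain Python. The sketch below assumes records shaped like the dict above; the second record and both helper names (slow_applications, running_between) are made up for illustration and are not part of khadoop.

```python
from datetime import datetime

TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"

# Sample records; the first mirrors the dict above, the second is invented.
parsed = [
    {
        "accepted_to_running": 6,
        "id_application": "application_1596547077642_6854",
        "accept_to_running_ts": "2020-08-06 14:59:59,119",
    },
    {
        "accepted_to_running": 42,
        "id_application": "application_1596547077642_6855",
        "accept_to_running_ts": "2020-08-06 15:10:03,004",
    },
]


def slow_applications(records, min_seconds):
    """Keep the records that waited at least min_seconds before RUNNING."""
    return [r for r in records if r["accepted_to_running"] >= min_seconds]


def running_between(records, start, end):
    """Keep the records whose RUNNING timestamp falls in [start, end)."""
    return [
        r
        for r in records
        if start <= datetime.strptime(r["accept_to_running_ts"], TS_FORMAT) < end
    ]


slow = slow_applications(parsed, min_seconds=10)
before_three = running_between(
    parsed, datetime(2020, 8, 6, 14, 0), datetime(2020, 8, 6, 15, 0)
)
print([r["id_application"] for r in slow])
print([r["id_application"] for r in before_three])
```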

Here accepted_to_running is the number of seconds between these two timestamps in the aggregated YARN RM log:

2020-08-06 14:59:52,756 INFO  rmapp.RMAppImpl (RMAppImpl.java:handle(779)) - application_1596547077642_6854 State change from SUBMITTED to ACCEPTED
...
2020-08-06 14:59:59,119 INFO  rmapp.RMAppImpl (RMAppImpl.java:handle(779)) - application_1596547077642_6854 State change from ACCEPTED to RUNNING
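The difference between the two lines above can be reproduced with the standard library alone. This is a sketch, not the code khadoop uses: the regular expression is inferred from the two sample lines, and truncating to whole seconds is an assumption (the sample gives 6 either way).

```python
import re
from datetime import datetime

# Assumed layout of a state-change line, inferred from the sample lines.
STATE_CHANGE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) .*?"
    r"(?P<app>application_\d+_\d+) State change from (?P<old>\w+) to (?P<new>\w+)"
)
TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def accepted_to_running(lines):
    """Return {application id: whole seconds between ACCEPTED and RUNNING}."""
    accepted_at = {}
    delays = {}
    for line in lines:
        match = STATE_CHANGE.match(line)
        if match is None:
            continue  # not a state-change line
        ts = datetime.strptime(match["ts"], TS_FORMAT)
        app = match["app"]
        if match["new"] == "ACCEPTED":
            accepted_at[app] = ts
        elif match["new"] == "RUNNING" and app in accepted_at:
            # Assumption: truncate the delay to whole seconds.
            delays[app] = int((ts - accepted_at[app]).total_seconds())
    return delays


sample = [
    "2020-08-06 14:59:52,756 INFO  rmapp.RMAppImpl (RMAppImpl.java:handle(779)) - "
    "application_1596547077642_6854 State change from SUBMITTED to ACCEPTED",
    "2020-08-06 14:59:59,119 INFO  rmapp.RMAppImpl (RMAppImpl.java:handle(779)) - "
    "application_1596547077642_6854 State change from ACCEPTED to RUNNING",
]
print(accepted_to_running(sample))  # {'application_1596547077642_6854': 6}
```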

https://github.com/etsy/logster

Project details

Release history

1.4.0 (Jan 12, 2021)
1.3.5 (Nov 24, 2020)
1.3.4 (Nov 24, 2020)
1.3.3 (Nov 20, 2020)
1.3.2 (Nov 20, 2020), this version
1.3.1 (Nov 20, 2020)
1.3.0 (Sep 20, 2020)
1.2.1 (Aug 21, 2020)
1.2.0 (Aug 18, 2020)
1.1.0 (Aug 17, 2020)
1.0.0 (Aug 13, 2020)
0.1.1 (May 27, 2020)
0.1.0 (May 26, 2020)

Download files

Source Distribution

khadoop-1.3.2.tar.gz (6.6 kB)

Uploaded Nov 20, 2020 Source

Built Distribution

khadoop-1.3.2-py3-none-any.whl (7.7 kB)

Uploaded Nov 20, 2020 Python 3

Hashes for khadoop-1.3.2.tar.gz
Algorithm	Hash digest
SHA256	`4dcfb423519e56c88fddde2d008f37218a8a8685ad1a5aa667954dee3b2e29f5`
MD5	`6699155979020e4eff7f8b857a1ff7c9`
BLAKE2b-256	`90a85a1fb76a6bac818f3fb7aec9944e334acb1058be97a554408ec09f36e3db`

Hashes for khadoop-1.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7d34d839acc535082f37cd148900b5d823a97373288e3c6d09cdb4a1a97de9e2`
MD5	`0efca565b971e3bdf6c1881ca8ddabf8`
BLAKE2b-256	`12313031f31c5aa638d50b042ed0feea7b16484b60d575b205037d383c076f40`