rjj (read-joint-jet) is a simple cmd-based data analysis/transforming wizard

rjj (read-joint-jet)

rjj is a simple cmd-based data transforming/analysis wizard

install it via pip/pip3

pip install rjj

update rjj

pip install rjj --upgrade

check current version

rjj -v

read user manual

rjj -h

data transformer

helps you prepare your csv data file(s)

convertor

convert json to csv

rjj c

select a json file in the current directory, then choose whether to enter a different output file name (no extension needed) (Y/n); if not, the converted csv is saved with the same name as the json file
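For readers who want to see what a json-to-csv conversion involves, here is a minimal sketch using only the Python standard library. It is not rjj's own code; the function name `json_to_csv` and the assumption that the input is a JSON array of flat objects are illustrative only.

```python
import csv
import io
import json


def json_to_csv(json_text):
    """Convert a JSON array of flat objects into CSV text (hypothetical helper)."""
    records = json.loads(json_text)
    # Collect column names in first-seen order across all records.
    fieldnames = list(dict.fromkeys(key for rec in records for key in rec))
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)  # keys missing from a record become empty cells
    return buffer.getvalue()


sample = '[{"id": 1, "name": "ann"}, {"id": 2, "name": "bob"}]'
print(json_to_csv(sample))
```

Nested objects would need flattening first; this sketch leaves that out.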

reversor

reverse csv back to json

rjj r

select a csv file in the current directory, then choose whether to enter a different output file name (Y/n); if not, the converted json file is saved with the same name; supports any data type, even emoji

converter

convert csv to json

rjj y

fixes the reversor issue where an invisible special character gets attached to the first column name, but in return loses emoji support

cluster

convert csv to json with cluster layer

rjj z

the selected csv file needs at least two columns; column 2 should contain clustered data, e.g., a,b,c

eraser

erase duplicate record(s)

rjj e

remove repeated row(s) for a selected csv file
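Removing repeated rows can be sketched as follows. This is an illustration under the assumption that two rows count as duplicates only when every field matches exactly; `drop_duplicate_rows` is a hypothetical name, not part of rjj.

```python
import csv
import io


def drop_duplicate_rows(csv_text):
    """Keep the first occurrence of each row, preserving the original order."""
    seen = set()
    kept = []
    for row in csv.reader(io.StringIO(csv_text)):
        key = tuple(row)  # lists are not hashable, tuples are
        if key not in seen:
            seen.add(key)
            kept.append(row)
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(kept)
    return out.getvalue()


print(drop_duplicate_rows("a,b\n1,2\n1,2\n3,4\n"))
```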

join

join all csv(s) together (repeated records accepted)

rjj join

pure join; does not create identifier(s); differs from jointer below

inner join

inner join two csv files

rjj i

join record(s) based on identical value(s) in 1st column

outer join

outer join two csv files

rjj o

join record(s) based on value(s) in 1st column; identical value(s) will be overlapped
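The difference between the inner and outer joins described above can be sketched on small in-memory tables. This is only an illustration, assuming the values in the 1st column are unique within each file; the function name and the `how` parameter are hypothetical and do not reflect rjj's internals.

```python
def join_on_first_column(left, right, how="inner"):
    """Join two tables (lists of rows, header first) on their first column."""
    left_header, *left_rows = left
    right_header, *right_rows = right
    left_map = {row[0]: row[1:] for row in left_rows}
    right_map = {row[0]: row[1:] for row in right_rows}
    if how == "inner":
        # Only keys present in both tables survive.
        keys = [k for k in left_map if k in right_map]
    else:
        # Outer: every key from either table, in first-seen order.
        keys = list(dict.fromkeys([*left_map, *right_map]))
    blank_left = [""] * (len(left_header) - 1)
    blank_right = [""] * (len(right_header) - 1)
    result = [left_header + right_header[1:]]
    for k in keys:
        result.append([k] + left_map.get(k, blank_left) + right_map.get(k, blank_right))
    return result


left = [["id", "name"], ["1", "ann"], ["2", "bob"]]
right = [["id", "score"], ["2", "90"], ["3", "75"]]
print(join_on_first_column(left, right, how="inner"))
print(join_on_first_column(left, right, how="outer"))
```

In the outer result, a key found in only one file gets empty cells for the other file's columns.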

detector

detect the co-existing record(s) between two csv files

rjj d

select two csv files to execute the detection process, then give a name for the output file; co-existing record(s) will be indicated in a newly created column Coexist

filter

locate the input Keyword among all csv files in the current directory (you can opt to include the files in all its sub-folders; cool right?)

rjj f

source file (location info) will be indicated in a newly created first column Source_file; the exact coordinate (x,y) will be given in the newly created second and third columns, namely Column_y and Row_x; and the full record will be pasted behind for simplifying your auditing work
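The output layout described above (Source_file, Column_y, Row_x, then the full record) can be sketched like this. The 1-based coordinates and substring matching are assumptions made for the example, and `locate_keyword` is a hypothetical helper working on in-memory tables rather than real files.

```python
def locate_keyword(tables, keyword):
    """tables maps a file name to its rows; returns one output row per hit."""
    hits = []
    for source_file, rows in tables.items():
        for row_x, row in enumerate(rows, start=1):
            for column_y, cell in enumerate(row, start=1):
                if keyword in cell:
                    # Source_file, Column_y, Row_x, followed by the full record
                    hits.append([source_file, column_y, row_x, *row])
    return hits


tables = {
    "a.csv": [["id", "fruit"], ["1", "apple"]],
    "b.csv": [["x"], ["pineapple"]],
}
for hit in locate_keyword(tables, "apple"):
    print(hit)
```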

matcher

identify matched/repeated record(s) among all csv files in the current directory and its sub-directories

rjj m

provide a name for the output file (if none is given, the output file will be named output.csv); source file (location) will be indicated in a newly created column Source_file

uniquer

identify unique/non-repeated record(s) among all csv files in the current directory and its sub-directories

rjj u

give a name to the output file; source location will be indicated in a newly created column Source_file

binder

bind all csv files together by columns in the current directory

rjj b

binder is different from jointer below; it can be considered as a horizontal merge whereas jointer is for vertical merge (basically by rows)
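The horizontal versus vertical distinction can be sketched on small in-memory tables. Both function names are hypothetical; the vertical version assumes all files share the same header and tags each row with its file name in a File column, as the jointer section describes.

```python
from itertools import zip_longest


def bind_columns(table_a, table_b):
    """Horizontal merge: place the columns of table_b to the right of table_a."""
    width_a, width_b = len(table_a[0]), len(table_b[0])
    merged = []
    for row_a, row_b in zip_longest(table_a, table_b):
        # Pad the shorter table with empty cells.
        row_a = row_a if row_a is not None else [""] * width_a
        row_b = row_b if row_b is not None else [""] * width_b
        merged.append(row_a + row_b)
    return merged


def joint_rows(named_tables):
    """Vertical merge: stack the rows of every table under one header."""
    merged = []
    for file_name, (header, *rows) in named_tables.items():
        if not merged:
            merged.append(["File", *header])
        merged.extend([file_name, *row] for row in rows)
    return merged


a = [["id", "x"], ["1", "p"], ["2", "q"]]
b = [["y"], ["r"]]
c = [["id", "x"], ["3", "s"]]
print(bind_columns(a, b))
print(joint_rows({"a.csv": a, "c.csv": c}))
```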

jointer and splitter

joint or split your data file(s)

jointer

joint all csv files in the current directory together

rjj j

all file names will be stored in the first field of the newly created column File; when you execute the command you will be asked to assign a name for the output file

splitter

split the selected csv file into separate csv files, each named after the value in the first field of the selected file

rjj s
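Splitting by the value in the first field amounts to grouping rows. A minimal sketch follows, assuming each output keeps the original header and is named `<value>.csv`; that naming and the function name are assumptions for illustration only.

```python
from collections import defaultdict


def split_by_first_field(table):
    """Group data rows by their first field; returns {output name: rows}."""
    header, *rows = table
    groups = defaultdict(list)
    for row in rows:
        groups[row[0]].append(row)
    # One table per distinct value, each starting with the shared header.
    return {f"{value}.csv": [header, *members] for value, members in groups.items()}


table = [["File", "id"], ["a", "1"], ["b", "2"], ["a", "3"]]
for name, rows in split_by_first_field(table).items():
    print(name, rows)
```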

extension for excel

extended function/module(s) for handling excel file(s)

xplit

split the selected excel file (.xls or .xlsx) into pieces, each named after the value in the first field of the selected file

rjj x

joint

joint all excels (.xls and .xlsx) in the current directory together; all file names will be stored in the first field of the newly created column File

rjj t

differs from the csv jointer: since both .xls and .xlsx are accepted and the file extension is not recorded, two files that share the same file name will be merged under that name (and cannot be split apart again by the command above); understand this condition, make good use of it!

matxh

identify matched/repeated record(s) in the current directory and its sub-directories; for excel

rjj h

source file (location) will be indicated in a newly created column Source_file; the exact sheet name inside that file will be told in another newly created column Sheet_name

uniquex

identify unique/non-repeated record(s) in the current directory and its sub-directories; for excel

rjj q

source will be indicated in a newly created column Source_file; exact sheet will be told in Sheet_name

kilter

locate the input Keyword among all excel files (.xls and .xlsx) in the current directory (can be expanded to its sub-folders)

rjj k

this feature is similar to the csv filter; but since each excel file may contain more than one sheet, the sheet number will be stored in the newly created column Sheet_z, followed by the exact coordinate (x,y), namely Column_y and Row_x; and the full record will be pasted behind as well; super kooooo

file/folder manager

analyzor

run file analysis (process time depends on file size)

rjj a

return file statistics and a summary report; include sha256 hash, size, duplicate and uniqueness count, etc.
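A summary of this kind could be assembled as in the sketch below. It is not rjj's report format; the field names and the definitions used (a row is unique if it appears exactly once, duplicates are counted as the extra copies) are assumptions for illustration.

```python
import hashlib
from collections import Counter


def summarize(data):
    """Return basic statistics for the raw bytes of a text file."""
    lines = data.decode("utf-8").splitlines()
    counts = Counter(lines)
    return {
        "sha256": hashlib.sha256(data).hexdigest(),
        "size_bytes": len(data),
        "rows": len(lines),
        # Rows that appear exactly once.
        "unique_rows": sum(1 for n in counts.values() if n == 1),
        # Extra copies beyond the first occurrence of each repeated row.
        "duplicated_rows": sum(n - 1 for n in counts.values() if n > 1),
    }


report = summarize(b"a,b\n1,2\n1,2\n3,4\n")
print(report)
```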

folder creator

create folder(s) according to the selected list (prepare a column storing all the folder names first)

rjj dir

statistical analysis

simple statistical analysis is now available in rjj

one-sample z-test

compare group with norm (population mean and standard deviation known)

rjj oz
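For reference, the one-sample z statistic is the sample mean minus the population mean, divided by the population standard deviation over the square root of n. A minimal standard-library sketch follows; the function name is hypothetical and this is not how rjj necessarily computes or reports it.

```python
from math import sqrt
from statistics import NormalDist, fmean


def one_sample_z(sample, pop_mean, pop_sd):
    """Return the z statistic and the two-sided p-value."""
    z = (fmean(sample) - pop_mean) / (pop_sd / sqrt(len(sample)))
    p_two_sided = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_two_sided


z, p = one_sample_z([102, 104, 106, 108], pop_mean=100, pop_sd=10)
print(round(z, 4), round(p, 4))
```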

one-sample t-test

compare group with norm (population mean known)

rjj ot

paired-sample t-test

compare group across time/paired-feature

rjj pt

e.g., select first column as post-test data; second column as pre-test data
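The paired-sample t statistic works on the per-pair differences (post minus pre): the mean difference divided by its standard error, with n - 1 degrees of freedom. The sketch below computes only the statistic and degrees of freedom, since the standard library has no t distribution for the p-value; the function name is hypothetical.

```python
from math import sqrt
from statistics import fmean, stdev


def paired_t(post, pre):
    """Return (t statistic, degrees of freedom) for paired samples."""
    diffs = [a - b for a, b in zip(post, pre)]
    n = len(diffs)
    t = fmean(diffs) / (stdev(diffs) / sqrt(n))  # stdev uses the n - 1 denominator
    return t, n - 1


t, df = paired_t(post=[12, 15, 11, 14], pre=[10, 12, 10, 11])
print(round(t, 4), df)
```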

independent-sample t-test

compare two independent groups; you can opt to calculate it with equal variances assumed or not assumed

rjj it

select first column as data of group 1; second column as data of group 2

levene test

run Levene test for two groups (centered by mean)

rjj lv

select first column as data of group 1; second column as data of group 2
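As a rough illustration of the mean-centered Levene statistic W: each value is replaced by its absolute deviation from its group mean, and W is the one-way ANOVA F ratio computed on those deviations. The sketch returns W only (the p-value needs an F distribution, which the standard library lacks); `levene_w` is a hypothetical name.

```python
from statistics import fmean


def levene_w(*groups):
    """Levene's W statistic, centered by the group means."""
    # Absolute deviations from each group's own mean.
    z = [[abs(v - fmean(g)) for v in g] for g in groups]
    k = len(z)
    n_total = sum(len(g) for g in z)
    grand = fmean([v for g in z for v in g])
    between = sum(len(g) * (fmean(g) - grand) ** 2 for g in z)
    within = sum((v - fmean(g)) ** 2 for g in z for v in g)
    return (n_total - k) / (k - 1) * between / within


w = levene_w([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
print(round(w, 4))
```

The same function accepts more than two groups, since the formula does not depend on the number of groups.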

homogeneity of variance

run Levene test for two or more groups

rjj hv

select first column as group variable; second column as data

one-way anova

compare two or more groups

rjj oa

select first column as group variable; second column as data

correlation analysis

calculate Pearson correlation coefficient (r)

rjj ca

explore the relationship between two variables
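Pearson's r is the sum of cross-products of deviations divided by the square root of the product of the two sums of squared deviations. A minimal sketch with a hypothetical function name:

```python
from math import sqrt
from statistics import fmean


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


print(pearson_r([1, 2, 3, 4], [2, 4, 6, 8]))
```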

descriptive statistics

calculate it for a column

rjj n

calculate it by group(s)

rjj g

power analysis

a priori (sample size estimation)

estimate sample size for paired-sample t-test

rjj pp

estimate sample size for independent-sample t-test

rjj pi

estimate sample size for one-way anova

rjj po

estimate sample size for correlation analysis

rjj pc

estimate sample size for regression analysis

rjj pr

post hoc (effect size evaluation)

evaluate effect size and power for one-sample t-test

rjj et

evaluate effect size and power for paired-sample t-test

rjj ep

evaluate effect size and power for independent-sample t-test

rjj ei

evaluate effect size and power for one-way anova

rjj eo

Tukey's Honest Significant Difference (HSD) is implemented for post hoc multiple comparisons

regression analysis

make prediction(s) about the future

rjj ra

evaluate the quality of predictor(s) as well as the intended model

regression model fit analysis

run model fit analysis of regression

rjj fit

evaluate three commonly used models (by R²): linear, quadratic and cubic; see which one fits your data best

reliability test

run reliability analysis for a hypothetical construct; cheers!

rjj rt

Cronbach alpha if item deleted is provided as well; make your item screening task easy
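Cronbach's alpha for k items is k/(k-1) times (1 minus the sum of item variances over the variance of the total score). "Alpha if item deleted" simply recomputes it with each item left out in turn. The sketch below assumes one list of scores per item, all of equal length; the function names are hypothetical.

```python
from statistics import variance


def cronbach_alpha(items):
    """items: list of columns, one list of respondent scores per item."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # total score per respondent
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


def alpha_if_deleted(items):
    """Alpha recomputed with each item removed in turn (needs at least 3 items)."""
    return [cronbach_alpha(items[:i] + items[i + 1:]) for i in range(len(items))]


items = [[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]
print(cronbach_alpha(items))
print(alpha_if_deleted(items))
```

An item whose removal raises alpha noticeably is a candidate for screening out.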

exploratory factor analysis

probably you cannot believe it works

rjj efa

analyze the structure of interrelationships among a large set of items; summarize it into a small set of factors

fixed factor exploratory analysis

run it with a fixed number of factor(s)

rjj tea

remove unnecessary columns from your csv file first, since this time it eats them all up

confirmatory factor analysis

wow! finally it comes

rjj cfa

draw svg factor diagram; you might need graphviz for rendering dot language to graph object (optional)

pizza analysis

heat the kitchen up for a pie

rjj pie

opt to draw a pie chart

plot a graph

draw a scatter plot; awesome!

plotter

rjj p

select first column as data for x-axis; second column as data for y-axis

scatter

draw a scatter plot with line connecting points

rjj pl

liner

draw a line graph

rjj l

charter

draw a bar chart

rjj bar

opt to assign label(s)

boxplot

draw one boxplot

rjj bx

boxplotter

draw many boxplot(s)

rjj box

god knows

draw a map from a god's-eye view

rjj map

donut

bake me a donut

rjj donut

output

generate csv and json output; beta test with columns: id, group, min (dynamic column names)

rjj output

report

generate pdf report(s); beta test with columns: id, name, group, min (dynamic column names)

rjj report

coder

encode and/or decode

rjj code

home

visit rjj's dirty house

rjj home

home sweet home ~~~ 7look.org

This version

0.8.5

