rjj (read-joint-jet) is a simple cmd-based data analysis/transforming wizard
Project description
rjj (read-joint-jet)
rjj is a simple cmd-based data transforming/analysis wizard ๐ป๐ โ
install it via pip/pip3
pip install rjj
update rjj
pip install rjj --upgrade
check current version
rjj -v
read user manual
rjj -h
data transformer
help you prepare your csv data file(s)
convertor
convert json to csv
rjj c
select a json file in the current directory, choose to enter another file name (don't need the extension) for output or not (Y/n); if not, the converted csv will be saved with the same name as the jsonโป
reversor
reverse csv back to json
rjj r
select a csv file in the current directory, choose to enter another file name for output or not (Y/n); if not, the converted json file will be saved with the same name; ๐support any data type, even emoji๐ท
converter
convert csv to json
rjj y
fix the reversor invisible special char assigned to 1st column name issue; but in return without emoji support
cluster
convert csv to json with cluster layer
rjj z
at least two columns in the selected csv file; column 2 should contain clustered data, i.e., a,b,c
eraser
erase duplicate record(s) ๐งฝ
rjj e
remove repeated row(s) for a selected csv file
join ๐
join all csv(s) together (repeated records accepted)
rjj join
pure join; will not create identifier(s); differ from jointer below
inner join ๐
inner join two csv files
rjj i
join record(s) based on identical value(s) in 1st column
outer join ๐
outer join two csv files
rjj o
join record(s) based on value(s) in 1st column; identical value(s) will be overlapped
detector
detect the co-existing record(s) between two csv files๐๐๐
rjj d
select two csv files to execute the detection process, then give a name for the output file; co-existing record(s) will be indicated in a newly created column Coexist
filter
locate the input Keyword
among all csv files in the current directory๐โ๐จ (and could opt to expand to its all sub-folder files; cool right?)๐
rjj f
source file (location info) will be indicated in a newly created first column Source_file
; the exact coordinate (x,y) will be given in the newly created second and third columns, namely Column_y
and Row_x
; and the full record will be pasted behind for simplifying your auditing work ๐
matcher
identify matched/repeated record(s)๐๐๐ among all csv in the current directory and extend to its sub-directories๐
rjj m
provide a name to the output file (if not, the output file will be named as output.csv); source file (location) will be indicated in a newly created column Source_file
uniquer
identify unique/non-repeated record(s)๐๐ among all csv in the current directory and extend to its sub-directories๐
rjj u
give a name to the output file; source location will be indicated in a newly created column Source_file
binder
bind all csv files together by columns in the current directory
rjj b
binder is different from jointer below; it can be considered as a horizontal merge whereas jointer is for vertical merge (basically by rows)
jointer and splitter ๐โ๏ธ
joint or split your data file(s)
jointer
joint all csv files in the current directory together๐
rjj j
all file names will be stored in the first field of the newly created column File
; when you execute the command you will be asked for assigning a name for the output file
splitter
split the selected csv file to different csv files and name it according to the value in the first field of that selected file๐
rjj s
extension for excel
extended function/module(s) for handling excel file(s)
xplit
split the selected excel (.xls or .xlsx) to pieces and name it according to the value in the first field of that selected excel
rjj x
joint
joint all excels (.xls and .xlsx) in the current directory together; all file names will be stored in the first field of the newly created column File
rjj t
differ from csv jointer, since both .xls and .xlsx is accepted, and the file extention will not be taken, it will be merged while two of them share the same file name (cannot be split by the command above); understand this condition, make good use of it! ๐
matxh
identify matched/repeated record(s)๐ in the current directory and extend to its sub-directories; for excel
rjj h
source file (location) will be indicated in a newly created column Source_file
; the exact sheet name inside that file will be told in another newly created column Sheet_name
uniquex
identify unique/non-repeated record(s)๐ in the current directory and extend to its sub-directories; for excel
rjj q
source will be indicated in a newly created column Source_file
; exact sheet will be told in Sheet_name
kilter
locate the input Keyword
among all excel files (.xls and .xlsx) in the current directory (and could expand to its sub-folders)๐โ๐จ
rjj k
this feature is similar to the csv filter; but since each excel file is possible to contain more than one sheet๐, the sheet number will be stored in the newly created column Sheet_z
, then the exact coordinate (x,y) will be given after it, namely Column_y
and Row_x
; and the full record will be pasted behind as well; super kooooo ๐ป
file/folder manager ๐๐
analyzor
run file analysis (process time depends on file size)
rjj a
return file statistics and a summary report; include sha256 hash, size, duplicate and uniqueness count, etc.
folder creator
create folder(s) according to the selected list (prepare a column storing all the folder name first)
rjj dir
statistical analysis ๐งฎ
simple statistical analysis is now available to perform on rjj ๐ป
one-sample z-test
compare group with norm (population mean and standard deviation known)
rjj oz
one-sample t-test
compare group with norm (population mean known)
rjj ot
paired-sample t-test
compare group across time/paired-feature
rjj pt
i.e., select first column as post-test data; second column as pre-test data
independent-sample t-test
compare two independent groups; able to opt to calculate it based on equal variance assumed or not assumed
rjj it
select first column as data of group 1; second column as data of group 2
levene test
run Levene test for two groups (centered by mean)
rjj lv
select first column as data of group 1; second column as data of group 2
homogeneity of variance
run Levene test for two or more groups
rjj hv
select first column as group variable; second column as data
one-way anova
compare two or more groups
rjj oa
select first column as group variable; second column as data
correlation analysis
calculate Pearson correlation coefficient (r)
rjj ca
explore the relationship between two variables
descriptive statistics
calculate it for a column
rjj n
calculate it by group(s)
rjj g
power analysis
priori (sample size estimation)
estimate sample size for paired-sample t-test
rjj pp
estimate sample size for independent-sample t-test
rjj pi
estimate sample size for one-way anova
rjj po
estimate sample size for correlation analysis
rjj pc
estimate sample size for regression analysis
rjj pr
post hoc (effect size evaluation)
evaluate effect size and power for one-sample t-test
rjj et
evaluate effect size and power for paired-sample t-test
rjj ep
evaluate effect size and power for independent-sample t-test
rjj ei
evaluate effect size and power for one-way anova
rjj eo
Tukey's Honest Significant Difference (HSD) implemented serves as post hoc multiple comparisons
regression analysis
make prediction(s) about the future ๐ฏ
rjj ra
evaluate the quality of predictor(s) as well as the intended model
regression model fit analysis
run model fit analysis of regression ๐
rjj fit
evaluate three commonly used models (by Rยฒ): linear, quadratic and cubic; see which one fits your data most
reliability test
run reliability analysis for a hypothetical construct; cheers! ๐ป
rjj rt
Cronbach alpha if item deleted is provided as well; make your item screening task easy
exploratory factor analysis
probably you cannot believe it works ๐ป
rjj efa
analyze the structure of interrelationship among a large set of items; summarize it into small set of factors
fixed factor exploratory analysis
run it with a fixed number of factor(s) โ๏ธ
rjj tea
remove unnecessary columns in your csv file; since it eats them up all this time ๐ป๐ฌ๐ฌ๐ฌ
confirmatory factor analysis
wow! finally it comes ๐๐ป
rjj cfa
draw svg factor diagram; you might need graphviz for rendering dot language to graph object (optional)
pizza analysis ๐
heat the kitchen up for a pie ๐ฅง
rjj pie
opt to draw a pie chart
plot a graph ๐
draw a scatter plot; awesome! ๐
plotter
rjj p
select first column as data for x-axis; second column as data for y-axis
scatter
draw a scatter plot with line connecting points
rjj pl
liner
draw a line graph ๐
rjj l
charter
draw a bar chart ๐
rjj bar
opt to assign label(s) ๐ป
boxplot
draw one boxplot ๐ฆ
rjj bx
boxplotter
draw many boxplot(s) ๐ฆ๐ฆ๐ฆ
rjj box
god knows
draw a map from god angle ๐ผ
rjj map
donut
bake me a donut ๐ฉ
rjj donut
output
generate csv and json output; beta test with columns: id, group, min (dynamic column names)
rjj output
report
generate pdf report(s); beta test with columns: id, name, group, min (dynamic column names)
rjj report
home
vist rjj's dirty house ๐ป๐ฐ
rjj home
๐ฉ home sweet home ~~~ 7look.org
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.