Skip to main content

Useful crystal-ball-related stuff.

Project description

If you have hundreds of csvs you need to quickly digest and understand, you can use crystal-ball to help with the onboarding and data exploration phase of your project. You will be able to immediately dive into an expansive data set without getting lost.

CrystalBall Features

  • find specific columns and tables you may be interested in, but may have otherwise not known where to look.

  • walk through connections between different csvs,

  • compare and establish foreign key and primary key relationships by using simple boxplots

  • dynamically create a master table of useful information while simultaneously recording your step-by-step process for future reference.

Installation and Usage

pip install crystal-ball</programlisting> You can start using CrystalBall right away by importing it and initializing it with a relative directory containing the CSVs.

import crystalball as cb

ball = cb.run("insert relative directory here")

Methods

Note that all methods that involve searching via keynames are case sensitive.

cb.contains(keywords: list, all_colnames: list=None) → list

Check if keywords exist in all_colnames.

cb.featureSearch(keywords: list, all_colnames: list=None) → list

Find the columns that contain the keywords.

cb.tableSearch(keywords: list, csvname_to_colnames_list=None, mode: str=UNION) → list

Find the tables that contain the keywords.

cb.openTable(rel_dir: str, indices: list=[0]) → DataFrame

Open the csv that is referenced by the given relative directory.

cb.subTable(supertable: DataFrame, chosen_index: list, chosen_columns: list) → DataFrame

Create a subtable from a supertable.

cb.mergeTables(tables: list) → DataFrame

Sequentially merge a list of tables that all share a common index.

cb.analyzeRelationships(to_analyze: list, visualize: bool=True) → DataFrame

Analyze basic stats of one or more different Series.

compareRelationship(to_analyze1: Series, to_analyze2: Series, visualize: bool=False) → DataFrame

Compare and contrast the difference between two Series.

cb.export(df_to_export: DataFrame, write_to: str, export_type: str=CSV) → None

Exports contents of dataframe to relative location specified by write_to arg.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

crystal_ball-0.1.9-py3-none-any.whl (8.0 kB view details)

Uploaded Python 3

File details

Details for the file crystal_ball-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: crystal_ball-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 8.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.4

File hashes

Hashes for crystal_ball-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 049b0975bb0b7c3cdf4cc8f5e8dac803c87f0c7f040873174d41e24faad2e33d
MD5 f3689948c2a88a9b0e56130a93e0db35
BLAKE2b-256 fb746cd0d0e2e50d0f44807e3922147ca78ad0c34c068b4995bc71f48bc28ae0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page