Data Exploration Terser

Project description

dexter

data exploration terser

What is dexter?

dexter is a lightweight Python package built on top of numpy and pandas for quickly exploring multiple structured table files in a folder. It is a high-level tool suited to a first look at a dataset made up of multiple DataFrames.

Features

  • Importing multiple table files with readm_csv()
  • Saving DataFrames and Names with the FrameMap class
  • Applying pandas methods to multiple DataFrames at once

Installing

Not yet available for installation, but the source can be downloaded and imported from: https://github.com/igormagalhaesr/dexter

Usage

Importing dexter:

import dexter as dxt

Reading multiple dataframes in a folder:

dataframes = dxt.readm_csv('./folder/')
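For comparison, here is a minimal sketch of the same idea in plain pandas. It is not dexter's implementation: `read_csv_folder` is a hypothetical helper, and keying each DataFrame by its file name without the extension is an assumption made for illustration.

```python
from pathlib import Path

import pandas as pd


def read_csv_folder(folder):
    """Read every *.csv file directly inside `folder` into a dict of DataFrames.

    Hypothetical helper, not part of dexter. Keys are the file names
    without the .csv extension, in alphabetical order.
    """
    frames = {}
    for path in sorted(Path(folder).glob("*.csv")):
        frames[path.stem] = pd.read_csv(path)
    return frames
```

Calling `read_csv_folder('./folder/')` would return one DataFrame per CSV file; files with other extensions are ignored.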

Displaying results:

dataframes.display()

Out:

col1 col2 col3
0 value1 $3
1 value2 $5
2 value3 $7

first second third
0 one 17%
1 two 19%
2 three 23%

Names and Frames

names = dataframes.names
frames = dataframes.frames
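To illustrate how a container can pair names with frames and forward a pandas method to each of them, here is a rough sketch. `SimpleFrameMap` and its `apply` method are hypothetical and only mirror the `names` and `frames` attributes shown above; dexter's actual FrameMap class may be organized differently.

```python
from dataclasses import dataclass

import pandas as pd


@dataclass
class SimpleFrameMap:
    """Hypothetical names-plus-frames container (not dexter's FrameMap)."""

    names: list
    frames: list

    def apply(self, method, *args, **kwargs):
        """Call the named pandas method on every frame; return results keyed by name."""
        return {
            name: getattr(frame, method)(*args, **kwargs)
            for name, frame in zip(self.names, self.frames)
        }


# Made-up sample data for illustration only.
fm = SimpleFrameMap(
    names=["prices", "rates"],
    frames=[
        pd.DataFrame({"col1": ["value1", "value2"], "col2": [3, 5]}),
        pd.DataFrame({"first": ["one", "two"], "second": [17, 19]}),
    ],
)

# First row of each frame, keyed by frame name.
heads = fm.apply("head", 1)
```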

Multiple Dataframes Types

dataframes.dtypes()

Multiple Missing Values

dataframes.multiple_missing()
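A plain-pandas sketch of counting missing values across several DataFrames at once follows. The sample data and variable names are invented for illustration, and the output layout of dexter's `multiple_missing()` may differ from this.

```python
import numpy as np
import pandas as pd

# Made-up sample frames; "a" contains missing values, "b" does not.
frames = {
    "a": pd.DataFrame({"x": [1.0, np.nan, 3.0], "y": ["u", None, None]}),
    "b": pd.DataFrame({"z": [1, 2, 3]}),
}

# One Series of per-column missing counts for each frame.
missing = {name: df.isna().sum() for name, df in frames.items()}

# Stack them into a single Series indexed by (frame name, column name).
missing_table = pd.concat(missing)
print(missing_table)
```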

Multiple Descriptions

dataframes.describe()
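The same pattern works for summary statistics. The sketch below calls the standard pandas `DataFrame.describe()` on each frame in a dict; the frames and names are made up, and this is only an approximation of what a multi-frame `describe()` could do.

```python
import pandas as pd

# Made-up sample frames for illustration only.
frames = {
    "a": pd.DataFrame({"x": [1, 2, 3]}),
    "b": pd.DataFrame({"y": [10.0, 20.0]}),
}

# One summary table (count, mean, std, min, quartiles, max) per frame.
summaries = {name: df.describe() for name, df in frames.items()}

for name, summary in summaries.items():
    print(f"--- {name} ---")
    print(summary)
```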

For more concrete examples, check the notebook

Contributing

  1. Fork it (https://github.com/igormagalhaesr/dexter)
  2. Create your feature branch (git checkout -b feature/fooBar)
  3. Commit your changes (git commit -am 'Add some fooBar')
  4. Push to the branch (git push origin feature/fooBar)
  5. Create a new Pull Request

Dependencies

  • numpy
  • pandas

License

Distributed under the BSD 3-Clause license. See LICENSE.txt for more information.

Contact

Igor Magalhaes – @igormagalhaesr – igormagalhaesr@gmail.com

github.com/igormagalhaesr
