A data science library for data science and data analysis teams

dataramp

dataramp is a Python library designed to assist data science and data analysis teams in their workflow. It provides various utility functions and tools to streamline common data science tasks.

Features

dataramp offers the following key features:

  1. Project Management: Simplifies the creation of standard data science project structures. With a single function call, you can generate a well-organized project directory with predefined folders for datasets, processed data, raw data, outputs, models, scripts, notebooks, and more.

  2. Model Saving and Loading: Provides easy-to-use functions for saving and loading trained machine learning models. It supports various formats such as joblib, pickle, and keras, enabling seamless integration with different model types.

  3. Data Exploration and Visualization: Includes functions for data exploration, summary statistics, and visualization. Quickly generate feature vi plots and visualize missing data to gain insights into your datasets.

  4. Feature Engineering: Provides methods for handling missing data and noise in your datasets. Offers functions for dropping columns whose missing values exceed a specified threshold and for detecting outliers using Tukey's Interquartile Range (IQR) method.

  5. Model Evaluation and Cross-Validation: Provides tools to evaluate model performance, including functions that calculate accuracy, F1-score, precision, and recall and that generate classification reports. Also supports cross-validation for model evaluation.

  6. Scaling and Normalization: Offers functions for min-max scaling and z-score normalization of data to bring features to a common scale.
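To make the outlier-detection and scaling features above concrete, here is a minimal standard-library sketch of the underlying techniques: Tukey's IQR fences, min-max scaling, and z-score normalization. It is not dataramp's implementation or API; the function names `tukey_outliers`, `min_max_scale`, and `z_score_normalize`, the fence multiplier `k=1.5`, and the sample values are assumptions chosen for illustration.

```python
import statistics


def tukey_outliers(values, k=1.5):
    """Return the values outside Tukey's fences [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant input: avoid dividing by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def z_score_normalize(values):
    """Shift to mean 0 and scale by the population standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:  # constant input: avoid dividing by zero
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


# Made-up sample with one extreme value (hypothetical data)
sample = [3.5, 3.0, 3.2, 3.1, 3.6, 3.9, 3.4, 9.0]

print(tukey_outliers(sample))            # → [9.0]
print(min_max_scale([1.0, 2.0, 3.0]))    # → [0.0, 0.5, 1.0]
print(z_score_normalize([1.0, 2.0, 3.0]))
```

The quartile convention matters for small samples: `method="inclusive"` treats the data as the whole population, while the default `"exclusive"` method can place the fences slightly differently. Which convention dataramp uses is not stated here.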

Quickstart

To use dataramp in your data science projects, you can install it via pip:

pip install dataramp

Once installed, you can import the library and explore its functionality:

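import pandas as pd  # needed for pd.read_csv below
import dataramp as dh  # import the dataramp library

df = pd.read_csv("data/iris.csv")  # load iris dataset

df.head()  # preview the first rows

cats = dh.eda.get_cat_vars(df)  # categorical variables
print(cats)

num_var = dh.eda.get_num_vars(df)  # numerical variables
print(num_var)

cat_count = dh.eda.get_cat_counts(df)  # counts for the categorical variables
cat_count

missing = dh.eda.display_missing(df)  # missing-value summary
missing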
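If you want to sanity-check results like these without dataramp, the same kind of summary can be sketched with plain pandas. This is only an approximation under stated assumptions: the small DataFrame stands in for `data/iris.csv`, the variable names are made up, and the exact return types of the `dh.eda` helpers may differ from what is shown here.

```python
import pandas as pd

# Small made-up frame standing in for data/iris.csv (hypothetical values)
df = pd.DataFrame(
    {
        "sepal_length": [5.1, 4.9, None, 6.3],
        "petal_width": [0.2, 0.2, 1.3, 2.5],
        "species": ["setosa", "setosa", "versicolor", None],
    }
)

# Columns with a numeric dtype
numeric_cols = df.select_dtypes(include="number").columns.tolist()

# Everything else is treated as categorical in this sketch
categorical_cols = df.select_dtypes(exclude="number").columns.tolist()

# Frequency of each category, per categorical column (missing values excluded)
category_counts = {col: df[col].value_counts() for col in categorical_cols}

# Number of missing values in each column
missing_per_column = df.isna().sum()

print(numeric_cols)
print(categorical_cols)
print(category_counts["species"])
print(missing_per_column)
```

Treating every non-numeric column as categorical is a simplification; boolean and datetime columns would also land in `categorical_cols` here.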

Links

Project: https://github.com/kimxons/dataramp

PyPI: https://pypi.org/project/dataramp/

Documentation

For detailed usage instructions and API reference, please refer to the official documentation at https://dataramp-docs.example.com

We use SemVer (Semantic Versioning) for versioning.

Contribution

dataramp is an open-source project, and we welcome contributions from the data science community. If you find a bug, have a feature request, or want to contribute improvements, please open an issue or submit a pull request on our GitHub repository at https://github.com/kimxons/dataramp.

License

dataramp is licensed under the MIT License. See the LICENSE file for more details.

Contact

If you have any questions or feedback, feel free to reach out to our support team at dev.kitonga@gmail.com or join our community forum at https://community.dataramp.com. We are here to assist you in making your data science journey smooth and successful!

