Skip to main content

A comprehensive data analysis and visualization toolkit.

Project description

DataNinja Package

Overview

DataNinja is an all-in-one data analysis toolkit designed to simplify data manipulation, statistical analysis, and visualization. It integrates popular libraries like pandas, numpy, matplotlib, and others into a single package, making it easier for data analysts and scientists to perform common data tasks.

Features

  • DataFrame Creation: Create DataFrames with ease.
  • Statistics Calculation: Compute basic statistics such as mean, median, variance, and standard deviation.
  • Data Profiling: Generate comprehensive data profile reports.
  • Missing Data Visualization: Visualize missing data patterns in your DataFrame.
  • Missing Value Handling: Handle missing values using different methods.
  • Visualization: Create various plots to analyze and visualize your data.

Installation

To install the DataNinja package, use pip:

pip install dataninja

Usage

  1. DataFrame Creation Create a DataFrame using the create_dataframe function:
import dataninja

data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}

df = dataninja.create_dataframe(data)
print(df)
  1. Statistics Calculation Calculate basic statistics using the calculate_statistics function:
# Assuming df is already created
stats = dataninja.calculate_statistics(df)
print(stats)
  1. Data Profiling Generate a data profile report using the generate_data_profile function. This can be rendered in Jupyter Notebooks:
# Assuming df is already created
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks
  1. Missing Data Visualization Visualize missing data patterns with the plot_missing_data function:
# Assuming df is already created
dataninja.plot_missing_data(df)
  1. Missing Value Handling Fill missing values using different methods:
# Assuming df is already created
df_filled = dataninja.fill_missing_values(df, method='mean')
print(df_filled)

Example

Here’s a complete example of using the DataNinja package:

import dataninja

# Create a DataFrame
data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}
df = dataninja.create_dataframe(data)

# Calculate statistics
stats = dataninja.calculate_statistics(df)
print("Statistics:\n", stats)

# Generate data profile report
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks

# Visualize missing data
dataninja.plot_missing_data(df)

# Fill missing values
df_filled = dataninja.fill_missing_values(df, method='mean')
print("Filled DataFrame:\n", df_filled)

Development

To contribute to the DataNinja package:

  1. Clone the Repository:
git clone https://github.com/ShelbyTO/DataNinja.git
  1. Navigate to the Project Directory:
cd DataNinja
  1. Install Dependencies:
pip install -r requirements.txt
  1. Run Tests:
pytest
  1. Make Your Changes and submit a pull request.

License

This package is licensed under the MIT License. See the LICENSE file for details.

Contact

For any issues or questions, please contact:

Author: Nicolas Prieur Email: pu-zle@live.fr

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

DataNinja-0.5.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

DataNinja-0.5-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file DataNinja-0.5.tar.gz.

File metadata

  • Download URL: DataNinja-0.5.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.5.tar.gz
Algorithm Hash digest
SHA256 4a0101270cf9942e503cac38eab84d2bf553a1b69bfa09a3ebef074f855ebe11
MD5 d76aa580c25d57ebd9b524f26d84ba3d
BLAKE2b-256 5983f309254a34e8644d9f362dbc7b5fb2d487956a287209f3a1772f9ec74a39

See more details on using hashes here.

File details

Details for the file DataNinja-0.5-py3-none-any.whl.

File metadata

  • Download URL: DataNinja-0.5-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 228ab419262b56cd57c6bef17d58ae2091920a20cbf827d597e3f38d2a899d34
MD5 35f1e407e350c5098b7a7f40077373e0
BLAKE2b-256 1dfb3cda8a4608812b214142f66bd4de424d47a70ea86295cc96631768547f64

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page