Skip to main content

A comprehensive data analysis and visualization toolkit.

Project description

DataNinja Package

![Logo](image/DATA (3).png)

Overview

DataNinja is an all-in-one data analysis toolkit designed to simplify data manipulation, statistical analysis, and visualization. It integrates popular libraries like pandas, numpy, matplotlib, and others into a single package, making it easier for data analysts and scientists to perform common data tasks.

Features

  • DataFrame Creation: Create DataFrames with ease.
  • Statistics Calculation: Compute basic statistics such as mean, median, variance, and standard deviation.
  • Data Profiling: Generate comprehensive data profile reports.
  • Missing Data Visualization: Visualize missing data patterns in your DataFrame.
  • Missing Value Handling: Handle missing values using different methods.
  • Visualization: Create various plots to analyze and visualize your data.

Installation

To install the DataNinja package, use pip:

pip install dataninja

Usage

  1. DataFrame Creation Create a DataFrame using the create_dataframe function:
import dataninja

data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}

df = dataninja.create_dataframe(data)
print(df)
  1. Statistics Calculation Calculate basic statistics using the calculate_statistics function:
# Assuming df is already created
stats = dataninja.calculate_statistics(df)
print(stats)
  1. Data Profiling Generate a data profile report using the generate_data_profile function. This can be rendered in Jupyter Notebooks:
# Assuming df is already created
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks
  1. Missing Data Visualization Visualize missing data patterns with the plot_missing_data function:
# Assuming df is already created
dataninja.plot_missing_data(df)
  1. Missing Value Handling Fill missing values using different methods:
# Assuming df is already created
df_filled = dataninja.fill_missing_values(df, method='mean')
print(df_filled)

Example

Here’s a complete example of using the DataNinja package:

import dataninja

# Create a DataFrame
data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}
df = dataninja.create_dataframe(data)

# Calculate statistics
stats = dataninja.calculate_statistics(df)
print("Statistics:\n", stats)

# Generate data profile report
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks

# Visualize missing data
dataninja.plot_missing_data(df)

# Fill missing values
df_filled = dataninja.fill_missing_values(df, method='mean')
print("Filled DataFrame:\n", df_filled)

Development

To contribute to the DataNinja package:

  1. Clone the Repository:
git clone https://github.com/ShelbyTO/DataNinja.git
  1. Navigate to the Project Directory:
cd DataNinja
  1. Install Dependencies:
pip install -r requirements.txt
  1. Run Tests:
pytest
  1. Make Your Changes and submit a pull request.

License

This package is licensed under the MIT License. See the LICENSE file for details.

Contact

For any issues or questions, please contact:

Author: Nicolas Prieur Email: pu-zle@live.fr

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

DataNinja-0.4.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

DataNinja-0.4-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file DataNinja-0.4.tar.gz.

File metadata

  • Download URL: DataNinja-0.4.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.4.tar.gz
Algorithm Hash digest
SHA256 621ebe075bd46fc7732c1fa67a21bbad9d385dfeb97963ae7935c81251be13fd
MD5 8d1a08804a8654240bfbd43911fd7b65
BLAKE2b-256 15ddd3042a2d533e3a1f794ff147071162229fc605217ac3de690441f51677bf

See more details on using hashes here.

File details

Details for the file DataNinja-0.4-py3-none-any.whl.

File metadata

  • Download URL: DataNinja-0.4-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 6c3892a9c758d980ad19ad3d3d34e912b03f396ff91a92ab9052db7e5c01d649
MD5 3ae88db6d3ca962c27145ed0dbc4b7cc
BLAKE2b-256 acc78b1640073693588426e815c00410614c310c9de42d6fa562aa60e295fe10

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page