Skip to main content

A comprehensive data analysis and visualization toolkit.

Project description

DataNinja Package

Overview

DataNinja is an all-in-one data analysis toolkit designed to simplify data manipulation, statistical analysis, and visualization. It integrates popular libraries like pandas, numpy, matplotlib, and others into a single package, making it easier for data analysts and scientists to perform common data tasks.

Features

  • DataFrame Creation: Create DataFrames with ease.
  • Statistics Calculation: Compute basic statistics such as mean, median, variance, and standard deviation.
  • Data Profiling: Generate comprehensive data profile reports.
  • Missing Data Visualization: Visualize missing data patterns in your DataFrame.
  • Missing Value Handling: Handle missing values using different methods.
  • Visualization: Create various plots to analyze and visualize your data.

Installation

To install the DataNinja package, use pip:

pip install dataninja

Usage

  1. DataFrame Creation Create a DataFrame using the create_dataframe function:
import dataninja

data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}

df = dataninja.create_dataframe(data)
print(df)
  1. Statistics Calculation Calculate basic statistics using the calculate_statistics function:
# Assuming df is already created
stats = dataninja.calculate_statistics(df)
print(stats)
  1. Data Profiling Generate a data profile report using the generate_data_profile function. This can be rendered in Jupyter Notebooks:
# Assuming df is already created
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks
  1. Missing Data Visualization Visualize missing data patterns with the plot_missing_data function:
# Assuming df is already created
dataninja.plot_missing_data(df)
  1. Missing Value Handling Fill missing values using different methods:
# Assuming df is already created
df_filled = dataninja.fill_missing_values(df, method='mean')
print(df_filled)

Example

Here’s a complete example of using the DataNinja package:

import dataninja

# Create a DataFrame
data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}
df = dataninja.create_dataframe(data)

# Calculate statistics
stats = dataninja.calculate_statistics(df)
print("Statistics:\n", stats)

# Generate data profile report
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks

# Visualize missing data
dataninja.plot_missing_data(df)

# Fill missing values
df_filled = dataninja.fill_missing_values(df, method='mean')
print("Filled DataFrame:\n", df_filled)

Development

To contribute to the DataNinja package:

  1. Clone the Repository:
git clone https://github.com/YourUsername/DataNinja.git
  1. Navigate to the Project Directory:
cd DataNinja
  1. Install Dependencies:
pip install -r requirements.txt
  1. Run Tests:
pytest
  1. Make Your Changes and submit a pull request.

License

This package is licensed under the MIT License. See the LICENSE file for details.

Contact

For any issues or questions, please contact:

Author: Nicolas Prieur Email: pu-zle@live.fr

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

DataNinja-0.3.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

DataNinja-0.3-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file DataNinja-0.3.tar.gz.

File metadata

  • Download URL: DataNinja-0.3.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.3.tar.gz
Algorithm Hash digest
SHA256 c8b94198d604d1df87680b08b522e34b725f4d9f23ba3486def64a6b416f9c17
MD5 e0fc0187dae95da5a13f79d0fdb1a1f1
BLAKE2b-256 01ce1ca8a8c460b6b6702831ad845136c147afb401a6cb3e617b0ada7e6a930d

See more details on using hashes here.

File details

Details for the file DataNinja-0.3-py3-none-any.whl.

File metadata

  • Download URL: DataNinja-0.3-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 72bf2d435d06bc9376a77af394a14b0748e0ce857dbbb996d8fd91baf2bc7f0d
MD5 a561424d4bace4fdc2d2f5720aef8afb
BLAKE2b-256 efc231861f63aa206fa07dc04f40f2306d17e954c6e68111c7c50ed7f8b8e383

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page