A comprehensive data analysis and visualization toolkit.

Project description

DataNinja Package

Overview

DataNinja is an all-in-one data analysis toolkit that simplifies data manipulation, statistical analysis, and visualization. It wraps popular libraries such as pandas, NumPy, and Matplotlib, among others, in a single package, making common data tasks easier for data analysts and scientists.

Features

  • DataFrame Creation: Create DataFrames with ease.
  • Statistics Calculation: Compute basic statistics such as mean, median, variance, and standard deviation.
  • Data Profiling: Generate comprehensive data profile reports.
  • Missing Data Visualization: Visualize missing data patterns in your DataFrame.
  • Missing Value Handling: Fill missing values using a selectable method (for example, method='mean').
  • Visualization: Create various plots to analyze and visualize your data.

Installation

To install the DataNinja package, use pip:

pip install dataninja

Usage
1. DataFrame Creation
Create a DataFrame using the create_dataframe function:

import dataninja

data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}

df = dataninja.create_dataframe(data)
print(df)
2. Statistics Calculation
Calculate basic statistics using the calculate_statistics function:

# Assuming df is already created
stats = dataninja.calculate_statistics(df)
print(stats)
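
To make the four statistics concrete, the sketch below computes them for the sample data in plain Python, skipping missing values. It is not DataNinja's implementation: the helper name column_statistics is hypothetical, and the choice of sample (n - 1) variance and standard deviation is an assumption, since this README does not say which definition calculate_statistics uses.

```python
import statistics

# Sample data from this README; None marks a missing value.
data = {
    "A": [1, 2, None, 4],
    "B": [None, 2, 3, 4],
    "C": [1, None, None, 4],
}


def column_statistics(columns):
    """Return mean, median, variance, and standard deviation per column.

    Hypothetical helper, not part of DataNinja. Missing values (None) are
    skipped. Variance and standard deviation use the sample (n - 1)
    definition, which is an assumption; each column therefore needs at
    least two non-missing values.
    """
    summary = {}
    for name, values in columns.items():
        present = [v for v in values if v is not None]
        summary[name] = {
            "mean": statistics.mean(present),
            "median": statistics.median(present),
            "variance": statistics.variance(present),
            "std": statistics.stdev(present),
        }
    return summary


stats = column_statistics(data)
for name, column_stats in stats.items():
    print(name, column_stats)
```

For column B, the three present values 2, 3, and 4 give a mean and median of 3 and a sample variance of 1, which is a quick way to sanity-check the output of any statistics helper on this data.
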
3. Data Profiling
Generate a data profile report using the generate_data_profile function. This can be rendered in Jupyter Notebooks:

# Assuming df is already created
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks
4. Missing Data Visualization
Visualize missing data patterns with the plot_missing_data function:

# Assuming df is already created
dataninja.plot_missing_data(df)
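
A missing-data plot is essentially a picture of which cells are empty. As a rough, dependency-free illustration of the pattern such a plot conveys for the sample data, the sketch below builds a text grid and per-column counts. The helpers missing_pattern and missing_counts are hypothetical names, not DataNinja functions, and the actual plot produced by plot_missing_data may look quite different.

```python
# Sample data from this README; None marks a missing value.
data = {
    "A": [1, 2, None, 4],
    "B": [None, 2, 3, 4],
    "C": [1, None, None, 4],
}


def missing_pattern(columns):
    """Return one string per row: 'x' where a value is missing, '.' otherwise.

    Hypothetical helper, not part of DataNinja. Assumes all columns have
    the same length.
    """
    names = list(columns)
    n_rows = len(columns[names[0]])
    return [
        "".join("x" if columns[name][i] is None else "." for name in names)
        for i in range(n_rows)
    ]


def missing_counts(columns):
    """Return the number of missing values in each column."""
    return {
        name: sum(value is None for value in values)
        for name, values in columns.items()
    }


pattern = missing_pattern(data)
counts = missing_counts(data)

print("".join(data))  # column header: ABC
for row in pattern:
    print(row)
print(counts)
```

On the sample data, columns A and B each have one missing value and column C has two; only the last row is complete.
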
5. Missing Value Handling
Fill missing values with the fill_missing_values function; the method argument selects the fill strategy (here, 'mean'):

# Assuming df is already created
df_filled = dataninja.fill_missing_values(df, method='mean')
print(df_filled)
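
As an illustration of what mean-based filling typically means, the sketch below replaces each missing value with the mean of the non-missing values in the same column, using plain Python. This is an assumption about the behavior of method='mean', not DataNinja's actual code, and fill_with_column_mean is a hypothetical helper name.

```python
import statistics

# Sample data from this README; None marks a missing value.
data = {
    "A": [1, 2, None, 4],
    "B": [None, 2, 3, 4],
    "C": [1, None, None, 4],
}


def fill_with_column_mean(columns):
    """Return a copy of the data with None replaced by the column mean.

    Hypothetical helper, not part of DataNinja. The mean is computed over
    the non-missing values of each column; the input is left unchanged.
    """
    filled = {}
    for name, values in columns.items():
        present = [v for v in values if v is not None]
        mean = statistics.mean(present)
        filled[name] = [mean if v is None else v for v in values]
    return filled


filled = fill_with_column_mean(data)
for name, values in filled.items():
    print(name, values)
```

With the sample data, the gap in column B is filled with 3 and both gaps in column C are filled with 2.5. Mean filling keeps each column's mean unchanged but reduces its variance, which is worth keeping in mind before computing statistics on the filled data.
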
Example
Here’s a complete example of using the DataNinja package:

import dataninja

# Create a DataFrame
data = {
    'A': [1, 2, None, 4],
    'B': [None, 2, 3, 4],
    'C': [1, None, None, 4]
}
df = dataninja.create_dataframe(data)

# Calculate statistics
stats = dataninja.calculate_statistics(df)
print("Statistics:\n", stats)

# Generate data profile report
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe()  # For Jupyter Notebooks

# Visualize missing data
dataninja.plot_missing_data(df)

# Fill missing values
df_filled = dataninja.fill_missing_values(df, method='mean')
print("Filled DataFrame:\n", df_filled)

Development
To contribute to the DataNinja package:

Clone the Repository:

git clone https://github.com/ShelbyTO/DataNinja.git
Navigate to the Project Directory:

cd DataNinja
Install Dependencies:

pip install -r requirements.txt
Run Tests:

pytest
Make your changes and submit a pull request.

License
This package is licensed under the MIT License. See the LICENSE file for details.

Contact
For any issues or questions, please contact:

Author: Nicolas Prieur
Email: pu-zle@live.fr

Download files

Source Distribution

DataNinja-0.2.tar.gz (3.2 kB)

Uploaded Source

Built Distribution

DataNinja-0.2-py3-none-any.whl (3.9 kB)

Uploaded Python 3

File details

Details for the file DataNinja-0.2.tar.gz.

File metadata

  • Download URL: DataNinja-0.2.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.2.tar.gz
Algorithm    Hash digest
SHA256       a2ba9084fcce7d425afaa591281587f6533c2c57d05b28c9d560ed47a026a191
MD5          d971f5eeb7ecc6624bd7e03c966f9f5f
BLAKE2b-256  e3a5d0310a2376e9b5f9384619a3bcbea4eeeced9cf59c79d68dd9797bd30fa9

File details

Details for the file DataNinja-0.2-py3-none-any.whl.

File metadata

  • Download URL: DataNinja-0.2-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for DataNinja-0.2-py3-none-any.whl
Algorithm    Hash digest
SHA256       a5c80429d0a72f6314e151bba1128995a71f4474ba19dd67f307e22edf98b35b
MD5          c8464f4de1b5d3bf136412a0a3d92ba3
BLAKE2b-256  3fa95a21dbe0bd636b447278381534165dacb8d342caee2000bd0b7886da89c1
