A comprehensive data analysis and visualization toolkit.
Project description
markdown Copier le code
DataNinja Package
Overview
DataNinja is an all-in-one data analysis toolkit designed to simplify data manipulation, statistical analysis, and visualization. It integrates popular libraries like pandas, numpy, matplotlib, and others into a single package, making it easier for data analysts and scientists to perform common data tasks.
Features
- DataFrame Creation: Create DataFrames with ease.
- Statistics Calculation: Compute basic statistics such as mean, median, variance, and standard deviation.
- Data Profiling: Generate comprehensive data profile reports.
- Missing Data Visualization: Visualize missing data patterns in your DataFrame.
- Missing Value Handling: Handle missing values using different methods.
- Visualization: Create various plots to analyze and visualize your data.
Installation
To install the DataNinja package, use pip:
pip install dataninja
Usage
1. DataFrame Creation
Create a DataFrame using the create_dataframe function:
python
Copier le code
import dataninja
data = {
'A': [1, 2, None, 4],
'B': [None, 2, 3, 4],
'C': [1, None, None, 4]
}
df = dataninja.create_dataframe(data)
print(df)
2. Statistics Calculation
Calculate basic statistics using the calculate_statistics function:
python
Copier le code
# Assuming df is already created
stats = dataninja.calculate_statistics(df)
print(stats)
3. Data Profiling
Generate a data profile report using the generate_data_profile function. This can be rendered in Jupyter Notebooks:
python
Copier le code
# Assuming df is already created
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe() # For Jupyter Notebooks
4. Missing Data Visualization
Visualize missing data patterns with the plot_missing_data function:
python
Copier le code
# Assuming df is already created
dataninja.plot_missing_data(df)
5. Missing Value Handling
Fill missing values using different methods:
python
Copier le code
# Assuming df is already created
df_filled = dataninja.fill_missing_values(df, method='mean')
print(df_filled)
Example
Here’s a complete example of using the DataNinja package:
python
Copier le code
import dataninja
# Create a DataFrame
data = {
'A': [1, 2, None, 4],
'B': [None, 2, 3, 4],
'C': [1, None, None, 4]
}
df = dataninja.create_dataframe(data)
# Calculate statistics
stats = dataninja.calculate_statistics(df)
print("Statistics:\n", stats)
# Generate data profile report
report = dataninja.generate_data_profile(df)
report.to_notebook_iframe() # For Jupyter Notebooks
# Visualize missing data
dataninja.plot_missing_data(df)
# Fill missing values
df_filled = dataninja.fill_missing_values(df, method='mean')
print("Filled DataFrame:\n", df_filled)
Development
To contribute to the DataNinja package:
Clone the Repository:
bash
Copier le code
git clone https://github.com/ShelbyTO/DataNinja.git
Navigate to the Project Directory:
bash
Copier le code
cd DataNinja
Install Dependencies:
bash
Copier le code
pip install -r requirements.txt
Run Tests:
bash
Copier le code
pytest
Make Your Changes and submit a pull request.
License
This package is licensed under the MIT License. See the LICENSE file for details.
Contact
For any issues or questions, please contact:
Author: Nicolas Prieur
Email: pu-zle@live.fr
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file DataNinja-0.2.tar.gz.
File metadata
- Download URL: DataNinja-0.2.tar.gz
- Upload date:
- Size: 3.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a2ba9084fcce7d425afaa591281587f6533c2c57d05b28c9d560ed47a026a191
|
|
| MD5 |
d971f5eeb7ecc6624bd7e03c966f9f5f
|
|
| BLAKE2b-256 |
e3a5d0310a2376e9b5f9384619a3bcbea4eeeced9cf59c79d68dd9797bd30fa9
|
File details
Details for the file DataNinja-0.2-py3-none-any.whl.
File metadata
- Download URL: DataNinja-0.2-py3-none-any.whl
- Upload date:
- Size: 3.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a5c80429d0a72f6314e151bba1128995a71f4474ba19dd67f307e22edf98b35b
|
|
| MD5 |
c8464f4de1b5d3bf136412a0a3d92ba3
|
|
| BLAKE2b-256 |
3fa95a21dbe0bd636b447278381534165dacb8d342caee2000bd0b7886da89c1
|