Skip to main content

Integrated Analytics Platform - Descriptive, Diagnostic & Predictive Analytics with Sample vs Population Distinction

Project description

BizLens v2.0.0

Integrated Analytics Platform — Descriptive, Diagnostic & Predictive Analytics with Sample vs Population Distinction

PyPI version License: MIT Python 3.8+


🎯 What is BizLens?

BizLens is a comprehensive analytics platform for business and educational use, featuring:

  • Descriptive Analytics: What happened? (statistics, distributions, visualizations)
  • Diagnostic Analytics: Why did it happen? (correlations, hypothesis testing, assumptions)
  • Predictive Analytics: What will happen? (regression, forecasting, confidence intervals)
  • Sample vs Population: Explicit distinction throughout (n-1 vs n denominator)

Designed for High School → Undergraduate → Postgraduate students and professionals.


✨ Key Features

📊 Descriptive Analytics

  • Central tendency: Mean, Median, Mode
  • Dispersion: Range, Variance, Standard Deviation, IQR
  • Distribution analysis: Skewness, Kurtosis
  • 9+ visualization types (histogram, boxplot, violin, density, heatmap, etc.)
  • Professional color schemes (Academic, Pastel, Vibrant)

🔍 Diagnostic Analytics

  • Hypothesis testing (t-tests, ANOVA, chi-square)
  • Correlation analysis (Pearson, Spearman)
  • Assumption checking (normality, linearity, homoscedasticity)
  • Segment analysis and comparisons
  • Effect size and statistical significance

🔮 Predictive Analytics

  • Linear regression (simple & multiple)
  • Time series forecasting with seasonality
  • Logistic regression (binary classification)
  • Confidence intervals & uncertainty quantification
  • Cross-validation and model evaluation
  • Diagnostic plots (residuals, Q-Q plots)

📚 Educational Excellence

  • Sample vs population distinction in all calculations
  • Mathematical notation and formulas
  • Real datasets with proper citations (Iris, Titanic, Gapminder, World Bank)
  • Jupyter notebook templates (12-section standardized structure)
  • Python fundamentals integrated throughout
  • Skill-level progression (Basics → Intermediate → Advanced)

🎓 Real Data Integration

  • Built-in sample datasets with citations
  • World Bank API integration (with caching)
  • Dataset metadata and quality reports
  • Reproducibility and provenance tracking

🚀 Quick Start

Installation

pip install bizlens==2.0.0

Basic Example

import bizlens as bl
import pandas as pd

# Load data
data = bl.load_dataset('iris')

# Describe (Sample-level statistics)
stats = bl.describe(data['sepal_length'], calculation_level='sample')
print(stats)

# Diagnose (Hypothesis test)
t_stat, p_value = bl.test.compare_groups(
    data[data['species']=='setosa']['sepal_length'],
    data[data['species']=='versicolor']['sepal_length']
)

# Predict (Linear regression)
prediction = bl.predict.regression.simple(
    x=data['sepal_length'],
    y=data['petal_length'],
    confidence_interval=0.95
)

📚 Learning Pathways

For Business Analytics

  • Sales forecasting with seasonal decomposition
  • Customer segmentation and profiling
  • Marketing effectiveness analysis
  • Revenue prediction and ROI estimation

For Data Science

  • Statistical foundations
  • Hypothesis testing workflows
  • Regression model development
  • Time series analysis and forecasting

For Academic Research

  • Rigorous statistical methods
  • Publication-ready visualizations
  • Assumption validation
  • Effect size and confidence intervals

🎯 Sample vs Population

A core pedagogical principle throughout BizLens:

# Sample (your dataset)
sample_stats = bl.describe(data, calculation_level='sample')  # Uses n-1

# Population (all possible values)
pop_stats = bl.describe(data, calculation_level='population')  # Uses n

# Compare both
bl.compare_sample_population(data)

🔧 API Overview

Descriptive Analytics

bl.describe(data)                    # Comprehensive statistics
bl.visualize.histogram(data)         # 9+ visualization types
bl.datasets.load_dataset('iris')     # Real datasets with citations

Diagnostic Analytics

bl.test.hypothesis(data1, data2)     # Hypothesis testing
bl.correlation.pearson(data)         # Correlations
bl.assumptions.normality(data)       # Assumption checking

Predictive Analytics

bl.predict.regression.simple(x, y)   # Simple linear regression
bl.predict.forecast(timeseries)      # Time series forecasting
bl.predict.classify.logistic(X, y)   # Logistic regression

📖 Documentation & Examples

  • Quick Start: 15 minutes to first analysis
  • Notebooks: 12-section templates for all use cases
  • API Reference: Complete function documentation
  • Roadmap: v2.1+ will add ML foundations (decision trees, random forests, clustering)

🛣️ Roadmap

v2.0.0 (Current) - Integrated Analytics Platform

  • ✅ Descriptive, diagnostic & predictive analytics
  • ✅ Sample vs population distinction
  • ✅ Real data integration
  • ✅ Educational focus

v2.1 (Q3 2026) - ML Foundations

  • Classification (decision trees, random forests, naive bayes)
  • Clustering (K-means, hierarchical, DBSCAN)
  • Dimensionality reduction (PCA, feature selection)
  • AutoML basics

v2.2 (Q4 2026) - Advanced ML

  • Ensemble methods (XGBoost, LightGBM, stacking)
  • Advanced time series (ARIMA, SARIMA, Prophet)
  • Anomaly detection (Isolation Forest, LOF)
  • Explainability (SHAP, LIME)

v3.0 (2027) - Deep Learning

  • Neural networks (MLPs, CNNs, RNNs)
  • Transfer learning
  • Reinforcement learning
  • Foundation model integration

📦 Requirements

  • Python 3.8+
  • pandas >= 1.3.0
  • numpy >= 1.21.0
  • scipy >= 1.7.0
  • matplotlib >= 3.3.0
  • seaborn >= 0.11.0

Optional:

  • polars >= 0.14.0 (for performance)
  • plotly >= 5.0.0 (for interactive plots)

📄 License

MIT License - See LICENSE file for details

👨‍💻 Author

Sudhanshu Singh

🤝 Contributing

Contributions welcome! Please see CONTRIBUTING.md for guidelines.

📝 Citation

@software{bizlens2026,
  title={BizLens: Integrated Analytics Platform},
  author={Singh, Sudhanshu},
  year={2026},
  url={https://github.com/solutiongate-learn/bizlens}
}

BizLens v2.0.0 - Making analytics accessible, rigorous, and educational.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bizlens-2.0.1.tar.gz (30.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bizlens-2.0.1-py3-none-any.whl (25.4 kB view details)

Uploaded Python 3

File details

Details for the file bizlens-2.0.1.tar.gz.

File metadata

  • Download URL: bizlens-2.0.1.tar.gz
  • Upload date:
  • Size: 30.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for bizlens-2.0.1.tar.gz
Algorithm Hash digest
SHA256 65fc24beeee16eb8866b01f58026950225c8671c57919763f7d1d5e2c2e64216
MD5 50b5c5aa583fd81cd5a9e2b7d51c0a43
BLAKE2b-256 24287dda9b24c8e6c012fc09e6f027771ce8ede0a75979c0c0b60fd4c7402657

See more details on using hashes here.

File details

Details for the file bizlens-2.0.1-py3-none-any.whl.

File metadata

  • Download URL: bizlens-2.0.1-py3-none-any.whl
  • Upload date:
  • Size: 25.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for bizlens-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e0e8e0e216b2e2a16c5b89a26c1325fd6bb38cc0dd490511bb4ff1e548fee11f
MD5 0ad1c54689173b2e29152ab52acab52b
BLAKE2b-256 9f605c241b42a541a962c1ff240a117624ab2caaa074e160b11689b0fe371bf4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page