A versatile statistical toolkit for Python, featuring core statistical methods, time series analysis, signal processing, and climatology tools

StatFlow

A Python library for statistical analysis and data flow management.

Features

  • Statistical Analysis

    • Descriptive statistics
    • Hypothesis testing
    • Regression analysis
    • Time series analysis
    • Probability distributions
  • Data Flow Management

    • Data pipeline construction
    • ETL operations
    • Data validation
    • Data transformation
    • Data quality checks
  • Visualization

    • Statistical plots
    • Distribution visualizations
    • Time series plots
    • Correlation matrices
    • Custom charting
  • Data Processing

    • Data cleaning
    • Feature engineering
    • Data normalization
    • Outlier detection
    • Missing value handling
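
To make a few of the Data Processing items concrete, here is a minimal sketch in plain Python using only the standard library. It does not use statflow and makes no claim about how statflow implements these features; the helper names `drop_missing`, `zscore_outliers`, and `min_max_normalize` are hypothetical and chosen for illustration only.

```python
import statistics
from typing import List, Optional, Sequence


def drop_missing(values: Sequence[Optional[float]]) -> List[float]:
    """Missing value handling: discard entries recorded as None."""
    return [v for v in values if v is not None]


def zscore_outliers(values: Sequence[float], threshold: float = 2.0) -> List[float]:
    """Outlier detection: return values whose |z-score| exceeds the threshold."""
    mean = statistics.fmean(values)
    spread = statistics.pstdev(values)  # population standard deviation
    if spread == 0:
        return []  # all values identical, nothing can be an outlier
    return [v for v in values if abs(v - mean) / spread > threshold]


def min_max_normalize(values: Sequence[float]) -> List[float]:
    """Data normalization: rescale values linearly onto [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # avoid division by zero on constant data
    return [(v - lo) / (hi - lo) for v in values]


raw = [10, 12, None, 11, 13, 12, 11, 95]
clean = drop_missing(raw)
outliers = zscore_outliers(clean)
scaled = min_max_normalize(clean)
print(clean, outliers, scaled)
```

The z-score rule is only one of several common outlier criteria. On small samples it is quite conservative, which is why the sketch defaults to a threshold of 2.0 rather than the more familiar 3.0.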

Installation

Prerequisites

  • Python 3.8 or higher
  • pip or conda package manager

Using pip

pip install statflow

Using conda

conda install -c conda-forge statflow
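
After either command, one way to confirm which version ended up in the active environment is to query the installed distribution metadata. The sketch below uses only the standard-library `importlib.metadata` module (available from Python 3.8); the helper name `installed_version` is hypothetical.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("statflow")
if version is None:
    print("statflow is not installed in this environment")
else:
    print(f"statflow {version} is installed")
```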

Usage

Basic Usage

from statflow import DataFlow, StatisticalAnalysis

# Create a data flow pipeline
flow = DataFlow()
flow.load_data("data.csv")
flow.clean_data()
flow.transform_data()

# Perform statistical analysis
analysis = StatisticalAnalysis(flow.data)
summary = analysis.descriptive_stats()
correlation = analysis.correlation_matrix()
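
For readers unsure what a correlation matrix contains, the following standalone sketch computes pairwise Pearson correlations in plain Python. It is an illustration of the concept only: it does not call statflow, and the functions `pearson` and `correlation_matrix` defined here are hypothetical helpers, not the library's methods.

```python
import math
from typing import Dict, List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (norm_x * norm_y)


def correlation_matrix(columns: Dict[str, List[float]]) -> Dict[str, Dict[str, float]]:
    """Nested mapping where result[a][b] is the correlation of columns a and b."""
    return {a: {b: pearson(columns[a], columns[b]) for b in columns} for a in columns}


columns = {
    "x": [1.0, 2.0, 3.0, 4.0],
    "y": [2.0, 4.0, 6.0, 8.0],  # perfectly increasing with x
    "z": [4.0, 3.0, 2.0, 1.0],  # perfectly decreasing with x
}
matrix = correlation_matrix(columns)
for name, row in matrix.items():
    print(name, {other: round(value, 3) for other, value in row.items()})
```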

Advanced Usage

from statflow import TimeSeries, HypothesisTest

# Time series analysis
ts = TimeSeries("time_series_data.csv")
trend = ts.decompose()
forecast = ts.predict(steps=10)

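# Hypothesis testing
# sample1 and sample2 are illustrative placeholder data; substitute your own observations
sample1 = [5.1, 4.9, 5.4, 5.0, 5.2]
sample2 = [5.6, 5.8, 5.5, 5.9, 5.7]
test = HypothesisTest(sample1, sample2)
result = test.t_test()
p_value = result.p_value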
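
As background on what a two-sample t-test computes, here is a minimal standalone sketch of Welch's t statistic and its approximate degrees of freedom, written with the standard library only. This README does not say which t-test variant `t_test()` uses, so Welch's version is an assumption made for illustration, and the function name `welch_t` is hypothetical.

```python
import math
import statistics
from typing import Sequence, Tuple


def welch_t(sample_a: Sequence[float], sample_b: Sequence[float]) -> Tuple[float, float]:
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each sample needs at least two observations")
    # Squared standard error of each sample mean (sample variance / n).
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    if se2_a + se2_b == 0:
        raise ValueError("both samples are constant; t is undefined")
    t_stat = (statistics.fmean(sample_a) - statistics.fmean(sample_b)) / math.sqrt(se2_a + se2_b)
    dof = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t_stat, dof


group_a = [1.0, 2.0, 3.0, 4.0, 5.0]
group_b = [2.0, 3.0, 4.0, 5.0, 6.0]
t_stat, dof = welch_t(group_a, group_b)
print(f"t = {t_stat:.3f}, degrees of freedom = {dof:.1f}")
```

Turning the statistic into a p-value requires the Student's t distribution, which the standard library does not provide. SciPy's `scipy.stats.ttest_ind(a, b, equal_var=False)` performs the same test and returns the p-value directly.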

Project Structure

statflow/
├── statflow/              # Main package directory
│   ├── core/             # Core functionality
│   │   ├── flow.py      # Data flow management
│   │   └── stats.py     # Statistical analysis
│   ├── analysis/         # Analysis modules
│   │   ├── descriptive.py # Descriptive statistics
│   │   └── inferential.py # Inferential statistics
│   └── utils/            # Utility functions
│       ├── visualization.py # Plotting utilities
│       └── preprocessing.py # Data preprocessing
├── tests/                # Test suite
├── docs/                 # Documentation
└── examples/             # Usage examples

Contributing

Contributions are welcome. Feel free to submit a pull request.

Development Setup

  1. Clone the repository:

    git clone https://github.com/yourusername/statflow.git
    cd statflow
    
  2. Create a virtual environment:

    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    
  3. Install development dependencies:

    pip install -e ".[dev]"
    

Testing

Run the test suite:

pytest

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Thanks to all contributors who have helped improve this library
  • Inspired by various statistical analysis tools and data flow frameworks

Contact

For questions or suggestions, please open an issue on GitHub or contact the maintainers.

Download files

Source Distribution

statflow-3.4.4.tar.gz (31.0 kB)

Uploaded Source

Built Distribution

statflow-3.4.4-py3-none-any.whl (36.8 kB)

Uploaded Python 3

File details

Details for the file statflow-3.4.4.tar.gz.

File metadata

  • Download URL: statflow-3.4.4.tar.gz
  • Upload date:
  • Size: 31.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for statflow-3.4.4.tar.gz
Algorithm    Hash digest
SHA256       0f4727d7eca3492cb444558e9e1ddacf7c0145ccd557db447df16768be4a638a
MD5          91edb453e22c6422c37d26bb62200922
BLAKE2b-256  26641b9f12ab79127387c79cf86a6ede888fa8c1f5b3c33e2968822cc87b6fdb

File details

Details for the file statflow-3.4.4-py3-none-any.whl.

File metadata

  • Download URL: statflow-3.4.4-py3-none-any.whl
  • Upload date:
  • Size: 36.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for statflow-3.4.4-py3-none-any.whl
Algorithm    Hash digest
SHA256       00f4d608001beae05e20b2d86cd510d2e84f42b2cb018ec28f893a1aaa25bf7a
MD5          7a0ee328759d35e51597cb96b97598be
BLAKE2b-256  66e3bf6f381591cffe944587aaac8e21dddc0eb7c56e79641cda7486cfaa1860
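
The published digests can be used to check a manually downloaded file before installing it. The sketch below is one way to do that with the standard-library `hashlib` module. It assumes the files sit in the current working directory under the names listed above; the helper names `sha256_of` and `verify` are hypothetical.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file listings above.
EXPECTED_SHA256 = {
    "statflow-3.4.4.tar.gz": "0f4727d7eca3492cb444558e9e1ddacf7c0145ccd557db447df16768be4a638a",
    "statflow-3.4.4-py3-none-any.whl": "00f4d608001beae05e20b2d86cd510d2e84f42b2cb018ec28f893a1aaa25bf7a",
}


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Hash a file in chunks so large downloads need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_hex: str) -> bool:
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


for filename, expected in EXPECTED_SHA256.items():
    if not Path(filename).exists():
        print(f"{filename}: not found in the current directory, skipped")
    elif verify(filename, expected):
        print(f"{filename}: SHA256 matches")
    else:
        print(f"{filename}: SHA256 MISMATCH, do not install this file")
```

When installing through pip, the same check can be delegated to pip's hash-checking mode by pinning the digest in a requirements file with `--hash=sha256:...`.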
