Skip to main content

A tool to fit data to many distributions and get the best one(s)

Project description

https://badge.fury.io/py/fitter.svg https://github.com/cokelaer/fitter/actions/workflows/main.yml/badge.svg?branch=main https://coveralls.io/repos/cokelaer/fitter/badge.png?branch=main Documentation Status https://zenodo.org/badge/23078551.svg

Compatible with Python 3.7, and 3.8, 3.9

What is it ?

The fitter package is a Python library used for fitting probability distributions to data. It provides a straightforward and and intuitive interface to estimate parameters for various types of distributions, both continuous and discrete. Using fitter, you can easily fit a range of distributions to your data and compare their fit, aiding in the selection of the most suitable distribution. The package is designed to be user-friendly and requires minimal setup, making it a useful tool for data scientists and statisticians working with probability distributions.

Installation

pip install fitter

fitter is also available on conda (bioconda channel):

conda install fitter

Usage

standalone

A standalone application (very simple) is also provided and works with input CSV files:

fitter fitdist data.csv --column-number 1 --distributions gamma,normal

It creates a file called fitter.png and a log fitter.log

From Python shell

First, let us create a data samples with N = 10,000 points from a gamma distribution:

from scipy import stats
data = stats.gamma.rvs(2, loc=1.5, scale=2, size=10000)

Now, without any knowledge about the distribution or its parameter, what is the distribution that fits the data best ? Scipy has 80 distributions and the Fitter class will scan all of them, call the fit function for you, ignoring those that fail or run forever and finally give you a summary of the best distributions in the sense of sum of the square errors. The best is to give an example:

from fitter import Fitter
f = Fitter(data)
f.fit()
# may take some time since by default, all distributions are tried
# but you call manually provide a smaller set of distributions
f.summary()
http://pythonhosted.org/fitter/_images/index-1.png

See the online documentation for details.

Contributors

Setting up and maintaining Fitter has been possible thanks to users and contributors. Thanks to all:

https://contrib.rocks/image?repo=cokelaer/fitter

Changelog

Version

Description

1.6.0

1.5.2

1.5.1

  • fixed regression putting back joblib

1.5.0

  • removed easydev and replaced by tqdm for progress bar

  • progressbar from tqdm also allows replacement of joblib need

1.4.1

1.4.0

1.3.0

1.2.3

  • remove vervose arguments in Fitter class. Using the logging module instead

  • the Fitter.fit has now a progress bar

  • add a standalone application called … fitter (see the doc)

1.2.2

was not released

1.2.1

adding new class called histfit (see documentation)

1.2

1.1

  • Fixed deprecated warning

  • fitter is now in readthedocs at fitter.readthedocs.io

1.0.9

1.0.6

  • summary() now returns the dataframe (instead of printing it)

1.0.5

https://github.com/cokelaer/fitter/issues

1.0.2

add manifest to fix missing source in the pypi repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fitter-1.6.0.tar.gz (26.9 kB view details)

Uploaded Source

File details

Details for the file fitter-1.6.0.tar.gz.

File metadata

  • Download URL: fitter-1.6.0.tar.gz
  • Upload date:
  • Size: 26.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.9

File hashes

Hashes for fitter-1.6.0.tar.gz
Algorithm Hash digest
SHA256 908223d75b35d3846984bfb2fed1d2a926da5a30b95b704aa95b2a894227c0af
MD5 8d1fa286322104f47fa381be179308ea
BLAKE2b-256 729f0d0a2b1e1a9d1c7145f13c4a57a491e2cff853160c36b56d4252886e4d57

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page