Skip to main content

Utility functions used in the DataCamp Statistical Thinking courses.

Project description

# DataCamp Statistical Thinking utilities

[![version](https://img.shields.io/pypi/v/dc_stat_think.svg)](https://pypi.python.org/pypi/dc_stat_think) [![build status](https://img.shields.io/travis/justinbois/dc_stat_think.svg)](https://travis-ci.org/justinbois/dc_stat_think)

Utility functions used in the DataCamp Statistical Thinking courses.
- [Statistical Thinking in Python Part I](https://www.datacamp.com/courses/statistical-thinking-in-python-part-1/)
- [Statistical Thinking in Python Part II](https://www.datacamp.com/courses/statistical-thinking-in-python-part-2/)
- [Case Studies in Statistical Thinking](https://www.datacamp.com/courses/case-studies-in-statistical-thinking/)


## Installation
dc_stat_think may be installed by running the following command.
```
pip install dc_stat_think
```

## Usage
Upon importing the module, functions from the DataCamp Statistical Thinking courses are available. For example, you can compute a 95% confidence interval of the mean of some data using the `draw_bs_reps()` function.

```python
>>> import numpy as np
>>> import dc_stat_think as dcst
>>> data = np.array([1.2, 3.3, 2.7, 2.4, 5.6,
3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
>>> bs_reps = dcst.draw_bs_reps(data, np.mean, size=10000)
>>> conf_int = np.percentile(bs_reps, [2.5, 97.5])
>>> print(conf_int)
[ 2.21818182 3.60909091]
```

## Implementation
The functions include in dc_stat_think are not *exactly* like those students wrote in the DataCamp Statistical Thinking courses. Notable differences are listed below.

+ The doc strings in dc_stat_think are much more complete.
+ The dc_stat_think module has error checking of inputs.
+ In most cases, especially those involving bootstrapping or other uses of the `np.random` module, dc_stat_think functions are more optimized for speed, in particular using [Numba](http://numba.pydata.org). Note, though, that dc_stat_think does not take advantage of any parallel computing.

If you do want to use functions *exactly* as written in the Statistical Thinking courses, you can use the `dc_stat_think.original` submodule.

```python
>>> import numpy as np
>>> import dc_stat_think.original
>>> data = np.array([1.2, 3.3, 2.7, 2.4, 5.6, 3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
>>> bs_reps = dc_stat_think.original.draw_bs_reps(data, np.mean, size=10000)
>>> conf_int = np.percentile(bs_reps, [2.5, 97.5])
>>> print(conf_int)
[ 2.20909091 3.59090909]
```

## Credits
This package was created with [Cookiecutter](https://github.com/audreyr/cookiecutter) and the [audreyr/cookiecutter-pypackage](https://github.com/audreyr/cookiecutter-pypackage) project template and then modified.


=======
History
=======

0.1.0 (2017-07-20)
0.1.1 (2017-07-20)
0.1.2 (2017-07-24)
0.1.4 (2017-07-26)
0.1.5 (2017-08-17)
1.0.0 (2017-08-28)
------------------


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dc_stat_think-1.0.2.tar.gz (23.1 kB view details)

Uploaded Source

Built Distribution

dc_stat_think-1.0.2-py2.py3-none-any.whl (21.3 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file dc_stat_think-1.0.2.tar.gz.

File metadata

  • Download URL: dc_stat_think-1.0.2.tar.gz
  • Upload date:
  • Size: 23.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.7.2

File hashes

Hashes for dc_stat_think-1.0.2.tar.gz
Algorithm Hash digest
SHA256 dbbef87d0062be4455b3bde80f636d00323172cfd81248e7421ef1dfd32c7b6f
MD5 a0a9db4f887521f5e5045c9d3a876501
BLAKE2b-256 e9321a47b6f7ed07e9e60948f22855bf788e767206d4d0625de0933670133f73

See more details on using hashes here.

Provenance

File details

Details for the file dc_stat_think-1.0.2-py2.py3-none-any.whl.

File metadata

  • Download URL: dc_stat_think-1.0.2-py2.py3-none-any.whl
  • Upload date:
  • Size: 21.3 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.7.2

File hashes

Hashes for dc_stat_think-1.0.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 89e8f7a1e6c2a66a4a605ef15bdfd264be805e5922564b881c80d132c44cd261
MD5 36505065777f5a7aff873577eebc13cd
BLAKE2b-256 94eccb861d06dea11a0fcdaf4720cccf0590c6ada568e1d8fcb7207f22f3b4c1

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page