Utility functions used in the DataCamp Statistical Thinking courses.
Project description
DataCamp Statistical Thinking utilities
Utility functions used in the DataCamp Statistical Thinking courses.
- Statistical Thinking in Python Part I
- Statistical Thinking in Python Part II
- Case Studies in Statistical Thinking
Installation
dc_stat_think may be installed by running the following command.
pip install dc_stat_think
Usage
Upon importing the module, functions from the DataCamp Statistical Thinking courses are available. For example, you can compute a 95% confidence interval of the mean of some data using the draw_bs_reps()
function.
>>> import numpy as np
>>> import dc_stat_think as dcst
>>> data = np.array([1.2, 3.3, 2.7, 2.4, 5.6,
3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
>>> bs_reps = dcst.draw_bs_reps(data, np.mean, size=10000)
>>> conf_int = np.percentile(bs_reps, [2.5, 97.5])
>>> print(conf_int)
[ 2.21818182 3.60909091]
Implementation
The functions include in dc_stat_think are not exactly like those students wrote in the DataCamp Statistical Thinking courses. Notable differences are listed below.
- The doc strings in dc_stat_think are much more complete.
- The dc_stat_think module has error checking of inputs.
- In most cases, especially those involving bootstrapping or other uses of the
np.random
module, dc_stat_think functions are more optimized for speed, in particular using Numba. Note, though, that dc_stat_think does not take advantage of any parallel computing.
If you do want to use functions exactly as written in the Statistical Thinking courses, you can use the dc_stat_think.original
submodule.
>>> import numpy as np
>>> import dc_stat_think.original
>>> data = np.array([1.2, 3.3, 2.7, 2.4, 5.6, 3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
>>> bs_reps = dc_stat_think.original.draw_bs_reps(data, np.mean, size=10000)
>>> conf_int = np.percentile(bs_reps, [2.5, 97.5])
>>> print(conf_int)
[ 2.20909091 3.59090909]
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template and then modified.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for dc_stat_think-1.1.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f24af603c7b3950857cb9ce34eb6e0df796eed8760c9b7870a1db87ba02af08 |
|
MD5 | 581fbccf722716cae06c6c7951a7730f |
|
BLAKE2b-256 | 976693f343227d0df3f953e94912df75675534c430cdc2a22251f45af85b542f |