DataCamp Statistical Thinking utilities
Utility functions used in the DataCamp Statistical Thinking courses.
- Statistical Thinking in Python Part I
- Statistical Thinking in Python Part II
- Case Studies in Statistical Thinking
Installation
dc_stat_think may be installed by running the following command.
pip install dc_stat_think
Usage
Upon importing the module, functions from the DataCamp Statistical Thinking courses are available. For example, you can compute a 95% confidence interval of the mean of some data using the draw_bs_reps() function.
>>> import numpy as np
>>> import dc_stat_think as dcst
>>> data = np.array([1.2, 3.3, 2.7, 2.4, 5.6,
...                  3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
>>> bs_reps = dcst.draw_bs_reps(data, np.mean, size=10000)
>>> conf_int = np.percentile(bs_reps, [2.5, 97.5])
>>> print(conf_int)
[ 2.21818182 3.60909091]
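To make the idea behind the example concrete, here is a minimal sketch of drawing bootstrap replicates with plain NumPy. The helper name bootstrap_reps_sketch and its rng parameter are hypothetical and used only for illustration; this is not the dc_stat_think implementation. A seeded generator is used so that repeated runs give the same interval, though the numbers will generally differ from those printed above because bootstrap samples are random.

```python
import numpy as np


def bootstrap_reps_sketch(data, func, size=1, rng=None):
    """Draw `size` bootstrap replicates of `func` applied to 1D `data`.

    Hypothetical helper for illustration; not the dc_stat_think code.
    """
    rng = np.random.default_rng() if rng is None else rng
    data = np.asarray(data, dtype=float)
    reps = np.empty(size)
    for i in range(size):
        # Resample with replacement, keeping the original sample size
        sample = rng.choice(data, size=len(data), replace=True)
        reps[i] = func(sample)
    return reps


data = np.array([1.2, 3.3, 2.7, 2.4, 5.6, 3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
rng = np.random.default_rng(seed=3252)  # arbitrary seed, for repeatability
bs_reps = bootstrap_reps_sketch(data, np.mean, size=10000, rng=rng)

# The 2.5th and 97.5th percentiles bound a 95% confidence interval
conf_int = np.percentile(bs_reps, [2.5, 97.5])
print(conf_int)
```

Each replicate is the statistic computed on one resampled data set, so the spread of the replicates approximates the sampling variability of the mean.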
Implementation
The functions included in dc_stat_think are not exactly like those students wrote in the DataCamp Statistical Thinking courses. Notable differences are listed below.
- The docstrings in dc_stat_think are much more complete.
- The dc_stat_think module has error checking of inputs.
- In most cases, especially those involving bootstrapping or other uses of the np.random module, dc_stat_think functions are more optimized for speed, in particular using Numba. Note, though, that dc_stat_think does not take advantage of any parallel computing.
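As a rough illustration of the last two differences, the sketch below validates its input and then computes bootstrap replicates of the mean without a Python-level loop. The names check_1d_data and bootstrap_means_vectorized are hypothetical, the specific checks are assumptions, and NumPy vectorization stands in for the Numba compilation that dc_stat_think uses, so treat it as an analogy and not as the package's code.

```python
import numpy as np


def check_1d_data(data):
    """Return `data` as a 1D float array, raising on unusable input.

    Hypothetical helper; the checks dc_stat_think performs may differ.
    """
    data = np.asarray(data, dtype=float)
    if data.ndim != 1:
        raise ValueError("data must be a one-dimensional array")
    if data.size == 0:
        raise ValueError("data must contain at least one value")
    if np.isnan(data).any():
        raise ValueError("data must not contain NaN")
    return data


def bootstrap_means_vectorized(data, size=1, rng=None):
    """Bootstrap replicates of the mean, computed in one vectorized step."""
    data = check_1d_data(data)
    rng = np.random.default_rng() if rng is None else rng
    # One row of resampled indices per replicate
    idx = rng.integers(0, data.size, size=(size, data.size))
    return data[idx].mean(axis=1)


reps = bootstrap_means_vectorized([1.0, 2.0, 3.0], size=5)
print(reps.shape)
```

Drawing all resampled indices at once trades memory for speed, which is reasonable for small data sets but may not suit very large ones.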
If you do want to use functions exactly as written in the Statistical Thinking courses, you can use the dc_stat_think.original submodule.
>>> import numpy as np
>>> import dc_stat_think.original
>>> data = np.array([1.2, 3.3, 2.7, 2.4, 5.6, 3.4, 1.3, 3.9, 2.9, 2.1, 2.7])
>>> bs_reps = dc_stat_think.original.draw_bs_reps(data, np.mean, size=10000)
>>> conf_int = np.percentile(bs_reps, [2.5, 97.5])
>>> print(conf_int)
[ 2.20909091 3.59090909]
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template and then modified.