Skip to main content

Perform exploratory data analysis (EDA) on quantitative data.

Project description

quanteda

CI/CD codecov Documentation Status License: MIT Python 3.9.0 release version Project Status: Active – The project has reached a stable, usable state and is being actively developed.

Perform exploratory data analysis (EDA) on quantitative data.

Function Description

This package aims to be the starting point for any analysis of quantitative financial data by supplying functions that create charts and metrics to simplify exploratory data analysis and give the user a jump-start on their project. This package simplifies the creation of charts that look at the distribution of numeric features and missing information in the data set; two critical steps in any financial analysis. The package also includes a function that will generate a random time series. Financial variables like stock prices and interest rates vary over time, so this ability to generate a time series quickly is extremely useful. Finally, this package also includes a function that will automatically calculate several useful financial metrics so that more time can be spent on more complicated analysis.

The functions in this package include:

  • plot_missing_vals: Plot tick chart to display missing values for all numeric features in the dataset.
  • plot_num_dist: Creates a chart of histograms for all numeric features in a data set.
  • generate_return_series: Generates a DataFrame with independent time series of returns.
  • generate_financial_metrics: Calculates financial metrics based on a DataFrame of random returns on time series. These metrics are total return, annual return, annual volatilities and sharpe ratio

Contributors

  • Doris (Yun Yi) Cai
  • Jake Barnabe
  • John Shiu
  • Merete Lutz

Installation

For User

To install the package, run the following command from the terminal

pip install quanteda

For Developers

  1. Clone this repository.
 git clone git@github.com:UBC-MDS/quanteda.git
 cd quanteda/
  1. Install the virtual environment.
 conda env create -f environment.yml
  1. Activate the installed environment:
 conda activate quanteda
  1. Install the package.
 poetry install

Usage

Using this package

To use this package, import and call the functions in Python. Below is an example:

from quanteda.plot_missing_vals import plot_missing_vals
from quanteda.plot_num_dist import plot_num_dist
from quanteda.generate_financial_metrics import generate_financial_metrics
from quanteda.generate_return_series import generate_return_series

Call function plot_missing_vals to visualize the presence of missing values. image

Call function plot_num_dist to plot the distribution of the return series. image

Call function generate_financial_metrics to calculate the financial metrics of the return series. image

Call function generate_return_series to simulate time series return given an expected return, volatility and return distribution of a stock.

image

Run unit tests

Execute the following in the project root directory to run the unit tests of the package,

 poetry run pytest

or, to run with the code covergage reporting,

 poetry run pytest --cov=quanteda

Documentation

The official documentation is hosted on Read the Docs: https://quanteda.readthedocs.io/en/latest/

quanteda in the Python Ecosystem

Our package fills a gap in the python ecosystem by being marketed specifically to financial data. Python users commonly create EDA charts using popular packages like matplotlib, altair, and seaborn, and conduct their financial analysis using packages like pandas, numpy, and scipy. These libraries are extensive, but have been generalized to be as useful as possible to as many differen fields as possible. It takes time to learn the syntax and code of these packages that work for financial data. This can be time consuming during EDA, when the goal is to quickly get a rough idea of what the data set you are using looks like. Our package will simplify these actions into a few functions that will save time on tedious EDA so that more time can be spent on analysis and testing.

Contributing

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

quanteda was created by Doris (Yun Yi) Cai, Jake Barnabe, John Shiu, Merete Lutz. It is licensed under the terms of the MIT license.

Credits

quanteda was created with cookiecutter and the py-pkgs-cookiecutter template.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quanteda-2.0.7.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

quanteda-2.0.7-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file quanteda-2.0.7.tar.gz.

File metadata

  • Download URL: quanteda-2.0.7.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for quanteda-2.0.7.tar.gz
Algorithm Hash digest
SHA256 203f6ee5b061ab99bf2bd0426fdcf42f81be4413d65f5731093728693d645d08
MD5 2033b722a28f572f51dc30d3d3ae1ee8
BLAKE2b-256 dda480ade35a3cb69631c267cf0b51ce2527a9996a9bd3bd7833f0f6330a1de7

See more details on using hashes here.

File details

Details for the file quanteda-2.0.7-py3-none-any.whl.

File metadata

  • Download URL: quanteda-2.0.7-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for quanteda-2.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 a3f78b6f65135ca241aa595a45a4f38e1519a3744bf44af993503e526d923ab3
MD5 0c99de606174043153dab27d8019e502
BLAKE2b-256 bb8db94b73711c8ba3b27badd10f6b420d29ea1ec6d54b42ba3625a601a28bba

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page