Advanced Econometric Analysis Tools

EconKit - Function Descriptions

econkit is a Python library that provides various statistical and econometric analysis tools, including descriptive statistics, correlation matrices, and tests for stationarity and autocorrelation.

Functions

Descriptive Statistics

`descriptives(data)`

Computes descriptive statistics for each numeric column in a DataFrame.

Parameters:

data: pandas.DataFrame containing the data to be analyzed.

Returns:

None. Prints a summary table of the descriptive statistics.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df = pd.read_csv('your_data.csv')
ec.descriptives(df)
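
To try the idea without a CSV file, the sketch below builds a small in-memory DataFrame and computes a comparable summary with plain pandas. It is only an approximation of what `descriptives` reports: the column names are made up, and the exact set of statistics that econkit prints is an assumption here.

```python
import pandas as pd

# Small in-memory dataset; the column names are made up for illustration.
df = pd.DataFrame({
    "gdp_growth": [2.1, 2.5, 1.8, 3.0, 2.7],
    "inflation": [1.5, 1.7, 2.0, 2.2, 1.9],
    "region": ["N", "S", "E", "W", "N"],  # non-numeric, excluded below
})

# Keep only the numeric columns, as the function description states.
numeric = df.select_dtypes(include="number")

# describe() gives count, mean, std, min, quartiles and max per column;
# skewness and excess kurtosis are appended as extra rows.
summary = numeric.describe()
summary.loc["skew"] = numeric.skew()
summary.loc["kurtosis"] = numeric.kurt()

print(summary.round(3))
```

Replacing `df` with your own data and calling `ec.descriptives(df)` should give a similar overview in a single call.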

Correlation Matrix

`correlation(df, method='Pearson', p=False)`

Calculates and prints the correlation matrix for the numeric columns in the provided DataFrame, and optionally the matching p-value matrix. Supports Pearson, Spearman, and Kendall correlation methods.

Parameters:

df: pandas.DataFrame containing the data to be analyzed.
method: str (optional). Method of correlation ('Pearson', 'Spearman', or 'Kendall'). Default is 'Pearson'.
p: bool (optional). If True, the p-value matrix is also printed; if False, only the correlation matrix is printed. Default is False.

Returns:

None. Prints the correlation matrix and optionally the p-value matrix.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df = pd.read_csv('your_data.csv')
ec.correlation(df, method='Spearman', p=True)
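
For readers who want to see what a correlation matrix with a companion p-value matrix looks like, here is a minimal sketch using pandas and SciPy. The helper `corr_with_pvalues` and the sample data are hypothetical and are not part of econkit; how econkit computes its p-values internally is not documented above.

```python
import pandas as pd
from scipy import stats

# Made-up data: y rises with x, z falls with x.
df = pd.DataFrame({
    "x": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "y": [1.2, 1.9, 3.5, 3.7, 5.1, 6.4],
    "z": [6.0, 5.5, 4.1, 3.9, 2.2, 1.0],
})

def corr_with_pvalues(frame, test=stats.spearmanr):
    """Hypothetical helper: pairwise correlation and p-value matrices."""
    cols = frame.columns
    corr = pd.DataFrame(1.0, index=cols, columns=cols)
    pval = pd.DataFrame(0.0, index=cols, columns=cols)
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            # Each SciPy test unpacks to (statistic, p-value).
            r, p = test(frame[a], frame[b])
            corr.loc[a, b] = corr.loc[b, a] = r
            pval.loc[a, b] = pval.loc[b, a] = p
    return corr, pval

corr, pval = corr_with_pvalues(df)
print(corr.round(3))
print(pval.round(4))
```

Passing `stats.pearsonr` or `stats.kendalltau` as `test` switches the method, since all three SciPy functions return a statistic and a p-value.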

Augmented Dickey-Fuller (ADF) Test

`adf(dataframe, maxlag=None, regression='c', autolag='AIC', handle_na='drop')`

Performs the ADF test on each column in the DataFrame and prints a summary table.

Parameters:

dataframe: pandas.DataFrame containing the data to be tested.
maxlag: int (optional). Maximum number of lags to use. Default is None.
regression: str (optional). Type of regression trend ('c', 'ct', 'ctt', 'nc'). Default is 'c'.
autolag: str (optional). Method for lag length selection ('AIC', 'BIC', 't-stat'). Default is 'AIC'.
handle_na: str (optional). How to handle missing values ('drop' or 'fill'). Default is 'drop'.

Returns:

None. Prints a summary table of the ADF test results.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df = pd.read_csv('your_data.csv')
ec.adf(df)

KPSS Test

`kpss(dataframe, regression='c', nlags='auto', handle_na='drop')`

Performs the KPSS test on each column in the DataFrame and prints a summary table.

Parameters:

dataframe: pandas.DataFrame containing the data to be tested.
regression: str (optional). Type of regression trend ('c' or 'ct'). Default is 'c'.
nlags: str or int (optional). Number of lags to use. Default is 'auto'.
handle_na: str (optional). How to handle missing values ('drop' or 'fill'). Default is 'drop'.

Returns:

None. Prints a summary table of the KPSS test results.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df = pd.read_csv('your_data.csv')
ec.kpss(df)

Durbin-Watson Test

`dw(data)`

Performs the Durbin-Watson autocorrelation test and Ljung-Box test for each column of the dataset.

Parameters:

data: pandas.DataFrame containing the time series data.

Returns:

None. Prints a summary table of the Durbin-Watson and Ljung-Box test results.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df = pd.read_csv('your_data.csv')
ec.dw(df)
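
For orientation, the Durbin-Watson statistic itself is simple to compute: the sum of squared successive differences divided by the sum of squares. The sketch below uses NumPy only. The function name `durbin_watson` is hypothetical, and whether econkit demeans or otherwise transforms each column first is not stated above.

```python
import numpy as np

def durbin_watson(residuals):
    """DW = sum((e_t - e_{t-1})^2) / sum(e_t^2)."""
    e = np.asarray(residuals, dtype=float)
    return float(np.sum(np.diff(e) ** 2) / np.sum(e ** 2))

# Alternating signs: strong negative autocorrelation, DW well above 2.
alternating = [1, -1, 1, -1, 1, -1]
# Long runs of the same sign: positive autocorrelation, DW well below 2.
persistent = [1, 1, 1, -1, -1, -1]

print(round(durbin_watson(alternating), 3))  # 3.333
print(round(durbin_watson(persistent), 3))   # 0.667
```

Values near 2 suggest no first-order autocorrelation, values towards 0 suggest positive autocorrelation, and values towards 4 suggest negative autocorrelation.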

Normality Tests

`normality_tests(data, pvalue=False)`

Performs normality tests (Shapiro-Wilk and Kolmogorov-Smirnov) on each column of a DataFrame and displays results as a formatted table.

Parameters:

data: pandas.DataFrame containing the numerical data.
pvalue: bool (optional). If True, displays the p-value matrix along with the test results. Default is False.

Returns:

None. Prints the results matrix and optionally the p-value matrix.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df = pd.read_csv('your_data.csv')
ec.normality_tests(df, pvalue=True)
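
The two tests named above are both available in SciPy, so a per-column table can be sketched as follows. This is an illustration rather than econkit's implementation: the table layout is invented, and fitting the Kolmogorov-Smirnov reference distribution to the sample mean and standard deviation is an assumption about how the test is set up.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Simulated data: one normal sample and one clearly skewed sample.
rng = np.random.default_rng(1)
df = pd.DataFrame({
    "normal": rng.normal(loc=0.0, scale=1.0, size=200),
    "skewed": rng.exponential(scale=1.0, size=200),
})

rows = []
for name in df.columns:
    x = df[name].dropna().to_numpy()
    sw_stat, sw_p = stats.shapiro(x)
    # KS test against a normal with the sample's own mean and std.
    ks_stat, ks_p = stats.kstest(x, "norm", args=(x.mean(), x.std(ddof=1)))
    rows.append({
        "series": name,
        "shapiro_stat": sw_stat,
        "shapiro_p": sw_p,
        "ks_stat": ks_stat,
        "ks_p": ks_p,
    })

normality_table = pd.DataFrame(rows).set_index("series")
print(normality_table.round(4))
```

For both tests the null hypothesis is that the data are normally distributed, so a small p-value is evidence against normality.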

Random Number Generator

`random_number_generator(num_vars, num_random, distribution, mean=0, std_dev=1, seed=None, lower=None, upper=None, p=None, n_trials=None, lambda_param=None, start=None, stop=None, step=None, repeat_each=None, repeat_sequence=None)`

Generates random numbers based on the specified distribution and parameters.

Parameters:

num_vars: int. Number of variables (columns) to generate.
num_random: int. Number of random values per variable.
distribution: str. The distribution type ('normal', 'uniform', 'binomial', 'bernoulli', 'poisson', 'patterned').
mean: float. Mean for the normal distribution. Default is 0.
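std_dev: float. Standard deviation for the normal distribution. Default is 1.
seed: int (optional). Seed for the random number generator; set it to reproduce the same output. Default is None.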
lower: float. Lower bound for the uniform distribution.
upper: float. Upper bound for the uniform distribution.
p: float. Probability for Bernoulli or Binomial distributions.
n_trials: int. Number of trials for the Binomial distribution.
lambda_param: float. Lambda parameter for the Poisson distribution.
start: float. Start value for the patterned distribution.
stop: float. Stop value for the patterned distribution.
step: float. Step size for the patterned distribution.
repeat_each: int. Number of times to repeat each number in the patterned distribution.
repeat_sequence: int. Number of times to repeat the entire sequence.

Returns:

pandas.DataFrame. A DataFrame containing the generated random numbers.

Example Usage:

from econkit import econometrics as ec

random_data = ec.random_number_generator(num_vars=2, num_random=10, distribution='normal', mean=0, std_dev=1, seed=42)
print(random_data)
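
Two of the options above are easier to understand with a small NumPy sketch: seeding for reproducible draws, and the 'patterned' option built from start, stop, step, repeat_each and repeat_sequence. The helper `make_normal`, the column names, and the exclusive stop value (which is `np.arange` behavior) are assumptions for illustration; econkit may name columns or treat the stop value differently.

```python
import numpy as np
import pandas as pd

def make_normal(num_vars, num_random, mean=0.0, std_dev=1.0, seed=None):
    """Hypothetical helper: num_vars columns of num_random normal draws."""
    rng = np.random.default_rng(seed)
    values = rng.normal(loc=mean, scale=std_dev, size=(num_random, num_vars))
    cols = [f"var_{i + 1}" for i in range(num_vars)]
    return pd.DataFrame(values, columns=cols)

# The same seed gives identical draws.
a = make_normal(2, 10, seed=42)
b = make_normal(2, 10, seed=42)
print(a.equals(b))  # True

# A patterned sequence: 1..3 in steps of 1, each value repeated twice,
# and the whole sequence repeated twice.
pattern = np.tile(np.repeat(np.arange(1, 4, 1), 2), 2)
print(pattern.tolist())  # [1, 1, 2, 2, 3, 3, 1, 1, 2, 2, 3, 3]
```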

Combine DataFrames

`combine_dataframes(column_name, *dataframes)`

Extracts the specified column from each input DataFrame and combines the results into a new DataFrame, aligned on the index.

Parameters:

column_name: str. The column to extract from each DataFrame.
*dataframes: Multiple pandas DataFrames passed as positional arguments.

Returns:

pandas.DataFrame. A new DataFrame combining the specified columns from all input DataFrames.

Example Usage:

import pandas as pd
from econkit import econometrics as ec

df1 = pd.read_csv('data1.csv')
df2 = pd.read_csv('data2.csv')

combined_df = ec.combine_dataframes('Close', df1, df2)
print(combined_df)
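
Index alignment matters when the inputs do not cover the same dates. The sketch below shows the idea with `pandas.concat` on made-up price data. The output column labels and the outer join are assumptions; econkit may label the columns or handle non-overlapping dates differently.

```python
import pandas as pd

# Two made-up price tables with partly overlapping dates.
idx1 = pd.to_datetime(["2024-06-03", "2024-06-04", "2024-06-05"])
idx2 = pd.to_datetime(["2024-06-04", "2024-06-05", "2024-06-06"])
df1 = pd.DataFrame({"Close": [10.0, 11.0, 12.0], "Volume": [100, 110, 120]}, index=idx1)
df2 = pd.DataFrame({"Close": [20.0, 21.0, 22.0], "Volume": [200, 210, 220]}, index=idx2)

# Take 'Close' from each frame and place the series side by side.
# Rows are matched on the index; dates missing from one input become NaN.
combined = pd.concat([df1["Close"], df2["Close"]], axis=1, keys=["df1", "df2"])
print(combined)
```

Here the result has four rows, with one missing value in each column where the other input has no matching date.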

Financial Data Retrieval

`data(ticker_symbol, start_date, end_date, interval)`

Downloads financial data from Yahoo Finance and calculates daily returns.

Parameters:

ticker_symbol: str. The stock ticker symbol.
start_date: str. Start date in 'dd-mm-yyyy' format.
end_date: str. End date in 'dd-mm-yyyy' format.
interval: str. Data interval (e.g., '1d', '1wk', '1mo').

Returns:

pandas.DataFrame containing the stock data and calculated returns.

Example Usage:

from econkit import finance as f

start = '01-06-2024'
end = '07-06-2024'
interval = '1d'

df = f.data('AAPL', start, end, interval)
print(df.head())
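
Note that the dates are day-first, so '07-06-2024' means 7 June 2024, not 6 July. The offline sketch below shows how such strings parse and how simple period-over-period returns are computed from a price series. The helper `to_iso` and the price values are made up, and whether econkit reports simple or log returns is not stated above.

```python
from datetime import datetime

import pandas as pd

def to_iso(date_str):
    """Hypothetical helper: 'dd-mm-yyyy' string to ISO 'yyyy-mm-dd'."""
    return datetime.strptime(date_str, "%d-%m-%Y").date().isoformat()

start_iso = to_iso("01-06-2024")
end_iso = to_iso("07-06-2024")
print(start_iso, end_iso)  # 2024-06-01 2024-06-07

# Simple returns: price_t / price_{t-1} - 1; the first value is NaN.
prices = pd.Series([100.0, 102.0, 101.0], name="Close")
simple_returns = prices.pct_change()
print(simple_returns.round(4).tolist())
```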

Usage Notes

Ensure your data is clean and properly formatted before using these functions.
The adf and kpss functions can handle missing values; choose 'drop' or 'fill' with the handle_na parameter.
For time series analysis, ensure your data is indexed by date.

For further details, refer to the function docstrings in the source code or the examples provided above.

Release history

0.5.8

Feb 26, 2025

0.5.7

Feb 26, 2025

0.5.6

Feb 26, 2025

0.5.5

Feb 26, 2025

0.5.4

Feb 26, 2025

0.5.3

Jan 9, 2025

0.5.2

Jan 9, 2025

0.5.1

Jan 8, 2025

0.5

Jan 8, 2025

0.4.1.9.9.4

Jan 8, 2025

0.4.1.9.9.3

Aug 23, 2024

0.4.1.9.9.2

Aug 23, 2024

0.4.1.9.9

Jul 5, 2024

0.4.1.9.8

Jun 17, 2024

0.4.1.9.7

Jun 17, 2024

0.4.1.9.6

Jun 16, 2024

0.4.1.9.5

Jun 15, 2024

0.4.1.9.4

Jun 15, 2024

0.4.1.9.3

Jun 15, 2024

0.4.1.9.2

Jun 15, 2024

0.4.1.9.1

Jun 15, 2024

0.4.1.9

Jun 15, 2024

0.4.1.8

Jun 15, 2024

0.4.1.7

Jun 15, 2024

0.4.1.6

Jun 15, 2024

0.4.1.5

Jun 15, 2024

0.4.1.4

Jun 14, 2024

0.4.1.3

Jun 14, 2024

0.4.1.2

Jun 14, 2024

0.4.1.1

Jun 14, 2024

0.4.1.0

Jun 14, 2024

0.4.0.9

Jun 12, 2024

0.4.0.8

Jun 12, 2024

0.4.0.7

Jun 12, 2024

0.4.0.5

Jun 11, 2024

0.4.0.4

Jun 11, 2024

0.4.0.3

Apr 20, 2024

0.4.0.2

Mar 13, 2024

0.4.0.1

Mar 13, 2024

0.4.0

Mar 13, 2024

0.3.9

Mar 13, 2024

0.3.8

Mar 13, 2024

0.3.7

Jan 25, 2024

0.3.6

Jan 25, 2024

0.3.5

Jan 22, 2024

0.3.4

Jan 21, 2024

0.3.3

Jan 7, 2024

0.3.2

Jan 7, 2024

0.3.0

Jan 2, 2024

0.2.9

Jan 2, 2024

0.2.8

Jan 2, 2024

0.2.7.5

Jan 2, 2024

0.2.7.4

Jan 1, 2024

0.2.7.3

Jan 1, 2024

0.2.7.1

Jan 1, 2024

0.2.7

Jan 1, 2024

0.2.6

Jan 1, 2024

0.2.5

Jan 1, 2024

0.2.4

Jan 1, 2024

0.2.3

Dec 31, 2023

0.2.2

Dec 31, 2023

0.2.1

Dec 31, 2023

0.2.0

Dec 31, 2023

0.1.9

Dec 31, 2023

0.1.8

Dec 31, 2023

0.1.7

Dec 31, 2023

0.1.6

Dec 31, 2023

0.1.5

Dec 31, 2023

0.1.4

Dec 31, 2023

0.1.3

Dec 31, 2023

0.1.2

Dec 31, 2023

0.1.1

Dec 30, 2023

0.1.0

Dec 30, 2023

0.0.9

Dec 29, 2023

0.0.8

Dec 29, 2023

0.0.7

Dec 29, 2023

0.0.6

Dec 29, 2023

0.0.5

Dec 29, 2023

0.0.4

Dec 29, 2023

0.0.3

Dec 29, 2023

0.0.2

Dec 29, 2023

0.0.1

Dec 29, 2023

Download files

Source Distribution

econkit-0.5.3.tar.gz (14.1 kB view details)

Uploaded Jan 9, 2025 Source

Built Distribution

econkit-0.5.3-py3-none-any.whl (12.8 kB view details)

Uploaded Jan 9, 2025 Python 3

File details

Details for the file econkit-0.5.3.tar.gz.

File metadata

Download URL: econkit-0.5.3.tar.gz
Upload date: Jan 9, 2025
Size: 14.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for econkit-0.5.3.tar.gz
Algorithm	Hash digest
SHA256	`a8a764c7361ce3bbcf094b84ddf22926f4dbf286b5f66f1486351adb6ed2a359`
MD5	`481a3a7c4db5b705e308fc08a8b5139d`
BLAKE2b-256	`926574497711f0c9db47034b9c42b522b64b907b048331e59c8ee8688e22227d`

File details

Details for the file econkit-0.5.3-py3-none-any.whl.

File metadata

Download URL: econkit-0.5.3-py3-none-any.whl
Upload date: Jan 9, 2025
Size: 12.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for econkit-0.5.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5a48e6454c384c39671d4d999db03b7b4255e40b441b5cfcc96c07fbfa009030`
MD5	`dc3cd0caab16da6f8a9860911f61bfc1`
BLAKE2b-256	`56b0c506ca8b335f8fd1e0e7b4a7184cd660794ca023aba1b8199d91728733ed`
