Library to produce synthetic price time series
Project description
Synthetick Library
This is a Work In progress for a library to generate synthetic price time series at different levels of abstraction: tick, OHLC, etc.
At the bottom level the library generates tick data, on top of which it calculates price aggregations like OHLC or others.
So, essentially, tick price is at the core so the model how tick price is calculated is presented in the next section.
Tick Data Model
Core Model
Let $p_0$ be the initial price value for the time series $P$, with $n$ elements, then the next price elements of the series are calculated as follows:
$p_1 = p_0 + \Delta p_1$
$p_2 = p_1 + \Delta p_2$
...
$p_{n-2} = p_{n-3} + \Delta p_{n-2}$ (1)
$p_{n-1} = p_{n-2} + \Delta p_{n-1}$ (2)
$p_{n} = p_{n-1} + \Delta p_{n}$ (3)
Where:
$\Delta p_i$ Is the return or price change in for period $i$
The general term is:
$p_{i} = p_{i-1} + \Delta p{i}, i \in [1, 2, ... n]$
Replacing (2) in (3)
$p_n = p_{n-2} + \Delta p_{n-1} + \Delta p_{n} (4)$
If we now repeat the process to replace (1) in (4)
$p_n = p_{n-3} + \Delta p_{n-2} + \Delta p_{n-1} + \Delta p_{n} (4)$
Repeating the process until $p_1$ is replaced in the resultant formula:
$p_n = p_{0} + \sum\limits_{i=1}^n \Delta p_i$
If $\Delta p_{i}$ is produced by an stochastic process, then the series has the characteristics of a Random Walk.
Making $\Delta p_{i} \approx N(\mu, \sigma)$ a normal distribution with mean $\mu$ and standard deviation $\sigma$
If $\mu$ is 0, then the price generation process is an unbiased random walk, but as will be shown later, using $\mu \neq 0$ (biased random walk) it is possible to control the price trend: up (long or bull), range or down (short or hawkish)
With $\sigma$ it is possible to control price volatility.
To wrap up this section, the tick price generation process has three parameters:
- $p_0$: First price in the series, necessary to calculate the remaining $n-1$ elements, with $n$ being the number of elements in the series.
- $\mu$: Mean for the distribution of returns or price change.
- $\sigma$: Standard deviation for the returns distribution.
Bid, Ask, Spread
The price of a financial asset comes in pairs: the price at wich you buy or Ask price, and the price at wich you sell or Bid. So for each new tick price, you need two values: Bid and Ask.
The difference between both is the Spread:
$spread_i = Ask_i - Bid_i$
So to generate tick price it is needed to generate two time series.
The library generates Bid parice first (As described in Core Model), then calculates Ask as a function of Bid and Spread:
$Ask_i = Bid_i + Spread_i$
Where:
$Spread \in (SPREAD_{min}, SPREAD_{max})$ is calculated as a random value with unifoirm dsitribution between $SPREAD_{min}$ and $SPREAD_{max}$
This calculation adds two new parameters to the model:
$spread_{min}$: Minimum value for the spread
$spread_{max}$ Maximum value for the spread
Price Aggregations
Price aggregations are data reductions by nean of applying functions on tick data for a period of time (candles or price bars), for a fix number of ticks (tick bars) or for a fix price change (renko bars).
So far only OHLC data is supported and is claculated using pandas library
tick.price_time_series["bid"].resample(<tick-data-time-series>).ohlc()
For more datails read here
TODO: add Renko and Tick Bars
How to Use
Install
pip install synthetick
How to use
Generating tick data.
This example generates a time series of tick data with a frequency of 1 socond, uptrending, with a volatility range of 10 pips, a spread range from 0.5 to 3 pips, with the pip position at the 4th decimal place.
from datetime import datetime
import pandas as pd
from synthetick.Synthetick import Ticks
DATE_FROM: datetime = pd.to_datetime("2023-01-01 00:00:00")
DATE_TO: datetime = pd.to_datetime("2023-02-01 00:00:00")
tick_data_generator = Ticks(trend=0.01,
volatility_range=10,
spread_min=0.5,
spread_max=3,
pip_position=-4,
remove_weekend=True)
tick_data_generator._compute_date_range(date_from=DATE_FROM,
date_to=DATE_TO,
frequency="1s",
init_value=1.1300)
tick_data_generator.price_time_series.to_csv("test_tick_happy_path.csv", index_label="date-time")
tick_data_generator.price_time_series[300:350][["bid", "ask"]].plot(figsize=(10,3), marker=".", cmap="PiYG")
Generating OHLC Data
from datetime import datetime
import pandas as pd
from synthetick.Synthetick import OHLC
import mplfinance as mpf
DATE_FROM: datetime = pd.to_datetime("2023-01-01 00:00:00")
DATE_TO: datetime = pd.to_datetime("2023-02-01 00:00:00")
ohlc: OHLC = OHLC(trend=0.0001,
volatility_range=10,
spread_min=0.5,
spread_max=3,
pip_position=-4,
remove_weekend=True,
tick_frequency="1s",
time_frame="H")
ohlc.produce(date_from=DATE_FROM, date_to=DATE_TO, init_value=1.300)
ohlc.ohlc_time_series["bid"].to_csv("ohlc_bid_1h.csv", index_label="date-time")
mc2 = mpf.make_marketcolors(up='blue',down='r')
s2 = mpf.make_mpf_style(marketcolors=mc2)
mpf.plot(ohlc.ohlc_time_series["bid"][200:400], type="candle", figsize=(15,4), style=s2)
TODO's
- Improve documentation
- Produce ticks at random intervals.
- Remove weekends option
- Change trend when price reaches zero level
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file synthetick-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: synthetick-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 colorama/0.4.4 importlib-metadata/4.6.4 keyring/23.5.0 pkginfo/1.8.2 readme-renderer/34.0 requests-toolbelt/0.9.1 requests/2.31.0 rfc3986/1.5.0 tqdm/4.57.0 urllib3/1.26.5 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3046e1ece24b72807da3829cbf6aa4a6bf7b79694be323a18babfa7361d0eda4 |
|
MD5 | a468edeae502baac1b05f12fd9a25331 |
|
BLAKE2b-256 | 41e14351b5068ec4d3180139a7ae04899b9b29f5f873e392f4e72556f91ae737 |