Skip to main content

Data pipeline for algo-trading, getting and storing both real-time and historical data made easy.

Project description

PFeed: The Single Source of Truth for Algo-Trading Data. Uniting Traders to Clean Once & Share with All.

Twitter Follow GitHub stars PyPI downloads PyPI PyPI - Support Python Versions

This library is NOT stable yet. For a stable release, please wait for 0.1.0.

TL;DR: use pfeed to manage your trading data; other traders will help you clean it

PFeed Architecture

For illustration purposes only, not everything shown is currently supported.

Problem

Starting algo-trading requires reliable and clean data, but traders often work in silos, each writing duplicated code to clean the same datasets—wasting time and effort. Accessing clean, ready-to-use data is a challenge, forcing traders to handle tedious data tasks before they can even start trading.

Solution

pfeed leverages modern data engineering tools to centralize data cleaning efforts, automate ETL/ELT, store data in a data lake with Delta Lake support, and output backtesting-ready data, allowing traders to focus on strategy development.


PFeed (/piː fiːd/) is the data engine for trading, serving as a pipeline between raw data sources and traders. It enables you to download historical data, stream real-time data, and store cleaned data in a local data lake for quantitative analysis, supporting both batch processing and streaming workflows through streamlined data collection, cleaning, transformation, and storage.

Core Features

  • Download or stream reliable, validated and clean data for research, backtesting, or live trading
  • Get historical data (dataframe) or live data in standardized formats by just calling a single function
  • Own your data by storing them locally using MinIO + Deltalake, or in the cloud
  • Interact with different kinds of data (including TradFi, CeFi and DeFi) using a unified interface
  • Scale using modern data tools (e.g. Polars, Dask) and workflow orchestration frameworks (Prefect for batch processing, Bytewax for streaming)

Table of Contents

Installation

For more installation options, please refer to the documentation.

# [RECOMMENDED]: Core Features, including Minio, Deltalake, Ray, etc.
pip install -U "pfeed[core,prefect,bytewax]"

# add your desired data sources, e.g. databento, polygon, etc.
pip install -U "pfeed[core,databento,polygon]"

# Minimal Features
pip install -U "pfeed"

Quick Start

1. Get Historical Data in Dataframe

Get Bybit's data in dataframe, e.g. 1-minute data (data is downloaded on the fly if not found in storage)

import pfeed as pe

bybit = pe.Bybit(data_tool='polars')

df = bybit.get_historical_data(
    'BTC_USDT_PERP',
    resolution='1minute',  # 'raw' or '1tick'/'1t' or '2second'/'2s' etc.
    start_date='2025-01-01',
    end_date='2025-01-01',
)

Printing the first few rows of df:

date resolution product symbol open high low close volume
2025-01-01 00:00:00 1m BTC_USDT_PERP BTCUSDT 93530 93590.8 93501.3 93590.5 30.284
2025-01-01 00:01:00 1m BTC_USDT_PERP BTCUSDT 93590.5 93627.7 93571.8 93625 30.334

By using pfeed, you are just one function call away from getting a standardized dataframe

2. Download Historical Data on the Command Line Interface (CLI)

For more CLI commands, please refer to the documentation.

# download BTC tick data
pfeed download -d BYBIT -p BTC_USDT_PERP -r tick --start-date 2025-01-01 --end-date 2025-02-01

# download data and store it in MinIO
pfeed download -d BYBIT -p BTC_USDT_PERP --storage minio

3. Download Historical Data in Python

import pfeed as pe

bybit = pe.Bybit()
bybit.download(
    product='BTC_USDT_PERP',
    resolution='1s',  # 1-second data
    rollback_period='1w',  # rollback 1 week
    to_storage='local',
)

Supported Data Sources

Data Source Get Historical Data Download Historical Data Get Live Data Stream Live Data
Yahoo Finance 🟢
Bybit 🟢 🟢 🟡 🔴
*Interactive Brokers 🔴 🔴 🔴
*FirstRate Data 🔴 🔴
*Databento 🔴 🔴 🔴 🔴
*Polygon 🔴 🔴 🔴 🔴
Binance 🔴 🔴 🔴 🔴
OKX 🔴 🔴 🔴 🔴

🟢 = finished
🟡 = in progress
🔴 = todo
⚪ = not applicable
* = paid data

Related Projects

  • PFund — A Complete Algo-Trading Framework for Machine Learning, TradFi, CeFi and DeFi ready. Supports Vectorized and Event-Driven Backtesting, Paper and Live Trading
  • PyTrade.org - A curated list of Python libraries and resources for algorithmic trading.

Disclaimer

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES, OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF, OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

This framework is intended for educational and research purposes only. It should not be used for real trading without understanding the risks involved. Trading in financial markets involves significant risk, and there is always the potential for loss. Your trading results may vary. No representation is being made that any account will or is likely to achieve profits or losses similar to those discussed on this platform.

The developers of this framework are not responsible for any financial losses incurred from using this software. This includes but not limited to losses resulting from inaccuracies in any financial data output by PFeed. Users should conduct their due diligence, verify the accuracy of any data produced by PFeed, and consult with a professional financial advisor before engaging in real trading activities.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pfeed-0.0.5.tar.gz (94.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pfeed-0.0.5-py3-none-any.whl (140.8 kB view details)

Uploaded Python 3

File details

Details for the file pfeed-0.0.5.tar.gz.

File metadata

  • Download URL: pfeed-0.0.5.tar.gz
  • Upload date:
  • Size: 94.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.1 CPython/3.13.3 Darwin/24.3.0

File hashes

Hashes for pfeed-0.0.5.tar.gz
Algorithm Hash digest
SHA256 a03818ae979aaa0631853b49181cfefb6de1aaf2604034507bb8a857564cd427
MD5 3c1d4ed6c37a64f457fbecd7f3cbdbb9
BLAKE2b-256 d9f3130d7ca9ae779ea282a2bdf331a1082fdf5d24a342edb6135656efb52aef

See more details on using hashes here.

File details

Details for the file pfeed-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: pfeed-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 140.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.1 CPython/3.13.3 Darwin/24.3.0

File hashes

Hashes for pfeed-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 36b35be1ee22f9ccec75d582cd56cf772d967938e1f8f56ba9a07d31313ef020
MD5 d6a5fdbabf891ba1928aeccf674801fe
BLAKE2b-256 6b2ef997ea79fced0d7a74a4bd6c5d9608da2b44bd52cd7bc58b86379c83222e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page