
Cached Historical Data Fetcher


A Python utility for fetching historical data with caching. Suitable for acquiring data that is added frequently and incrementally, e.g. news, posts, or weather.

Installation

Install this via pip (or your favourite package manager):

pip install cached-historical-data-fetcher

Features

  • Uses cache built on top of joblib, lz4 and aiofiles.
  • Ready to use with asyncio, aiohttp, and aiohttp-client-cache. Fetches chunks in parallel with asyncio.gather. (For performance reasons, relying on aiohttp-client-cache alone is probably not a good idea when fetching a large number of chunks, i.e. web requests.)
  • Based on pandas and supports MultiIndex.
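
The parallel chunk fetching mentioned above can be sketched independently of this library. The names below (fetch_chunk, fetch_all) are illustrative, not part of cached-historical-data-fetcher's API; the idea is simply to launch one coroutine per chunk with asyncio.gather and concatenate the resulting DataFrames:

```python
import asyncio

import pandas as pd

async def fetch_chunk(start: pd.Timestamp) -> pd.DataFrame:
    # Stand-in for a real web request; returns one row per chunk.
    await asyncio.sleep(0)  # yield control, as a real request would
    return pd.DataFrame({"day": [start.day]}, index=[start])

async def fetch_all(starts: list[pd.Timestamp]) -> pd.DataFrame:
    # Run all chunk fetches concurrently, then concatenate the results.
    frames = await asyncio.gather(*(fetch_chunk(s) for s in starts))
    return pd.concat(frames).sort_index()

starts = list(pd.date_range("2023-09-30", periods=3, freq="D", tz="UTC"))
df = asyncio.run(fetch_all(starts))
print(df)
```

The library layers on-disk caching (joblib, lz4, aiofiles) on top of this pattern, so only unfetched chunks hit the network on subsequent runs.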

Usage

HistoricalDataCache, HistoricalDataCacheWithChunk and HistoricalDataCacheWithFixedChunk

Override the get_one() method to fetch the data for one chunk. The update() method calls get_one() for each unfetched chunk, concatenates the results, and saves them to the cache.

from cached_historical_data_fetcher import HistoricalDataCacheWithFixedChunk
from pandas import DataFrame, Timedelta, Timestamp
from typing import Any

# define cache class
class MyCacheWithFixedChunk(HistoricalDataCacheWithFixedChunk[Timestamp, Timedelta, Any]):
    delay_seconds = 0.0 # delay between chunks (requests) in seconds
    interval = Timedelta(days=1) # interval between chunks, can be any type
    start_index = Timestamp.utcnow().floor("10D") # start index, can be any type

    async def get_one(self, start: Timestamp, *args: Any, **kwargs: Any) -> DataFrame:
        """Fetch data for one chunk."""
        return DataFrame({"day": [start.day]}, index=[start])

# get complete data
print(await MyCacheWithFixedChunk().update())
                           day
2023-09-30 00:00:00+00:00   30
2023-10-01 00:00:00+00:00    1
2023-10-02 00:00:00+00:00    2

See example.ipynb for a real-world example.

IdCacheWithFixedChunk

Override the get_one() method to fetch the data for one chunk, just as with HistoricalDataCacheWithFixedChunk. After the ids have been set via set_ids(), the update() method calls get_one() for every unfetched id, concatenates the results, and saves them to the cache.

from cached_historical_data_fetcher import IdCacheWithFixedChunk
from pandas import DataFrame
from typing import Any

class MyIdCache(IdCacheWithFixedChunk[str, Any]):
    delay_seconds = 0.0 # delay between chunks (requests) in seconds

    async def get_one(self, start: str, *args: Any, **kwargs: Any) -> DataFrame:
        """Fetch data for one chunk."""
        return DataFrame({"id+hello": [start + "+hello"]}, index=[start])

cache = MyIdCache() # create cache
cache.set_ids(["a"]) # set ids
cache.set_ids(["b"]) # set ids again, now `cache.ids` is ["a", "b"]
print(await cache.update(reload=True)) # discard previous cache and fetch again
cache.set_ids(["b", "c"]) # set ids again, now `cache.ids` is ["a", "b", "c"]
print(await cache.update()) # fetch only new data
       id+hello
    a   a+hello
    b   b+hello
       id+hello
    a   a+hello
    b   b+hello
    c   c+hello

Contributors ✨

Thanks goes to these wonderful people (emoji key):

This project follows the all-contributors specification. Contributions of any kind welcome!
