A small package for Numerai Signals locally
Project description
pip install signalslite
Why?
- I wanted a pipeline that can generate features quickly so I can add, remove, build more features whenever needed. So it should be able to do everything from scratch in couple of hours. Myu initial choice for database was a relational database, could have been ideal. However, it would increase workload of setting things up and writing operation of all stages of data pipeline was slow, so I decided to use parquet files split into daily structure. This is fast.
- I also wanted it to run on most users' systems, so It should effortlessly run on consumer grade laptops. Consequently, I wanted to automate the whole pipeline on cloud, so makes sense to make it lite, use parallelization when possible, allow for free data sources.
- It should be able to run in Colab default runtime. One way to setup a pipeline is to save all data to mounted drive with more storage.
- Under 1000 LoC possible?
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
signalslite-0.1a2.tar.gz
(12.1 kB
view hashes)
Built Distribution
Close
Hashes for signalslite-0.1a2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 77b51ccc48ecd8a0c2ca139eb5219ef1cbbd6c4566e7c96b8f042598a78fb3b4 |
|
MD5 | 9a16db5e30cbcb040e8be45e864a40be |
|
BLAKE2b-256 | c479ea64feaca8ea66f66acf7e2267cd83ab66f2b1552b8a3775fe83770bc8a0 |