A small package for Numerai Signals locally
Project description
pip install signalslite
Why?
- I wanted a pipeline that can generate features quickly so I can add, remove, build more features whenever needed. So it should be able to do everything from scratch in couple of hours. Myu initial choice for database was a relational database, could have been ideal. However, it would increase workload of setting things up and writing operation of all stages of data pipeline was slow, so I decided to use parquet files split into daily structure. This is fast.
- I also wanted it to run on most users' systems, so It should effortlessly run on consumer grade laptops. Consequently, I wanted to automate the whole pipeline on cloud, so makes sense to make it lite, use parallelization when possible, allow for free data sources.
- It should be able to run in Colab default runtime. One way to setup a pipeline is to save all data to mounted drive with more storage.
- Under 1000 LoC possible?
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
signalslite-0.1a1.tar.gz
(12.1 kB
view hashes)
Built Distribution
Close
Hashes for signalslite-0.1a1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c7d597b13ccb986694cce2aae9594cabf7dad2c0ca5c84f02acc2e19003612c1 |
|
MD5 | badf72932c7d1ce1e64dc4b69728340d |
|
BLAKE2b-256 | 215dec6ff83567c9e6cbe45c21d32a36b37a9052fa0d76f1908650f93805771c |