A data ingestion tool for bigger-than-memory data sets.
For the latest source, discussion, etc, please visit the GitLab repository
Data ingestion for bigger-than-memory data sets
This repository contains the code for computing the mean, standard deviation, max value and the histogram for all features in bigger-than-memory data sets.
- Python 3.6+
The code available in this repository concerns the coding challenge from Jungle.ai for the data engineer position and it is a work in progress.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size data_ingest-0.0.1-py3.7.egg (1.9 kB)||File type Egg||Python version 3.7||Upload date||Hashes View hashes|