Datasets downloading/batching/processing in Numpy
Project description
/*
*/
All dataset utilities (downloading/loading/batching/processing) in Numpy

This is an under-development research project, not an official product, expect bugs and sharp edges; please help by trying it out, reporting bugs. Reference docs
What is and why doing numpy-datasets ?
- First, numpy-datasets offers out-of-the-box dataset download and loading only based on Numpy and core Python libraries.
- Second, numpy-datasets offers utilities such as (mini-)batching a.k.a looping through a dataset one chunk at a time, or preprocessing techniques that are highly suited for machine learning and deep learning pipelines.
- Third, numpy-datasets offers many options to transparently deal with very large datasets. For example, automatic mini-batching with a priori caching of the next batch, online preprocessing, and the likes.
- Fourth, numpy-datasets does not only focus on computer vision datasets but also offers plenty in time-series datasets, with a constantly groing collection of implemented datasets.
Minimal Example
import sys
import symjax as sj
import symjax.tensor as T
# create our variable to be optimized
mu = T.Variable(T.random.normal((), seed=1))
# create our cost
cost = T.exp(-(mu-1)**2)
# get the gradient, notice that it is itself a tensor that can then
# be manipulated as well
g = sj.gradients(cost, mu)
print(g)
# (Tensor: shape=(), dtype=float32)
# create the compield function that will compute the cost and apply
# the update onto the variable
f = sj.function(outputs=cost, updates={mu:mu-0.2*g})
for i in range(10):
print(f())
# 0.008471076
# 0.008201109
# 0.007946267
# ...
Installation
Installation is direct with pip as described in this guide.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
numpy-datasets-0.0.2.tar.gz
(69.2 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file numpy-datasets-0.0.2.tar.gz.
File metadata
- Download URL: numpy-datasets-0.0.2.tar.gz
- Upload date:
- Size: 69.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
210054744760e22efce878bc9ef30a88c2d4855b6ad645e71d0e7accf930b855
|
|
| MD5 |
7f56b4b29594525b63c9ee55269ea78d
|
|
| BLAKE2b-256 |
2c458843e6bf2e9c48fe803a1cedfb95cd8aaf7bd47cbf0c0523dd1cfee20ead
|
File details
Details for the file numpy_datasets-0.0.2-py3-none-any.whl.
File metadata
- Download URL: numpy_datasets-0.0.2-py3-none-any.whl
- Upload date:
- Size: 79.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9a51796d8d61e23a7bb5b907f4a343f228cfa7ae5af170a37619a08a0b68e645
|
|
| MD5 |
b9254efb13db14334f9147c883ebc1ec
|
|
| BLAKE2b-256 |
392b225952e600b51c4b9e10a4a6347a5504ba58778db39d8503d172986f871d
|