Data pipeline framework for machine learning
Fuel
Fuel provides your machine learning models with the data they need to learn.
- Interfaces to common datasets such as MNIST and CIFAR-10 (image datasets), Google’s One Billion Words (text), and many more.
- The ability to iterate over your data in a variety of ways, such as in minibatches with shuffled or sequential examples.
- A pipeline of preprocessors that allows you to edit your data on the fly, for example by adding noise, extracting n-grams from sentences, or extracting patches from images.
- A pipeline that is fully serializable with pickle, which is required for checkpointing and resuming long-running experiments. For this, Fuel relies heavily on the picklable_itertools library.
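To make the serialization point concrete, here is a minimal sketch in plain Python. It does not use Fuel's API: `MinibatchIterator` and `AddNoise` are hypothetical names invented for this illustration. The idea it shows is that Python generators cannot be pickled, so a resumable pipeline keeps its iteration state in ordinary object attributes instead.

```python
import pickle
import random


class MinibatchIterator:
    """Yields minibatches of example indices (hypothetical, not Fuel's API).

    All state lives in plain attributes, so pickle can save it mid-epoch.
    """

    def __init__(self, num_examples, batch_size, shuffle=False, seed=0):
        self.indices = list(range(num_examples))
        if shuffle:
            random.Random(seed).shuffle(self.indices)
        self.batch_size = batch_size
        self.position = 0  # start of the next minibatch

    def __iter__(self):
        return self

    def __next__(self):
        if self.position >= len(self.indices):
            raise StopIteration
        batch = self.indices[self.position:self.position + self.batch_size]
        self.position += self.batch_size
        return batch


class AddNoise:
    """Wraps a batch iterator and adds Gaussian noise to each example on the fly."""

    def __init__(self, batches, data, scale, seed=0):
        self.batches = batches
        self.data = data
        self.scale = scale
        self.rng = random.Random(seed)  # random.Random instances are picklable

    def __iter__(self):
        return self

    def __next__(self):
        batch = next(self.batches)
        return [self.data[i] + self.rng.gauss(0.0, self.scale) for i in batch]


# A generator-based pipeline cannot be checkpointed this way.
try:
    pickle.dumps(x for x in range(3))
    generator_is_picklable = True
except TypeError:
    generator_is_picklable = False

data = [float(i) for i in range(10)]
stream = AddNoise(
    MinibatchIterator(len(data), batch_size=4, shuffle=True), data, scale=0.1
)

first_batch = next(stream)           # consume one minibatch
checkpoint = pickle.dumps(stream)    # save the pipeline mid-epoch
resumed = pickle.loads(checkpoint)   # restore it later

# The original and the restored pipeline continue with identical batches.
remaining_original = list(stream)
remaining_resumed = list(resumed)
```

Because the position and the random number generator state are pickled together with the data, the restored pipeline picks up exactly where the original left off, noise included.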
Fuel is developed primarily for use by Blocks, a Theano toolkit that helps you train neural networks.
If you have questions, don’t hesitate to write to the mailing list.
Citing Fuel
If you use Blocks or Fuel in your work, we’d really appreciate it if you could cite the following paper:
Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, and Yoshua Bengio, “Blocks and Fuel: Frameworks for deep learning,” arXiv preprint arXiv:1506.00619 [cs.LG], 2015.
Documentation
Please see the documentation for more information.
Download files
Source Distribution
File details
Details for the file fuel-0.2.0.tar.gz.
File metadata
- Download URL: fuel-0.2.0.tar.gz
- Upload date:
- Size: 184.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 2a4121b614219288c80e1688e0a35c11060fb999dde4218f65412c621a8a5f65
MD5 | 48bf3267009f7a4285c101485321b87d
BLAKE2b-256 | 16c2b5fb651c90e908f79769b7dd3643982b6a9b1bac9449b8ab16f72612d4f5
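
If you download the source distribution manually, you may want to check it against the SHA256 digest listed above. The following is a minimal sketch using only the standard library. The helper names `sha256_of_file` and `matches_expected` are hypothetical, and the sketch assumes the archive was saved locally under whatever path you pass in.

```python
import hashlib

# SHA256 digest listed for fuel-0.2.0.tar.gz in the table above.
EXPECTED_SHA256 = "2a4121b614219288c80e1688e0a35c11060fb999dde4218f65412c621a8a5f65"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected one, ignoring letter case."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("fuel-0.2.0.tar.gz")` would return `True` only if the local file's digest equals the one listed in the table.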