Batching is a set of tools to format data for training sequence models
Project description
Batching
Batching is a set of tools to format data for training sequence models.
Installation
$ pip install batching
Example usage
Example script exists in sample.py
# Metadata for batch info - including batch IDs and mappings to storage resouces like filenames
storage_meta = StorageMeta(validation_split=0.2)
# Storage for batch data - Memory, Files, S3
storage = BatchStorageMemory(storage_meta)
# Create batches - configuration contains feature names, windowing config, timeseries spacing
batch_generator = Builder(storage,
feature_set,
look_back,
look_forward,
batch_seconds,
batch_size=128)
batch_generator.generate_and_save_batches(list_of_dataframes)
# Generator for feeding batches to training - tf.keras.model.fit_generator
train_generator = BatchGenerator(storage)
validation_generator = BatchGenerator(storage, is_validation=True)
model = tf.keras.Sequential()
model.add(tf.keras.layers.Dense(1, activation='sigmoid')
model.compile(loss=tf.keras.losses.binary_crossentropy,
optimizer=tf.keras.optimizers.Adam(),
metrics=['accuracy'])
model.fit_generator(train_generator,
validation_data=validation_generator,
epochs=epochs)
License
- MIT license
- Copyright 2015 © FVCproductions.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
batching-1.2.0.tar.gz
(9.9 kB
view details)
File details
Details for the file batching-1.2.0.tar.gz
.
File metadata
- Download URL: batching-1.2.0.tar.gz
- Upload date:
- Size: 9.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ef11879c4edf320db72a1dd70203f23f28bd399a1eafaf76adc4779f80cda05 |
|
MD5 | 1c5341da10c1953b70ef28ec460eb8e1 |
|
BLAKE2b-256 | 45294525e96a0d128488e2572464839a85c8a124b697f6e64d089d625a6d07d5 |