Module to facilitate the integration of a sklearn training pipeline into a deploy and retraining system
Project description
Module to facilitate the integration of a sklearn training pipeline into a deploy and retraining system
Install
pip install gpam_training
Usage
Multilabel training
First of all, it is needed to have in memory a dataframe from pandas. The csv must be in the following format:
process_id,page_text_extract,tema
1,Lorem ipsum dolor sit amet,1
2,Lorem ipsum dolor sit amet,2
2,Lorem ipsum dolor sit amet,3
42,Lorem ipsum dolor sit amet,2
To train the model, do as shown bellow:
from gpam_training import MultilabelTraining
import pandas as pd
df = pd.read_csv('example.csv')
model = MultilabelTraining(df)
model.train()
To dump a pickle file with the trained model, do the following:
model_pickle = model.get_pickle()
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gpam_training-0.0.14.tar.gz
(878.4 kB
view hashes)
Built Distribution
Close
Hashes for gpam_training-0.0.14-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c07d9a9b5a32c7ed6f45b9bbee4a89ae10f9d40e00f278cc31a1b56a968294bb |
|
MD5 | 2bdc89b41eff41f7d8e5b6404b38014c |
|
BLAKE2b-256 | afb0436eaa4cdb505e46d94c3be726a6203a081a4b49df0723e2b6adbb5e899d |