Classifier for institution and scholar data
Project description
eric_chen_forward
To train the model:
from eric_chen_forward.model import Classifier
model = Classifier()
# option 1
# text files of labels and paragraphs respectively, separated by newlines
model.train("labels_file_path", "paragraphs_file_path")
# option 2
# csv file with a 'label' column and 'paragraph' column, the column names are hardcoded
model.train(csv_file="csv_file_path")
To use the saved model in code:
with open('model.pkl', 'rb') as f:
model = pickle.load(f)
To run the classifier demo:
from eric_chen_forward import url_classifier_demo
API_KEY = ...
SEARCH_ENGINE_ID = ...
url_classifier_demo.Demo('file path of model.pkl', API_KEY, SEARCH_ENGINE_ID, max_summary_length)
max_summary_length is set to 100 words by default.
Register an API Key and set up the Programmable Search Engine to be able to use the Google Custom Search API: https://developers.google.com/custom-search/v1/overview
After setting up, the Search engine ID can also be found in the control panel: https://programmablesearchengine.google.com/controlpanel/all
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for eric_chen_forward-0.0.10-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ef2b9f591717d708b0554cbb3438dbc7c7762544ac29ad6a1d81d336c4ca22c |
|
MD5 | 9c0deb411b1664b03fefe5630387aa3f |
|
BLAKE2b-256 | 6703e29504c3e88068b4546fd296d2e66f829531bc47c760473c7765e9c47d8c |