A simple short-text classification tool based on LibLinear
Project description
A simple, efficient short-text classification tool based on LibLinear.
Sample Usage
>>> from tgrocery import Grocery
# Create a grocery(don't forget to set a name)
>>> grocery = Grocery('sample')
# Train from list
>>> train_src = [
('education', 'Student debt to cost Britain billions within decades'),
('education', 'Chinese education for TV experiment'),
('sports', 'Middle East and Asia boost investment in top level sports'),
('sports', 'Summit Series look launches HBO Canada sports doc series: Mudhar')
]
>>> grocery.train(train_src)
# Or train from file
>>> grocery.train('train_ch.txt')
# Save model
>>> grocery.save()
# Load model(the same name as previous)
>>> new_grocery = Grocery('sample')
>>> new_grocery.load()
# Predict
>>> new_grocery.predict('Abbott government spends $8 million on higher education media blitz')
education
# Test from list
>>> test_src = [
('education', 'Abbott government spends $8 million on higher education media blitz'),
('sports', 'Middle East and Asia boost investment in top level sports'),
]
>>> new_grocery.test(test_src)
# Return Accuracy
1.0
# Or test from file
>>> new_grocery.test('test_ch.txt')
# Custom tokenize
>>> custom_grocery = Grocery('custom', custom_tokenize=list)
Installation
$ pip install tgrocery
Links
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
tgrocery-0.1.4.tar.gz
(39.6 kB
view details)
File details
Details for the file tgrocery-0.1.4.tar.gz
.
File metadata
- Download URL: tgrocery-0.1.4.tar.gz
- Upload date:
- Size: 39.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | efbb7124dc8baf3e68408953656302b7351ff069155d352ebb511a9ebe63caa0 |
|
MD5 | 2c4d4eae9471f7de16601ff4eb7bbaea |
|
BLAKE2b-256 | 2b5a676cff402a173d03adf460ee19a9d4d928e03866478ba21582f3c0e522ed |