A library for performing automatic detection of assessment classes of Wikipedia articles.
Project description
Wikipedia article quality classification
This library provides a set of utilities for performing automatic detection of assessment classes of Wikipedia articles. For more information, see the full documentation at https://articlequality.readthedocs.io .
Compatible with Python 3.x only. Sorry.
- Install:
pip install articlequality
- Models: https://github.com/wikimedia/articlequality/tree/master/models
- Documentation: https://articlequality.readthedocs.io
Basic usage
>>> import articlequality
>>> from revscoring import Model
>>>
>>> scorer_model = Model.load(open("models/enwiki.nettrom_wp10.gradient_boosting.model", "rb"))
>>>
>>> text = "I am the text of a page. I have a <ref>word</ref>"
>>> articlequality.score(scorer_model, text)
{'prediction': 'stub',
'probability': {'stub': 0.27156163795807853,
'b': 0.14707452309674252,
'fa': 0.16844898943510833,
'c': 0.057668704007171959,
'ga': 0.21617801281707663,
'start': 0.13906813268582238}}
Authors
- Aaron Halfaker -- https://github.com/halfak
- Morten Warncke-Wang -- https://github.com/nettrom
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
articlequality-0.4.2.tar.gz
(31.6 kB
view hashes)
Built Distribution
Close
Hashes for articlequality-0.4.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d4a843af4a46dffe1b3463f218c1b76da631b2eff7003e052be8c5aac525bb87 |
|
MD5 | 1d937247928a3e00bc69f100e2095930 |
|
BLAKE2b-256 | bc28d11981b9195b8ca6e8adcce418132be8417c4ba630872ec630261f274430 |