brwording - Natural Language Processing in Portuguese
Project description
BRWording - Text Analytics for Portuguese Wordings
Create easy text analytics in one line of code.
Main Features:
- Load Excel, CSV, and TXT file types
- Stemming
- Lemmatization
- Stopwords
- TF-IDF
- Sentiment analysis
- Graphical interpretation
- Word cloud
The TF-IDF was calculated by:
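For reference, one common textbook formulation is tf(t, d) * log(N / df(t)), where tf is the relative frequency of term t in document d, N is the number of documents, and df(t) is the number of documents containing t. The sketch below implements that formulation only; brwording may use a different weighting or smoothing, and the function name `tf_idf` is hypothetical.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document.

    Textbook formulation (an assumption, not necessarily brwording's):
    weight = (count / document length) * log(N / document frequency).
    """
    n_docs = len(documents)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Three tiny, already tokenized example documents.
docs = [
    "o dia está bonito".split(),
    "o carro é bonito".split(),
    "o carro é rápido".split(),
]
scores = tf_idf(docs)
```

In this formulation a term that appears in every document (here "o") gets a weight of zero, while a term unique to one document (here "dia") gets the highest weight in that document.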
How to Install
pip install BRWording
How to use
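Syntax:
from brwording import brwording
w = brwording.wording()
# Load a plain-text file
w.load_file('data/example.txt', type='txt')
# Build the TF-IDF table with the lemmatizer and stopwords options enabled
w.build_tf_idf(lemmatizer=True, stopwords=True)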
w.tfidf
The parameters of load_file are:
- file: the file path
- type: the file type; can be txt, csv, or excel
- header: when reading a CSV file, whether the file has a header row (True or False)
- sep: when reading a CSV file, the field separator to use
- column: when reading a CSV or Excel file, the column to parse
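To make the roles of header, sep, and column concrete, here is a minimal sketch that uses only Python's standard csv module. It does not show how brwording reads files internally; the helper `read_column` is hypothetical, and whether brwording expects a column name or a column index is an assumption (the sketch accepts both).

```python
import csv
import io

def read_column(stream, header=True, sep=",", column=0):
    """Return the texts found in one column of a delimited file.

    Hypothetical helper: `column` may be an integer index, or a
    column name when the file has a header row.
    """
    rows = list(csv.reader(stream, delimiter=sep))
    if header and rows:
        # The first row holds the column names, not data.
        names, rows = rows[0], rows[1:]
        if isinstance(column, str):
            column = names.index(column)
    # Skip completely empty lines.
    return [row[column] for row in rows if row]

# In-memory stand-in for a semicolon-separated CSV file with a header.
sample = io.StringIO("id;texto\n1;O dia está bonito\n2;O carro é rápido\n")
texts = read_column(sample, header=True, sep=";", column="texto")
```

Here `texts` holds only the values of the chosen column, one entry per data row, which is the kind of input a text-analytics step would then tokenize.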
Both parameters of the build_tf_idf method default to True.
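As an illustration of what a stopwords step typically does before term weighting, the following sketch filters very common words out of a token list. The stopword set is a tiny illustrative sample, and `remove_stopwords` is a hypothetical helper; the list and the exact behavior brwording applies are not documented here.

```python
# Tiny illustrative sample; real Portuguese stopword lists are much longer.
STOPWORDS = {"o", "a", "os", "as", "de", "do", "da", "e", "é", "um", "uma"}

def remove_stopwords(tokens, stopwords=STOPWORDS):
    """Drop tokens whose lowercase form is in the stopword set."""
    return [token for token in tokens if token.lower() not in stopwords]

tokens = "O carro é bonito e rápido".split()
filtered = remove_stopwords(tokens)
```

After filtering, only the content words remain, so frequent function words do not dominate the term counts.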
To see a graphical interpretation of the sentiment analysis:
Syntax:
w.sentimental_graf()
To rotate the graph, pass rotate=True as an argument.
You can print the same information as a table using the following command:
Syntax:
w.sentimental_table()
To create a word cloud, run the following command. To shape the cloud with your own mask, pass the path of your image as picture.
Syntax:
w.word_cloud(picture='none')
Looking for a word in the collection
To see which files in your collection contain a word, run look2word.
Syntax:
w.look2word('bonito')
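The idea behind this kind of lookup can be sketched in plain Python as follows. This is not brwording's implementation: the helper `find_documents` and the dictionary layout are hypothetical, and whether look2word matches stems or lemmas, ignores case, or returns file names is an assumption.

```python
def find_documents(collection, word):
    """Return the names of documents whose text contains the word.

    Hypothetical helper: matching is case-insensitive and on whole,
    whitespace-separated tokens.
    """
    target = word.lower()
    return [
        name
        for name, text in collection.items()
        if target in text.lower().split()
    ]

# In-memory stand-in for a collection of loaded files.
collection = {
    "a.txt": "O dia está bonito",
    "b.txt": "O carro é rápido",
    "c.txt": "Que jardim bonito",
}
matches = find_documents(collection, "bonito")
```

Here `matches` lists the two documents that contain "bonito", in the order they were added to the collection.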
New features are coming soon.
Enjoy!
Hashes for brwording-0.0.5-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | cb768c48d6ff527a5fbd94bfd25b23ce36b6a33236dd1f3f58d7e7977ee14e7c
MD5 | ed1e84802cba17949c1ae7789b04da19
BLAKE2b-256 | 72f897da92c84cefd58cd784f74917a146e0a8fb1f03ed58a78b30432fa2685f