4 projects
textplumber
Pipeline components for Sci-kit learn to extract relevant features from text data, including tokens, parts of speech, lexicon scores, document-level statistics and embeddings.
conc
A Python library for efficient corpus analysis, enabling corpus linguistic analysis in Jupyter notebooks.
contextapp
A browser-based concordancer and language analysis application.
corpress
Create a text corpus from a WordPress site using the WordPress API.