13 projects
split-folders
Split folders with files (e.g. images) into training, validation and test (dataset) folders.
clean-text
Functions to preprocess and normalize text.
pd3f
Reconstruct the original continuous text from PDFs with language models
dehyphen
Dehyphenation of broken text (mainly German), i.e., extracted from a PDF
pd3f-flair
Flair's language models without unnecessary dependencies
hyperhyper
Python Library to Construct Word Embeddings for Small Data
german
Preprocess German texts for serious NLP.
german-lemmatizer
A Python package (using a Docker image under the hood) to lemmatize German texts.
text-classification-keras
Text Classification Library for Keras
get-wayback-machine
Fetch a URL via the latest Wayback Machine Snapshot
get-retries
Adding retries to Requests.get() with exponential backoff
mw-category-members
Using MediaWiki's API, retrieve pages that belong to a given category
deep-plots
Visualize Your Deep Learning Training in Static Graphics