9 projects
findanywhere
Tool for searching possibly malformed input data as a preprocessing step for further analysis.
token-distance
Python library for fuzzy token matching within text documents. It lets developers and data scientists search and compare tokens by flexible criteria rather than exact matches only. The library supports tokenization by whitespace, regular expressions, or custom functions, and provides weighted comparisons for more nuanced analysis.
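The idea described above can be illustrated with a minimal sketch that uses only the Python standard library. It does not use or reproduce the token-distance API; the names `whitespace_tokenizer`, `regex_tokenizer`, `similarity`, and `weighted_match` are hypothetical, and `difflib.SequenceMatcher` stands in for whatever similarity measure the library actually applies.

```python
import re
from difflib import SequenceMatcher
from typing import Callable, Dict, List

# A tokenizer is any function that turns a text into a list of tokens.
Tokenizer = Callable[[str], List[str]]


def whitespace_tokenizer(text: str) -> List[str]:
    """Split a text on runs of whitespace."""
    return text.split()


def regex_tokenizer(pattern: str) -> Tokenizer:
    """Build a tokenizer that returns every match of a regular expression."""
    compiled = re.compile(pattern)
    return lambda text: compiled.findall(text)


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1]; 1.0 means identical."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def weighted_match(
    query: Dict[str, float],
    text: str,
    tokenize: Tokenizer = whitespace_tokenizer,
) -> float:
    """Score how well the weighted query tokens are covered by the text.

    Each query token contributes its best similarity against any token
    of the text, scaled by its weight; the result is normalised by the
    total weight so that it stays in [0, 1].
    """
    candidates = tokenize(text)
    total_weight = sum(query.values())
    if not candidates or total_weight == 0:
        return 0.0
    score = sum(
        weight * max(similarity(token, candidate) for candidate in candidates)
        for token, weight in query.items()
    )
    return score / total_weight


if __name__ == "__main__":
    words = regex_tokenizer(r"\w+")
    query = {"colour": 2.0, "palette": 1.0}
    print(weighted_match(query, "A color pallete, for charts.", tokenize=words))
```

Passing the tokenizer as a plain function keeps the three tokenization modes interchangeable: a custom function only needs to accept a string and return a list of tokens.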
hellsicht
Extensible tool for getting a first look at data and categorizing data structures.
pyploid
Framework for building experiments or educational material regarding evolutionary algorithms and genetics.
yajirushi
metarchive
Tool for analysing links within a website archived by archive.org
tritium-pipeline
Log-based job pipeline tool
rasierwasser
Simple pip repository server for internal use
CubeFlow
Framework for creating grid-based simulations.