15 projects
iterabledata
Iterable data processing Python library
undatum
A powerful command-line tool for data processing and analysis
qddate
Quick-and-dirty Python library for parsing dates from HTML really fast
internacia
Python SDK for accessing internacia-db data with support for countries, international blocks, and fuzzy search
wparc
WordPress API Crawler and Backup Tool
ydiskarc
Command-line tool to back up public resources from Yandex.Disk
apibackuper
A command-line tool and Python library for backing up APIs
filerepack
Repacks existing (un)compressed files for higher compression
metacrafter
Metacrafter metadata classification tool
metawarc
A command-line tool for data extraction from WARC files (web archives)
spcrawler
A command-line tool to back up data from public SharePoint installations via their open API endpoint
docx2csv
Extracts tables from .docx files and saves them as CSV or XLSX
lazyscraper
A simple command-line tool for the lazy, a Swiss Army knife for scraper writers. Automates scraping as much as possible
russiannames
Russian names parser, gender identification and processing tools
newsworker
Advanced news feed extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds