57 projects
vine-wayback
Fetch Vines from the Wayback Machine
htmldiff2
Diffs arbitrary HTML inline.
pymarc
Read, write and modify MARC bibliographic data
pincushion
An archiving tool for Historypin
bagit
Create and validate BagIt packages
etudier
Collect a citation graph from Google Scholar
feediverse
Connect an RSS Feed to Mastodon
idloc
Get JSON-LD for a Library of Congress name or subject authority.
marctable
Convert MARC to CSV and Parquet
memento-cli
Examine snapshots in eeb archives such as the Internet Archive's Wayback Machine
twarc-csv
A twarc plugin to output Twitter data as CSV
markdown-to-respec
Convert specifications written in Markdown to ReSpec HTML
wikipediarevs
Download all the revisions for a set of Wikipedia articles
twarc
Archive tweets from the command line
bibdesk2zotero
convert BibDesk BibTeX files for import into Zotero
public-domains
Find possible host names in a source text
twitter-archive-unshorten
Unshorten the URLs in your Twitter archive
twarc-network
Generate network visualizations for Twitter data
twarc-edits
Find edited tweetes
waybackprov
Checks the provenance of a URL in the Wayback machine
twarc-timeline-archive
A twarc plugin to collect the timelines of a list of users
microdata
html5lib extension for parsing microdata
twarc-text
A twarc plugin to print tweets to the terminal
twarc-hashtags
A twarc plugin to extract hashtags from Twitter data
airwaves
Unlocking the Airwaves Utilities
xkcd2347
List the dependencies for a github project
twarc-videos
A twarc plugin to extract referenced video from tweet data
twarc-ids
A twarc plugin to read Twitter data and output the tweet ids
luckysocial
lookup social media accounts for names
wikieds
Command line tool to Print a markdown summary of editors for a Wikipedia article.
fusionbuilder
Parse Fusion Page Builder text.
inst341data
install data in jupyter notebook
bagcat
A command line utility for managing BagIt packages in Amazon S3
puid
Lookup a PRONOM Unique Identifier for a file.
dedoop
dedupe files and send them to the cloud
ptree
Work with PairTree file system convention
wikilinks
Get a list of Wikipedia articles that link to a website.
solrpy
Client for the Solr search service
diffengine
Monitor changes to webpages in RSS feeds
oembedders
A utility for dispatching to known oembed providers
iacoll
Collect metadata for Internet Archive collections
storified
Download your Storify data
lastweet
Send Twitter/Mastodon updates about LastFM activity
nyaraka
Download Omeka data
wikidata_suggest
Interactively look up Wikidata entities from the command line
hathitables
Turn HathiTrust Collections into CSV
hathilda
Turn HathiTrust Data into JSON-LD
teizone
Add coordinates to TEI zones.
summoner
Work with the Serials Solutions Summon API
wplinks
find wikipedia articles that links to a website
opensearch
Interact with opensearch services
skosdict
Turn a SKOS concept scheme into a JSON dictionary
oai2pairtree
UNKNOWN
twitterator
iterating functions for twitter api
dflat
a command line tool for working with dflat digital preservation file systems
marcup
manage create/update/deletes marc feeds
marcdb
parse MARC data and store into a rdbms