61 projects
bluecore-models
Blue Core BIBFRAME Data Models
marctable
Convert MARC to CSV and Parquet
solrpy
Client for the Solr search service
vttdiff
Create an HTML "diff" for webVTT files
waybackprov
Checks the provenance of a URL in the Wayback Machine
aotycount
Summarize the albums on Alf Eaton's AOTY site
community-cloud-storage
Distributed storage for community archives using ipfs-cluster, Tailscale and Docker
pincushion
An archiving tool for Historypin
twarc
Archive tweets from the command line
idloc
Find and get readable JSON-LD from Library of Congress Linked Data Service
pymarc
Read, write and modify MARC bibliographic data
bagit
Create and validate BagIt packages
vine-wayback
Fetch Vines from the Wayback Machine
htmldiff2
Diffs arbitrary HTML inline.
etudier
Collect a citation graph from Google Scholar
feediverse
Connect an RSS Feed to Mastodon
memento-cli
Examine snapshots in web archives such as the Internet Archive's Wayback Machine
twarc-csv
A twarc plugin to output Twitter data as CSV
markdown-to-respec
Convert specifications written in Markdown to ReSpec HTML
wikipediarevs
Download all the revisions for a set of Wikipedia articles
bibdesk2zotero
convert BibDesk BibTeX files for import into Zotero
public-domains
Find possible host names in a source text
twitter-archive-unshorten
Unshorten the URLs in your Twitter archive
twarc-network
Generate network visualizations for Twitter data
twarc-edits
Find edited tweets
twarc-timeline-archive
A twarc plugin to collect the timelines of a list of users
microdata
html5lib extension for parsing microdata
twarc-text
A twarc plugin to print tweets to the terminal
twarc-hashtags
A twarc plugin to extract hashtags from Twitter data
airwaves
Unlocking the Airwaves Utilities
xkcd2347
List the dependencies for a GitHub project
twarc-videos
A twarc plugin to extract referenced video from tweet data
twarc-ids
A twarc plugin to read Twitter data and output the tweet ids
luckysocial
lookup social media accounts for names
wikieds
Command line tool to print a Markdown summary of editors for a Wikipedia article.
fusionbuilder
Parse Fusion Page Builder text.
inst341data
install data in jupyter notebook
bagcat
A command line utility for managing BagIt packages in Amazon S3
puid
Lookup a PRONOM Unique Identifier for a file.
dedoop
dedupe files and send them to the cloud
ptree
Work with PairTree file system convention
wikilinks
Get a list of Wikipedia articles that link to a website.
diffengine
Monitor changes to webpages in RSS feeds
oembedders
A utility for dispatching to known oembed providers
iacoll
Collect metadata for Internet Archive collections
storified
Download your Storify data
lastweet
Send Twitter/Mastodon updates about LastFM activity
nyaraka
Download Omeka data
wikidata_suggest
Interactively look up Wikidata entities from the command line
hathitables
Turn HathiTrust Collections into CSV
hathilda
Turn HathiTrust Data into JSON-LD
teizone
Add coordinates to TEI zones.
summoner
Work with the Serials Solutions Summon API
wplinks
Find Wikipedia articles that link to a website
opensearch
Interact with opensearch services
skosdict
Turn a SKOS concept scheme into a JSON dictionary
oai2pairtree
UNKNOWN
twitterator
Iterating functions for the Twitter API
dflat
a command line tool for working with dflat digital preservation file systems
marcup
Manage creates, updates and deletes in MARC feeds
marcdb
Parse MARC data and store it in an RDBMS