Sockit is a natural-language processing toolkit for modeling structured occupation information and Standard Occupational Classification (SOC) codes in unstructured text from job titles, job postings, and resumes.
Extends SCons to build targets in remote environments.
WordTrie: a simple trie (prefix tree) for word and phrase matching
Secure Infrastructure for Research with Administrative Data
Censuscoding: determine the Census blockgroup for a street address
Automatic generation of codebooks from dataframes.
An alignment and variant-calling pipeline for Illumina deep sequencing of HIV-1, based on the probabilistic aligner HMMER.
A data analysis environment that unites the best features of pandas, R, Stata, and others.
An automated phylogenomics pipeline.
A lightweight bioinformatics framework with automated tracking of diagnostics and provenance.