52 projects
start-ocr
Applying pdfplumber + opencv + pytesseract to extract content and metadata from formal PDF files.
corpus-judge
Cleaning, setting Justices
corpus-unpdf
Parse Philippine Supreme Court decisions issued in PDF format as text.
markdown-toc-segments
Create markdown segments based on a table of contents
cloudflare-r2
Wrapper around commonly used boto3 functions for the Cloudflare R2 API.
django-fragments
Custom template tags for common html idioms in Django.
django-entries
Entries is a helper Django app for CRUD operations.
sqlpyd
Validate raw content with pydantic for consumption by sqlite-utils.
pylts
Pydantic wrapper around a Litestream-replicated database specific to an AWS bucket.
corpus-pax
Use the GitHub API to pull individuals, orgs, and article content, set up a local SQLite database, and sync images to Cloudflare.
statute-trees
Tree-based Philippine Codifications, Statutes, and Documents, using a uniform node structure (i.e., leaves of a tree) identified by a given material path.
citation-utils
Regex-based matching of docket- and report-styled citations in Philippine Supreme Court decisions.
statute-patterns
Philippine statutory law pattern matching and unit retrieval.
citation-docket
Regex formula of Philippine Supreme Court citations in docket format, i.e. GR, AM, AC, BM.
citation-report
Regex formula of Philippine Supreme Court citations in report format, i.e. SCRA, PHIL, OFFG.
citation-date
Regex date formula and decoder - Philippine Supreme Court Decisions
cloudflare-images
Wrapper around the Cloudflare Images API, usable as a custom Django storage class.
start-cloudflare
A Pydantic BaseSettings class intended to be extended by other Cloudflare services.
start-github
Use a GitHub token to access the API.
start-url
Fetch a URL with httpx.get and extract its metadata with BeautifulSoup.
pagemeta
Extract Open Graph metadata from a URL via BeautifulSoup.
start-sdk
Settings for Github, Cloudflare Images API
corpus-x
Add codification and statute tables to pre-existing corpus-base database.
corpus-base
Add justice, decision, citation, voting, opinion, and segment tables to database.
statute-utils
Helper functions for statutory processing.
text-gists
Helper functions for matching regex and other text-based patterns.
treeish
Tree function helpers: set ids, get node from ids, fetch nodes with key
utils-jinja-sqlite
Common helper utility functions used when connecting to sqlite databases and binding SQL values from python variables.
lawsql-trees
Pull and format statute / codification data (tree structures) from local path to raw yaml files.
acquisition-statute-parser
Parse statutes into fields after acquisition from scraping.
acquisition-decisions-sc
Processes decisions from 1996 to the present. The Philippine Supreme Court's digital library starts in 1996.
acquisition-decisions-legacy
Processes decisions before 1996. The Philippine Supreme Court's digital library starts in 1996.
acquisition-case-transform
Fix typographic errors / non-standard Supreme Court citations
lawsql-justice
Pattern matching for Justices
lawsql-cases-raw
Get decision data from .html and yaml files; updates local database with field lookups.
lawsql-cases-justices
Get justices from yaml file; updates local database.
lawsql-tree-unit
Format units for use in tree structures, e.g. Statutes, Codifications, etc.
statute-matcher
Generate list of Philippine statute designations from a text string.
statute-matcher-regex
Raw regex strings and constructors to create Philippine statutory labels.
statute-serial-number
Uniform serialization of Statutes that map to STATUTEPATH directory
lawsql-utils
lawsql helper functions
decision-segment
Separates decision segments
decision-title
Get & format case titles from cleaned Philippine Supreme Court decisions
django-yabl
Yet another bookmarking library (yabl) for Django. Bookmark and tag arbitrary models.
decision-title-vs-inre
Decision Title Parts and Lines via Regexes
decision-section
Create sections from pre-formatted html of Philippine Supreme Court decisions with specially combined elements.
decision-footnote
Split HTML-style sup-footnotes into a validated list of dicts
django-highlights
Select text and save it as a highlight via htmx
acquisition-ruling-phrase
Pattern matching common phrases in Supreme Court decisions indicating the start of content with doctrinal value.
acquisition-sanitizer
Clean scraped decisions for better pattern matching.
django-add-comments
Add and display htmx comments to arbitrary Django models.
acquisition-extractor
Parse statute and decision data from a specified location