Applying pdfplumber + opencv + pytesseract to extract content and metadata from formal PDF files.
Cleaning, setting Justices
Parse Philippine Supreme Court decisions issued in PDF format as text.
Create markdown segments based on a table of contents
Wrapper around commonly used boto3 functions for the Cloudflare R2 API.
Custom template tags for common html idioms in Django.
Entries is a helper Django app for CRUD operations.
Validate raw content with pydantic for consumption by sqlite-utils.
Pydantic wrapper around litestreamed database specific to an AWS bucket.
Using the GitHub API (to pull individuals, orgs, and article content), set up a local sqlite database and sync images to Cloudflare.
Tree-based Philippine Codifications, Statutes, and Documents, using a uniform node structure (i.e., leaves of a tree) identified by a given material path.
Regex-based docket- and report- styled citations based on Philippine Supreme Court decisions.
Philippine statutory law pattern matching and unit retrieval.
Regex formula of Philippine Supreme Court citations in docket format, i.e. GR, AM, AC, BM.
Regex formula of Philippine Supreme Court citations in report format, i.e. SCRA, PHIL, OFFG.
Regex date formula and decoder - Philippine Supreme Court Decisions
Wrapper around the Cloudflare Images API, usable as a custom Django storage class.
A Pydantic BaseSettings class intended to be extended by other Cloudflare services.
Use a GitHub token to access the API
Fetch URL metadata with httpx.get and parse it with BeautifulSoup
Extract Open Graph metadata from a URL via BeautifulSoup.
Settings for the GitHub and Cloudflare Images APIs
Add codification and statute tables to pre-existing corpus-base database.
Add justice, decision, citation, voting, opinion, and segment tables to database.
Helper functions for statutory processing.
Helper functions for matching regex and other text-based patterns.
Tree function helpers: set ids, get node from ids, fetch nodes with key
Common helper utility functions used when connecting to sqlite databases and binding SQL values from python variables.
Pull and format statute / codification data (tree structures) from local path to raw yaml files.
Parse statutes into fields after acquisition from scraping.
Processes decisions from 1996 to the present. The Philippine Supreme Court's digital library starts in 1996.
Processes decisions before 1996. The Philippine Supreme Court's digital library starts in 1996.
Fix typographic errors / non-standard Supreme Court citations
Pattern matching for Justices
Get decision data from .html and yaml files; update local database with field lookups.
Get justices from yaml file; update local database.
Format units for use in tree structures, e.g. Statutes, Codifications, etc.
Generate list of Philippine statute designations from a text string.
Raw regex strings and constructors to create Philippine statutory labels.
Uniform serialization of Statutes that map to STATUTEPATH directory
lawsql helper functions
Separates decision segments
Get & format case titles from cleaned Philippine Supreme Court decisions
Yet another bookmarking library (yabl) for Django. Bookmark and tag arbitrary models.
Decision Title Parts and Lines via Regexes
Create sections from pre-formatted html of Philippine Supreme Court decisions with specially combined elements.
Split HTML-style sup footnotes into a validated list of dicts
Select text and save it as a highlight via htmx
Pattern matching common phrases in Supreme Court decisions indicating the start of content with doctrinal value.
Clean scraped decisions for better pattern matching.
Add and display htmx comments to arbitrary Django models.
Parse statute and decision data from a specified location
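
To illustrate the docket-format citation entry above (GR, AM, AC, BM), here is a minimal sketch using only Python's standard `re` module. The `DOCKET` pattern and the `find_dockets` helper are hypothetical names made up for this example and are not taken from any of the listed packages; real Philippine Supreme Court citations have many more variants than this pattern covers.

```python
import re

# Hypothetical pattern (an assumption, not the listed package's regex):
# a category abbreviation, then "No." or "Nos.", then a serial.
DOCKET = re.compile(
    r"""
    \b(?P<cat>G\.?\s?R|A\.?\s?M|A\.?\s?C|B\.?\s?M)\.?   # GR, AM, AC, BM with optional periods
    \s*Nos?\.?\s*                                       # No. or Nos.
    (?P<serial>[\w-]+)                                  # e.g. 123456 or RTJ-12-2345
    """,
    re.VERBOSE | re.IGNORECASE,
)


def find_dockets(text: str) -> list[tuple[str, str]]:
    """Return (category, serial) pairs, with the category reduced to two uppercase letters."""
    results = []
    for match in DOCKET.finditer(text):
        # Strip periods and spaces so "G.R" and "G R" both become "GR".
        category = re.sub(r"[^A-Za-z]", "", match.group("cat")).upper()
        results.append((category, match.group("serial")))
    return results
```

Calling `find_dockets("See G.R. No. 123456, and A.M. No. RTJ-12-2345.")` would yield one pair per citation found. Named groups keep the category and serial separate, so a caller can normalize each part independently.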