Skip to main content

A utility package for NLP and Machine Learning

Project description

Chonker Logo

Chonker: Data Utilities for NLP and Machine Learning

Welcome to Chonker! This code was assembled to provide a library to make data-intensive tasks just a little easier. Right now, Chonker is quite small: made up of one key module (wrangle) with one more currently in development (chonktorch). As a full-time student/researcher, 100% of this code develops from use in my ongoing projects. Chonker provides a neat package for myself and others who want to reuse convenient data functions instead of rebuilding them every time. If you somehow happen upon this library, I hope that something in here may be of use to you too!



A module containing classes and functions designed to streamline data pre- and post-processing (i.e. wrangling)

chonker.chonktorch (forthcoming)

A module for pre-built neural models for NLP, based off of my research and built on PyTorch


Chonker is a work-in-progress with few guarantees. However, it is also open source and free to be modified and distributed, as long as those modifications remain free. As such, Chonker is licensed by the Mozilla Public License 2.0.


C.M. Downey

PhD Student

Department of Linguistics

University of Washington

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

chonker-1.4.3-py3-none-any.whl (19.4 kB view hashes)

Uploaded py3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page