Auto DC
Project description
AutoDC
Automated data-centric processing
This repository is the official Python implementation of position paper "AutoDC: Automated data-centric processing". The implementation will continue being updated in the coming months.
AutoDC is a framework to enable domain experts to automatically and systematically improve datasets without much coding requirement and manual process, the idea similar with AutoML (automated machine learning).
By using the AutoML system, such as Google Cloud AutoML, domain experts only need to bring in the input data, and AutoML takes care of the manual ML processes, then produces output predictions, along with user-defined evaluation metrics. With a similar idea, AutoDC is designed for domain experts to bring in a labeled dataset, such as annotated images, to the system; AutoDC takes care of the manual data improvement processes, and produces the improved dataset, by automatically correcting the incorrect labels (with user feedbacks), detecting edge cases, and augmenting edge cases.
Citation
Zac Yung-Chun Liu, Shoumik Roychowdhury, Scott Tarlow, Akash Nair, Shweta Badhe, and Tejas Shah. AutoDC: Automated data-centric processing, NeurIPS 2021: DCAI workshop, arXiv: 2111.12548.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file autodc-1.0.6.tar.gz
.
File metadata
- Download URL: autodc-1.0.6.tar.gz
- Upload date:
- Size: 8.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | dd1228276bcd8e8674f5c0ddf8da81a45c6c5646e71ce59cdbbe620b1b0c72dd |
|
MD5 | 24b5ceecac76f88aed11f218c3d14988 |
|
BLAKE2b-256 | ef118ad2dbf9c3c35cf4e6bbd0c012772b252152ae65f3558ed402a6ba1f33a1 |
File details
Details for the file autodc-1.0.6-py3-none-any.whl
.
File metadata
- Download URL: autodc-1.0.6-py3-none-any.whl
- Upload date:
- Size: 10.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 402d7a23ff3b22e8765a3b2cee290d1e1ccc36a70427317aade646782e814273 |
|
MD5 | 25c6c615e6f2714cb3b9a603ac4fdc88 |
|
BLAKE2b-256 | 5dfe10445b7db4fea9ab9d8ca8ce2650455a9b4743dbb1a3cde03a61d8d3e186 |