Append missing model cards to Hugging Face datasets
Datacard
This repo aims to find and fill in the missing model cards for Hugging Face datasets.
If you find this a worthwhile pursuit, feel free to reach out and let's try to make the Hugging Face datasets complete :wink:
Setup
# install poetry
git clone --recurse-submodules --remote-submodules git@github.com:Hugging-Face-Supporter/datacards.git
cd datacards
git submodule update
poetry install
Run
poetry shell
python datacards/main.py
WIP
- Look into how to provide multiple answers in a model card (e.g. the GLUE dataset)
- Find the datasets that are missing information by parsing the README
- Find ways to know what categories are valid answers
- Create method to filter for missing datasets
- Create tool to annotate the datasets
- Toggle between datasets to annotate.
- Save modified files to the README again
- Once done, find ways to create automatic PRs to Hugging Face datasets
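
The README-parsing and filtering items above could be approached roughly as follows. This is only a sketch and not the code in `datacards/main.py`: it assumes each dataset README starts with a YAML front matter block delimited by `---` lines, and the function name `missing_fields` and the field list `EXPECTED_FIELDS` are hypothetical names chosen for illustration. It uses only the standard library and a simple line-based check instead of a full YAML parser, so nested or unusual YAML may be misread.

```python
import re

# Hypothetical set of metadata fields to look for; adjust as needed.
EXPECTED_FIELDS = ("license", "task_categories", "language")

# Front matter: a block between two '---' lines at the very top of the README.
FRONT_MATTER = re.compile(r"\A---\s*\n(.*?)\n---\s*(?:\n|\Z)", re.DOTALL)
# A top-level key is an unindented "name:" at the start of a line.
TOP_LEVEL_KEY = re.compile(r"^([A-Za-z_][\w-]*)\s*:(.*)$")


def missing_fields(readme_text, expected=EXPECTED_FIELDS):
    """Return the expected fields that are absent or empty in the front matter."""
    match = FRONT_MATTER.match(readme_text)
    if match is None:
        # No front matter at all: every expected field is missing.
        return list(expected)

    lines = match.group(1).splitlines()
    filled = set()
    for i, line in enumerate(lines):
        key_match = TOP_LEVEL_KEY.match(line)
        if key_match is None:
            continue
        key = key_match.group(1)
        value = key_match.group(2).strip()
        # Filled if there is a non-empty inline value ...
        has_inline = value not in ("", "[]", "null", "~")
        # ... or if the key is followed by an indented or list continuation line.
        has_block = (
            value == ""
            and i + 1 < len(lines)
            and lines[i + 1].strip() != ""
            and lines[i + 1].startswith((" ", "-"))
        )
        if has_inline or has_block:
            filled.add(key)
    return [field for field in expected if field not in filled]


if __name__ == "__main__":
    example = (
        "---\n"
        "license: mit\n"
        "task_categories:\n"
        "- text-classification\n"
        "language: []\n"
        "---\n"
        "# Example dataset\n"
    )
    print(missing_fields(example))
```

Datasets for which `missing_fields` returns a non-empty list would then be the candidates to annotate.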