A Framework for Cross-Dialectal NLP
Project description
Multi-VALUE: The VernAcular Language Understanding Evaluation benchmark
Setup
Prerequisites:
- Create a virtual environment:
conda create --name value python=3.7.13
conda activate value
- Install requirements:
pip install -r requirements.txt
- Install the spaCy English pipeline and NLTK WordNet:
bash downloads.sh
- Confirm that your setup is correct by running the unit tests:
python -m unittest tests.py
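If the unit tests fail at import time, a quick check of the environment can help narrow things down. The snippet below is a minimal sketch, not part of the project: `missing_modules` is a hypothetical helper, and the import names `spacy` and `nltk` are assumed from the setup steps above. It only reports whether the packages can be located; it does not check that the spaCy English pipeline or the WordNet data were downloaded.

```python
import importlib.util

# Import names assumed from the setup steps above (spaCy and NLTK).
REQUIRED_MODULES = ("spacy", "nltk")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be located in this environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Calling `missing_modules()` inside the activated `value` environment should return an empty list once `pip install -r requirements.txt` has completed.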
Build Multi-VALUE CoQA (optional)
- Pull the data:
bash pull_coqa.sh
- Run the build for each dialect (the trailing `&` starts each command in the background, so the builds run in parallel):
python -m src.build_coqa_value --dialect aave &
python -m src.build_coqa_value --dialect appalachian &
python -m src.build_coqa_value --dialect chicano &
python -m src.build_coqa_value --dialect indian &
python -m src.build_coqa_value --dialect multi &
python -m src.build_coqa_value --dialect singapore &
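Because the commands above are started in the background, the shell does not report when all of them have finished. One way to launch the same builds and wait for them is sketched below. This is an illustration under stated assumptions rather than a script shipped with the project: `build_command` and `run_all` are hypothetical helpers, the dialect names are copied from the commands above, and the script is assumed to run from the repository root after `bash pull_coqa.sh`.

```python
import subprocess
import sys

# Dialect names copied from the documented commands.
DIALECTS = ("aave", "appalachian", "chicano", "indian", "multi", "singapore")


def build_command(dialect):
    """Return the argv list for one build, mirroring the documented command."""
    # sys.executable is the interpreter of the currently active environment.
    return [sys.executable, "-m", "src.build_coqa_value", "--dialect", dialect]


def run_all(dialects=DIALECTS):
    """Start one build per dialect in parallel, then wait for all of them.

    Returns a dict mapping each dialect to its process exit code.
    """
    procs = {d: subprocess.Popen(build_command(d)) for d in dialects}
    return {d: proc.wait() for d, proc in procs.items()}
```

Calling `run_all()` returns once every build has exited; a non-zero value in the result marks a dialect whose build failed.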
Download files
Source Distribution
value_nlp-0.1.1.tar.gz (4.0 MB)
Built Distribution
Hashes for value_nlp-0.1.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | fed23fdc9a3a9136a58de52b07ce45dcbe493a71611b43dcd1b291b22009a4fd
MD5 | 6da858c658cb8be68f67b1c21735603b
BLAKE2b-256 | 5d90f7adf6fe0e540f08b500e56ec6a04483e87cd980408e2c98b2f97c64917e
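A downloaded file can be compared against the digests listed above before installing it. The following is a minimal sketch using only the Python standard library; `file_digest` and `matches` are hypothetical helper names, and the local path to the wheel is an assumption.

```python
import hashlib

# SHA256 digest listed above for value_nlp-0.1.1-py3-none-any.whl.
EXPECTED_SHA256 = "fed23fdc9a3a9136a58de52b07ce45dcbe493a71611b43dcd1b291b22009a4fd"


def file_digest(path, algorithm="sha256", chunk_size=1 << 20):
    """Hash a file in chunks so large downloads are not read into memory at once."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches(path, expected_hex, algorithm="sha256"):
    """Return True when the file's digest equals the expected hex string."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

For example, `matches("value_nlp-0.1.1-py3-none-any.whl", EXPECTED_SHA256)` should return `True` for an intact download saved under that name in the current directory.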