A Framework for Cross-Dialectal NLP
Project description
Multi-VALUE: The VernAcular Language Understanding Evaluation benchmark
Setup
Prerequisites:
- Create a virtual environment
conda create --name value python=3.7.13
conda activate value
- Install requirements:
pip install -r requirements.txt
- Install spaCy English pipeline and nltk wordnet
bash downloads.sh
- Confirm that your setup is correct by running the unittest
python -m unittest tests.py
Build Multi-VALUE CoQA (optional)
- Pull data
bash pull_coqa.sh
- Run for each dialect
python -m src.build_coqa_value --dialect aave &
python -m src.build_coqa_value --dialect appalachian &
python -m src.build_coqa_value --dialect chicano &
python -m src.build_coqa_value --dialect indian &
python -m src.build_coqa_value --dialect multi &
python -m src.build_coqa_value --dialect singapore &
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
value_nlp-0.1.2.tar.gz
(4.0 MB
view hashes)