A language modeling toolkit
A neural language modeling toolkit built on PyTorch.
This is a scientific piece of code, so expect rough edges.
BrnoLM has so far powered language modeling in the following papers:
- Beneš et al. Text Augmentation for Language Models in High Error Recognition Scenario
- Žmolíková et al. BUT System for CHiME-6 Challenge
- Beneš et al. i-vectors in language modeling: An efficient way of domain adaptation for feed-forward models
- Beneš et al. Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources
To install, clone this repository and exploit the provided
git clone email@example.com:BUTSpeechFIT/BrnoLM.git cd BrnoLM pip install . # or, if you don't care about environmental pollution: python setup.py install
Occasionally, a PIP version is produced, so you can simply
pip install brnolm to obtain the last pre-packed version.
The above way of installation takes care of dependencies. If you want to prepare an environment yourself, know that BrnoLM requires:
torch numpy scikit-learn
Exact tested versions are provided in
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.