Skip to main content

spaCy extension for kenLM.

Project description

## spacy_kenlm: KenLM extension for spaCy 2.0

This package adds [kenLM](https://github.com/kpu/kenlm) support
as a [spaCy 2.0 extension](https://spacy.io/usage/processing-pipelines#extensions).

## Usage

Train a `kenLM` language model first (or use the test model from `test.arpa`).

Add the spaCyKenLM to the spaCy pipeline to return scores.

```
import spacy
from spacy_kenlm import spaCyKenLM

nlp = spacy.load('en_core_web_sm')

spacy_kenlm = spaCyKenLM() # default model from test.arpa

nlp.add_pipe(spacy_kenlm)

doc = nlp('How are you?')

# doc score
doc._.kenlm_score

# span score
doc[:2]._.kenlm_score

# token score
doc[2]._.kenlm_score
```

## Installation

Install from the pip package.

```
pip install spacy_kenlm
```

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spacy_kenlm-0.1.1.tar.gz (3.2 kB view details)

Uploaded Source

File details

Details for the file spacy_kenlm-0.1.1.tar.gz.

File metadata

File hashes

Hashes for spacy_kenlm-0.1.1.tar.gz
Algorithm Hash digest
SHA256 33f2b7dc71e42b92e1bf65bd3a572accebd79c7ea707517a8bfa8442210b797f
MD5 cc71fab2d07b5ad324d4e86691cf854b
BLAKE2b-256 8dacdc62d6eef6d0f5eeb3f6e6cf34c47ee6e4b6d11141daa93d15a53ee3a18a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page