Skip to main content

A SpaCy wrapper for the GLiNER model for enhanced Named Entity Recognition capabilities

Project description

GLiNER SpaCy Wrapper

Introduction

This project is a wrapper for integrating GLiNER, a Named Entity Recognition (NER) model, with the SpaCy Natural Language Processing (NLP) library. GLiNER, which stands for Generalized Language INdependent Entity Recognition, is an advanced model for recognizing entities in text. The SpaCy wrapper enables easy integration and use of GLiNER within the SpaCy environment, enhancing NER capabilities with GLiNER's advanced features.

Features

  • Integrates GLiNER with SpaCy for advanced NER tasks.
  • Customizable chunk size for processing large texts.
  • Support for specific entity labels like 'person' and 'organization'.
  • Configurable output style for entity recognition results.

Installation

To install this library, install it via pip:

pip install gliner-spacy

Usage

To use this wrapper in your SpaCy pipeline, follow these steps:

  1. Import SpaCy and the GLiNER SpaCy wrapper.
  2. Create a SpaCy Language instance.
  3. Add the gliner_spacy component to the SpaCy pipeline.
  4. Process text using the pipeline.

Example code:

import spacy
from gliner_spacy.pipeline import GlinerSpacy

nlp = spacy.blank("en")
nlp.add_pipe("gliner_spacy")
text = "This is a text about Bill Gates and Microsoft."
doc = nlp(text)

for ent in doc.ents:
    print(ent.text, ent.label_)

Expected Output

Bill Gates person
Microsoft organization

Configuration

The default configuration of the wrapper can be modified according to your requirements. The configurable parameters are:

  • gliner_model: The GLiNER model to be used.
  • chunk_size: Size of the text chunk to be processed at once.
  • labels: The entity labels to be recognized.
  • style: The style of output for the entities (either 'ent' or 'span').

Contributing

Contributions to this project are welcome. Please ensure that your code adheres to the project's coding standards and include tests for new features.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gliner-spacy-0.0.1.tar.gz (3.4 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page