Skip to main content

Relationships Extraction from NARrative Documents

Project description

Renard

DOI

Renard (Relationship Extraction from NARrative Documents) is a library for creating and using custom character networks extraction pipelines. Renard can extract dynamic as well as static character networks.

The Renard logo

Installation

You can install the latest version using pip:

pip install renard-pipeline

Currently, Renard supports Python>=3.8,<=3.11

Documentation

Documentation, including installation instructions, can be found at https://compnet.github.io/Renard/

If you need local documentation, it can be generated using Sphinx. From the docs directory, make html should create documentation under docs/_build/html.

Tutorial

Renard's central concept is the Pipeline.A Pipeline is a list of PipelineStep that are run sequentially in order to extract a character graph from a document. Here is a simple example:

from renard.pipeline import Pipeline
from renard.pipeline.tokenization import NLTKTokenizer
from renard.pipeline.ner import NLTKNamedEntityRecognizer
from renard.pipeline.character_unification import GraphRulesCharacterUnifier
from renard.pipeline.graph_extraction import CoOccurrencesGraphExtractor

with open("./my_doc.txt") as f:
	text = f.read()

pipeline = Pipeline(
	[
		NLTKTokenizer(),
		NLTKNamedEntityRecognizer(),
		GraphRulesCharacterUnifier(min_appearance=10),
		CoOccurrencesGraphExtractor(co_occurrences_dist=25)
	]
)

out = pipeline(text)

For more information, see renard_tutorial.py, which is a tutorial in the jupytext format. You can open it as a notebook in Jupyter Notebook (or export it as a notebook with jupytext --to ipynb renard-tutorial.py).

Running tests

Renard uses pytest for testing. To launch tests, use the following command :

poetry run python -m pytest tests

Expensive tests are disabled by default. These can be run by setting the environment variable RENARD_TEST_ALL to 1.

Contributing

see the "Contributing" section of the documentation.

How to cite

If you use Renard in your research project, please cite it as follows:

@Article{Amalvy2024,
  doi	       = {10.21105/joss.06574},
  year	       = {2024},
  publisher    = {The Open Journal},
  volume       = {9},
  number       = {98},
  pages	       = {6574},
  author       = {Amalvy, A. and Labatut, V. and Dufour, R.},
  title	       = {Renard: A Modular Pipeline for Extracting Character
                  Networks from Narrative Texts},
  journal      = {Journal of Open Source Software},
} 

We would be happy to hear about your usage of Renard, so don't hesitate to reach out!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

renard_pipeline-0.6.2.tar.gz (69.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

renard_pipeline-0.6.2-py3-none-any.whl (77.6 kB view details)

Uploaded Python 3

File details

Details for the file renard_pipeline-0.6.2.tar.gz.

File metadata

  • Download URL: renard_pipeline-0.6.2.tar.gz
  • Upload date:
  • Size: 69.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.10 Linux/6.11.10-200.fc40.x86_64

File hashes

Hashes for renard_pipeline-0.6.2.tar.gz
Algorithm Hash digest
SHA256 6c9ab60513d5ce66bf9a3d875a8af46f494f14e26d165b757a6001a1008bab77
MD5 8097c7f0d218067ee638de85d77705e4
BLAKE2b-256 70d8e9323eed778bb06d3c9c016a44f245d5f35c02c24cbda8410419dbd8cdc1

See more details on using hashes here.

File details

Details for the file renard_pipeline-0.6.2-py3-none-any.whl.

File metadata

  • Download URL: renard_pipeline-0.6.2-py3-none-any.whl
  • Upload date:
  • Size: 77.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.10 Linux/6.11.10-200.fc40.x86_64

File hashes

Hashes for renard_pipeline-0.6.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0d67b16d34c30fa78c3a832e2519ddcc199486f71fcb8a00e0a087fe32db19d2
MD5 79e18ad70ce8e0a18c3f9739149f6444
BLAKE2b-256 daf06e25ec5bafff453d030455ac0b453c712dd066fd0b5382465a5c6cb93ad3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page