Skip to main content

Library devoted to Document level Attitude and Relation Extraction for text objects with entity-linking (EL) API support

Project description

AREkit 0.22.1

AREkit (Attitude and Relation Extraction Toolkit) -- is a python toolkit, devoted to document level Attitude and Relation Extraction between text objects from mass-media news.

Description

This toolkit aims to solve data preparation problems in Relation Extraction related taks, considiering such factors as:

  • 🔗 EL (entity-linking) API support for objects,
  • ➰ avoidance of cyclic connections,
  • :straight_ruler: distance consideration between relation participants (in terms or sentences),
  • 📑 relations annotations and filtering rules,
  • *️⃣ entities formatting or masking, and more.

Using AREkit you may focus on preparation and experiments with your ML-models by shift all the data-preparation part onto toolset of this project (tutorial). In order to do so, we provide:

  • :file_folder: API for external collection binding (native support of BRAT-based exported annotations)
  • ➿ pipelines and iterators for handling large-scale collections serialization without out-of-memory issues.
  • evaluators which allows you to assess your trained model.

AREkit complements the OpenNRE functionality since document-level RE setting is not widely explored (2.4 [paper]). The core functionality includes (1) API for document presentation with EL (Entity Linking, i.e. Object Synonymy) support for sentence level relations preparation (dubbed as contexts) (2) API for contexts extraction (3) relations transferring from sentence-level onto document-level, and more. It providers contrib modules of neural networks (like OpenNRE) applicable for sentiment attitude extraction task.

Installation

pip install git+https://github.com/nicolay-r/AREkit.git@0.22.1-rc

Download Resources

from arekit.data import download_data
download_data()

Applications

  • ARElight [site] [github]
    • Infer attitudes from large Mass-media documents or sample texts for your Machine Learning models applications

Papers

  • Frame-Based attitude extraction workflow for news processing [code]
    • Represents an attitude annotation workflow based on RuSentiFrames lexicon which is utilized for news processing;
  • Neural Networks Applications in Sentiment Attitude Extraction [code]
    • Neural Networks application for attitude extraction from analytical articles;
  • BERT-based model utils for Sentiment Attitude Extraction task [code]
    • Analytical news formatter for BERT-based models;

Related Frameworks

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arekit-0.22.1.tar.gz (160.1 kB view details)

Uploaded Source

Built Distribution

arekit-0.22.1-py3-none-any.whl (317.5 kB view details)

Uploaded Python 3

File details

Details for the file arekit-0.22.1.tar.gz.

File metadata

  • Download URL: arekit-0.22.1.tar.gz
  • Upload date:
  • Size: 160.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for arekit-0.22.1.tar.gz
Algorithm Hash digest
SHA256 3912fa2ae6597b4c7f5c4a1f558724571ea9b76614ee66dd5af054c937e80a8b
MD5 e8d61e67491097adaf2e3bb1ef14543b
BLAKE2b-256 fad122260c6ce32afc2e12402007b6ef22863a698a8a68bf3c3b52ad4143dd1a

See more details on using hashes here.

File details

Details for the file arekit-0.22.1-py3-none-any.whl.

File metadata

  • Download URL: arekit-0.22.1-py3-none-any.whl
  • Upload date:
  • Size: 317.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for arekit-0.22.1-py3-none-any.whl
Algorithm Hash digest
SHA256 08928d3c971fa18264f21c847163c013c0731db4359716b7b7f9b1ad50ba515c
MD5 5a07353bdc8131327f570983971e5a1f
BLAKE2b-256 ccee521b5370fde05f9774e832a4448885db8694cbba401dde03f4fcd3fbd04f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page