Library devoted to Document level Attitude and Relation Extraction for text objects with entity-linking (EL) API support

These details have not been verified by PyPI

Project links

Homepage

Project description

AREkit 0.22.1

AREkit (Attitude and Relation Extraction Toolkit) -- is a python toolkit, devoted to document level Attitude and Relation Extraction between text objects from mass-media news.

Description

This toolkit aims to solve data preparation problems in Relation Extraction related taks, considiering such factors as:

🔗 EL (entity-linking) API support for objects,
➰ avoidance of cyclic connections,
:straight_ruler: distance consideration between relation participants (in terms or sentences),
📑 relations annotations and filtering rules,
*️⃣ entities formatting or masking, and more.

Using AREkit you may focus on preparation and experiments with your ML-models by shift all the data-preparation part onto toolset of this project (tutorial). In order to do so, we provide:

:file_folder: API for external collection binding (native support of BRAT-based exported annotations)
➿ pipelines and iterators for handling large-scale collections serialization without out-of-memory issues.
evaluators which allows you to assess your trained model.

AREkit complements the OpenNRE functionality since document-level RE setting is not widely explored (2.4 [paper]). The core functionality includes (1) API for document presentation with EL (Entity Linking, i.e. Object Synonymy) support for sentence level relations preparation (dubbed as contexts) (2) API for contexts extraction (3) relations transferring from sentence-level onto document-level, and more. It providers contrib modules of neural networks (like OpenNRE) applicable for sentiment attitude extraction task.

Installation

pip install git+https://github.com/nicolay-r/AREkit.git@0.22.1-rc

Download Resources

from arekit.data import download_data
download_data()

Applications

ARElight [site] [github]
- Infer attitudes from large Mass-media documents or sample texts for your Machine Learning models applications

Papers

Frame-Based attitude extraction workflow for news processing [code]
- Represents an attitude annotation workflow based on RuSentiFrames lexicon which is utilized for news processing;
Neural Networks Applications in Sentiment Attitude Extraction [code]
- Neural Networks application for attitude extraction from analytical articles;
BERT-based model utils for Sentiment Attitude Extraction task [code]
- Analytical news formatter for BERT-based models;

Related Frameworks

OpenNRE [github] [paper]
DeRE [github] [paper]
CREST [github] [paper]

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.25.0

Sep 29, 2024

0.24.0

Nov 6, 2023

0.23.1

Nov 6, 2023

0.23.0

Nov 6, 2023

This version

0.22.1

Nov 7, 2023

0.22.0

Nov 7, 2023

0.21.0

Nov 7, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arekit-0.22.1.tar.gz (160.1 kB view details)

Uploaded Nov 7, 2023 Source

Built Distribution

arekit-0.22.1-py3-none-any.whl (317.5 kB view details)

Uploaded Nov 7, 2023 Python 3

File details

Details for the file arekit-0.22.1.tar.gz.

File metadata

Download URL: arekit-0.22.1.tar.gz
Upload date: Nov 7, 2023
Size: 160.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for arekit-0.22.1.tar.gz
Algorithm	Hash digest
SHA256	`3912fa2ae6597b4c7f5c4a1f558724571ea9b76614ee66dd5af054c937e80a8b`
MD5	`e8d61e67491097adaf2e3bb1ef14543b`
BLAKE2b-256	`fad122260c6ce32afc2e12402007b6ef22863a698a8a68bf3c3b52ad4143dd1a`

See more details on using hashes here.

File details

Details for the file arekit-0.22.1-py3-none-any.whl.

File metadata

Download URL: arekit-0.22.1-py3-none-any.whl
Upload date: Nov 7, 2023
Size: 317.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for arekit-0.22.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`08928d3c971fa18264f21c847163c013c0731db4359716b7b7f9b1ad50ba515c`
MD5	`5a07353bdc8131327f570983971e5a1f`
BLAKE2b-256	`ccee521b5370fde05f9774e832a4448885db8694cbba401dde03f4fcd3fbd04f`