No project description provided

Project description

The text2story package

The text2story main package contains the main classes and methods for the T2S pipeline: from text to formal representation to visualization or other representation.

Relation to Brat2Viz

The Text2Story package is a generalization of Brat2Viz and should in fact contain all the funcionalities and variants of the T2S project output.

Getting Start.
The Framework Structure.
The Annotators.
Installation.
- 4.1. Linux Ubuntu
- 4.2. Windows
The Web App.

1. Getting Started

The main goal of the text2story is to extract narrative from raw text. The narrative components comprise events, the participants (or participants) in the events, and the time expressions.

Event: Eventuality that happens or occurs or state or circumstance that is temporally relevant
Time: Temporal expressions that represent units of time.
Participants: Named entities, or participants, that play an important role in the event or state.

These elements relate to each other by some relations, like Semantic Role Links and Objectal Links.

Semantic Role Links: The identification of the way an entity is involved/participates in an eventuality. For instance, there is the "agent" semantic role link, in which an event is linked to a participant that intentionally caused it.

A simple code to perform the extraction of the narrative elements, and the two type of relations described above is like the following.

import text2story as t2s # Import the package

t2s.load("en") # Load the pipelines for the English language

text = 'On Friday morning, Max Healthcare, which runs 10 private hospitals around Delhi, put out an "SOS" message, saying it had less than an hour\'s supply remaining at two of its sites. The shortage was later resolved.'

doc = t2s.Narrative('en', text, '2020-05-30')

doc.extract_participants() # Extraction done with all tools.
doc.extract_participants('spacy', 'nltk') # Extraction done with the SPACY and NLTK tools.
doc.extract_participants('srl') # Extraction done with just the SRL tool.

doc.extract_times() # Extraction done with all tools 

doc.extract_events() # Extraction of events with all tools
doc.extract_semantic_role_link() # Extraction of semantic role links with all tools (should be done after extracting events since most semantic relations are between an participant and an event)

ann_str = doc.ISO_annotation() # Outputs ISO annotation in .ann format (txt) in a file called 'annotations.ann'
with open('annotations.ann', "w") as fd:
    fd.write(ann_str)

2. Framework Structure

.
│   README.md
|   env.yml
│   requirements.txt
|
└──Text2Story
      └──core
      │   │   annotator.py (META-annotator)
      │   │   entity_structures.py (ParticipantEntity, TimexEntity and EventEntity classes)
      │   |   exceptions.py (Exceptions raised by the package)
      │   |   link_structures.py (TemporalLink, AspectualLink, SubordinationLink, SemanticRoleLink and ObjectalLink classes)
      │   |   narrative.py (Narrative class)
      │   |   utils.py (Utility functions)
      │
      └───readers (tools to support the reading of some specific kind of annotated corpus)
      |   | read.py (Abstract class: defines the structure of a reader)
      |   | TokenCorpus (Internal representation of a token, its annotations and relations)
      |   | read_brat.py (it reads annotated file of type supported the BRAT annotation tool)
      |   | read_ecb.py (it processes ecb+ corpus format)
      |   | read_framenet.py (it processes Framenet corpus format)
      |   | read_propbank.py (it processes Propbank corpus format)  
      └───annotators (tools supported by the package to do the extractions)
      |   |   NLTK
      |   │   PY_HEIDELTIME
      |   |   BERTNERPT
      |   |   TEI2GO (requires the manual installation for each used model)
      |   |   SPACY
      |   |   SRL
      └───experiments
      |   |   evaluation.py (it performs batch evaluation of narrative corpora)
      |   |   metrics.py (it implements some specific metrics, like relaxed recall and relaxed precision)
      |   |   stats.py (it counts some narrative elements, and produce some stats of the narrative corpora)
      └───visualization
      |   |   brat2viz: a module that converts a BRAT annotation file to visual representations, like  Message Sequence Chart (MSC) and (Knowledge Graph) KG
      |   |   viz: a module that contain bubble_tikz.py, a class dedicate to build Bubble diagrams
      
└── Webapp
      |  backend.py
      |  main.py
      |  session_state.py
      |  input_phase.py
      |  output_phase.py

3. The Annotators

All annotators have the same interface: they implement a function called 'extract_' followed by the name of the particular extraction. E.g., if they are extracting participants, then they implement a function named 'extract_participants', with two arguments: the language of text and the text itself.

Extractions	Interface	Supporting tools
Participant	extract_participants(lang, text)	SPACY, NLTK , SRL, BERTNERPT
Timexs	extract_timexs(lang, text, publication_time)	PY_HEIDELTIME
Event	extract_events(lang, text, publication_time)	SRL
SemanticLink	extract_semantic_role_link(lang, text, publication_time)	SRL

To change some model used in the supported tools, just go to text2story/annotators/ANNOTATOR_TO_BE_CHANGED and change the model in the file: __init__.py.

To add a new tool, add a folder to text2story/annotators with the name of the annotator all capitalized (just a convention; useful to avoid name colisions). In that folder, create a file called '__init__.py' and there implement a function load() and the desired extraction functions. The function load() should load the pipeline to some variable defined by you, so that, every time we do an extraction, we don't need to load the pipeline all over again. (Implement it, even if your annotator doesn't load anything. Leave it with an empty body.)

In the text2story.annotators.__init__.py file, add a call to the load() function, and to the extract functions. (See the already implemented tools for guidance.)

PS: Don't forget to normalize the labels to our semantic framework!

4. Installation

4.1 Linux / Ubuntu

The installation requires graphviz software, the latex suite and the software poppler to convert pdf to png. In Linux, to install these software open a terminal and type the following commands:

sudo apt-get install graphviz libgraphviz-dev texlive-latex-base  texlive-latex-extra poppler-utils

After that, create a virtual enviroment using venv or other tool of your preference. For instance, using the following command in the prompt line:

$ python3 -m venv venv

Then, activate the virtual enviroment in the prompt line. Like, the following command:

$ source venv/bin/activate

After that, you are ready to install

4.2 Windows

First, make sure you have Microsoft C++ Build Tools. Then install graphviz software by download one suitable version in this link. Next, install the latex-suite like these tutorial explains. Then, install Popple packed for windows, which you download here.

Finnally, you can install text2story using pip. If it did not recognize the graphviz installation, then you can use the following command for pip (tested in pip == 21.1.1).

pip install text2story  --global-option=build_ext --global-option="-IC:\Program Files\Graphviz\include" --global-option="-LC:\Program Files\Graphviz\lib\"

For newer version of pip (tested in pip == 23.1.2), you can type the following command:

pip install --use-pep517  --config-setting="--global-option=build_ext"  --config-setting="--global-option=-IC:\Program Files\Graphviz\include" --config-setting="--global-option=-LC:\Program Files\Graphviz\lib"

Web App

#### Web app
```ssh
python backend.py
streamlit run main.py

and a page on your browser will open!

Project details

Release history Release notifications | RSS feed

1.6.11

Mar 19, 2026

1.6.10.1

Dec 19, 2025

1.6.10

Dec 19, 2025

1.6.9.3

Dec 17, 2025

1.6.9.2

Dec 17, 2025

1.6.9.1

Dec 17, 2025

1.6.9

Dec 17, 2025

1.6.8.6

Nov 5, 2025

1.6.8.5

Nov 5, 2025

1.6.8.4

Oct 31, 2025

1.6.8.3

Oct 31, 2025

1.6.8.2

Oct 31, 2025

1.6.8.1

Oct 31, 2025

1.6.8

Oct 30, 2025

1.6.7

Oct 29, 2025

1.6.6.4

Oct 28, 2025

1.6.6.3

Oct 28, 2025

1.6.6.2

Oct 28, 2025

1.6.6.1

Oct 27, 2025

1.6.6

Oct 23, 2025

1.6.5

Oct 23, 2025

1.6.4.5

Oct 8, 2025

1.6.4.4

Oct 8, 2025

1.6.4.3

Oct 8, 2025

1.6.4.2

Oct 7, 2025

1.6.4.1

Oct 7, 2025

1.6.4

Oct 7, 2025

1.6.3

Oct 7, 2025

This version

1.6.2

May 15, 2025

1.6.1

Mar 26, 2025

1.6.0

Jul 2, 2024

1.5.1

Jun 4, 2024

1.5.0

Jun 4, 2024

1.4.9

May 15, 2024

1.4.8

May 6, 2024

1.4.7

May 6, 2024

1.4.6

May 6, 2024

1.4.5

May 6, 2024

1.4.4

Nov 22, 2023

1.4.3

Nov 8, 2023

1.4.2

Oct 18, 2023

1.4.1 yanked

Sep 21, 2023

1.4.0 yanked

Sep 19, 2023

1.4.0.dev8 pre-release yanked

Sep 21, 2023

1.4.0.dev7 pre-release yanked

Sep 20, 2023

1.4.0.dev6 pre-release yanked

Sep 20, 2023

1.4.0.dev5 pre-release yanked

Sep 20, 2023

1.4.0.dev4 pre-release yanked

Sep 20, 2023

1.4.0.dev3 pre-release yanked

Sep 20, 2023

1.4.0.dev2 pre-release yanked

Sep 20, 2023

1.4.0.dev1 pre-release yanked

Sep 20, 2023

1.4.0.dev0 pre-release yanked

Sep 20, 2023

1.3.11 yanked

Sep 13, 2023

1.3.10 yanked

Sep 13, 2023

1.3.9 yanked

Sep 13, 2023

1.3.8 yanked

Sep 13, 2023

1.3.7 yanked

Sep 13, 2023

1.3.6 yanked

Sep 13, 2023

1.3.5 yanked

Sep 12, 2023

1.3.4 yanked

Jun 16, 2023

1.3.3 yanked

Jun 13, 2023

1.3.2 yanked

Jun 12, 2023

1.3.1 yanked

Jun 12, 2023

1.3.0 yanked

Jun 12, 2023

1.2.25 yanked

Jun 1, 2023

1.2.24 yanked

Jun 1, 2023

1.2.23 yanked

May 31, 2023

1.2.22 yanked

May 31, 2023

1.2.21 yanked

May 31, 2023

1.2.20 yanked

May 31, 2023

1.2.18 yanked

May 30, 2023

1.2.17 yanked

May 29, 2023

1.2.16 yanked

May 26, 2023

1.2.15 yanked

May 26, 2023

1.2.14 yanked

May 26, 2023

1.2.13 yanked

May 26, 2023

1.2.12 yanked

May 26, 2023

1.2.11 yanked

May 25, 2023

1.2.10 yanked

May 25, 2023

1.2.9 yanked

May 24, 2023

1.2.8 yanked

May 24, 2023

1.2.7 yanked

May 24, 2023

1.2.6 yanked

May 24, 2023

1.2.5 yanked

May 17, 2023

1.2.4 yanked

May 17, 2023

1.2.3 yanked

May 16, 2023

1.2.2 yanked

May 16, 2023

1.2.1 yanked

May 16, 2023

1.2.0 yanked

May 10, 2023

1.1.28 yanked

May 10, 2023

1.1.27 yanked

Mar 31, 2023

1.1.26 yanked

Mar 31, 2023

1.1.25 yanked

Mar 29, 2023

1.1.24 yanked

Mar 29, 2023

1.1.9 yanked

Dec 12, 2022

1.0.9 yanked

Nov 4, 2022

1.0.8 yanked

Nov 4, 2022

1.0.6 yanked

Nov 4, 2022

1.0.5 yanked

Oct 26, 2022

1.0.4 yanked

Oct 26, 2022

1.0.3 yanked

Oct 25, 2022

1.0.2 yanked

Oct 24, 2022

1.0.0 yanked

Oct 21, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text2story-1.6.2.tar.gz (1.4 MB view details)

Uploaded May 15, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

text2story-1.6.2-py3-none-any.whl (1.4 MB view details)

Uploaded May 15, 2025 Python 3

File details

Details for the file text2story-1.6.2.tar.gz.

File metadata

Download URL: text2story-1.6.2.tar.gz
Upload date: May 15, 2025
Size: 1.4 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.19

File hashes

Hashes for text2story-1.6.2.tar.gz
Algorithm	Hash digest
SHA256	`70a41d3b5630696d16a78dc12a4257e0924b0ea956523433d69fab6b2c28f238`
MD5	`ff3526d3eabb6bf5534db6427aae951b`
BLAKE2b-256	`2e76119a3b61a3e6d552fd196d41ef9d0fb65389f7697d20866110cbdcef46eb`

See more details on using hashes here.

File details

Details for the file text2story-1.6.2-py3-none-any.whl.

File metadata

Download URL: text2story-1.6.2-py3-none-any.whl
Upload date: May 15, 2025
Size: 1.4 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.19

File hashes

Hashes for text2story-1.6.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`83b232ccf5596972e27abefa12ea8d35c42da94883f42bd75f8f4e4ab17b976c`
MD5	`f5417f392c481f30f9df1afb8f641b3a`
BLAKE2b-256	`52ac0890f48932dbca7ba8d5ecfc7784c7aeaf45003618f1eafc2787f5b66564`

See more details on using hashes here.

text2story 1.6.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Project description

The text2story package

Table of Contents

1. Getting Started

2. Framework Structure

3. The Annotators

4. Installation

4.1 Linux / Ubuntu

4.2 Windows

Web App

Project details

Verified details

Maintainers

Unverified details

Project links

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes