Parallel corpus as a graph.
Project description
parallel-corpus
Parallel corpus as a graph.
Ported from Graph in spraakbanken/swell-editor.
Install
To install parallel-corpus
in the current environment:
pip install parallel-corpus
To add parallel-corpus
to a PDM project:
pdm add parallel-corpus
To add parallel-corpus
manually to pyproject.toml
:
[project]
dependencies = ["parallel-corpus>=0.1.1"]
Usage
first = "Jonathan saknades ."
# Initialize graph with source and target equal.
g = graph.init(first)
second = "Jonat han saknades ."
# Update target with new text.
gm = graph.set_target(g, second)
# The graph will now contain a edge from 'Jonathan' and both 'Jonat' and 'han'.
print(f"{gm.edges=}")
Changelog
This project keeps a changelog.
Development
This project uses conventional commits.
Tools used:
- pdm for project management.
- pre-commit for pre-commit checking
- runs ruff linter
- runs ruff formatter
- checks that commit message is according conventional commits.
- install hooks with
pre-commit install
.
- git-cliff for changelog updates.
- bump-my-version for version bumping.
- syrupy for snapshot testing.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
parallel_corpus-0.1.1.tar.gz
(27.5 kB
view hashes)
Built Distribution
Close
Hashes for parallel_corpus-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 73a211168c58a068ee49cfe13fcfd872dafdc6db930bded057c004e4ec8496d5 |
|
MD5 | 1cae3c8891bda828480cb34ebc8dc6a4 |
|
BLAKE2b-256 | a1d02cfb552ba18ef66f8e8ab0f6567812c1fcff84722109edd088a0fb085c52 |