
A replaCy component that fixes issue boundaries: it fixes a single or double space at the start and extends the issue to the next word when there is a casing issue.


replaCy Issue Boundary

A replaCy component that fixes issue boundaries:

  • fixes a single or double space at the start.
  • fixes a double comma.
  • fixes a comma at the start.
  • fixes parenthesis spacing.
  • fixes a lowercase first letter.
  • extends the issue to the next word when there is a casing issue.
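
To make the list above concrete, here is a minimal sketch of what these kinds of boundary fixes could look like on a plain string. It is not the component's actual implementation: the function name tidy_suggestion and the at_sentence_start parameter are hypothetical, and the real component works on spaCy spans rather than raw strings. The "extend to next word" behaviour is not covered here.

```python
import re


def tidy_suggestion(text: str, at_sentence_start: bool = False) -> str:
    """Hypothetical illustration of boundary clean-up on a suggestion string."""
    # Drop a single or double space (any run of spaces) at the start.
    text = text.lstrip(" ")
    # Drop a comma at the start, plus any whitespace that follows it.
    text = re.sub(r"^,\s*", "", text)
    # Collapse a double comma (",," or ", ,") into one comma.
    text = re.sub(r",\s*,", ",", text)
    # Remove spaces just inside parentheses: "( word )" -> "(word)".
    text = re.sub(r"\(\s+", "(", text)
    text = re.sub(r"\s+\)", ")", text)
    # Capitalize a lowercase first letter when the text opens a sentence.
    if at_sentence_start and text[:1].islower():
        text = text[0].upper() + text[1:]
    return text


print(tidy_suggestion("  , however,, this ( is ) fine", at_sentence_start=True))
```

The fixes are applied in a fixed order so that a leading comma is removed before double commas are collapsed; a different order would be equally plausible for the real component.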

Warning

Add this component after the joiner component; otherwise it will not work.

Install

poetry add replacy_issue_boundary

or

pip install replacy_issue_boundary

Usage

import en_core_web_sm
from replacy import ReplaceMatcher
from replacy.db import load_json
from replacy_issue_boundary import IssueBoundary
from spacy.util import filter_spans


# Load the spaCy model and build the matcher from your match dict(s)
nlp = en_core_web_sm.load()
replaCy = ReplaceMatcher(nlp, load_json('path to match dict(s)'))
issue_boundary = IssueBoundary()
# Remove overlapping spans before the joiner runs
replaCy.add_pipe(name="span_filter", component=filter_spans, before="joiner")
# The issue boundary component must come after the joiner
replaCy.add_pipe(issue_boundary, name="issue_boundary", after="joiner")

Developing

This project uses GitHub Actions for CI/CD: linting and tests run on every PR. When a PR is merged into master, release-drafter drafts a new release based on the PR's commits and labels (e.g. a PR labeled feature and minor produces a minor version bump, with its changes listed under Features).

I can't figure out the automatic versioning bit... leaving it in a broken state for now :/
