
Anchorman takes a list of terms and a text. It finds the terms in this text and replaces them with another representation.


# Welcome to Anchorman


Turn your text into hypertext and enrich the content. Anchorman finds terms in text and replaces them with another representation.

The replacement is rule-based: each term is checked against the rules and is replaced only if it satisfies them.

    # How many items will be marked at all in the text.
    replaces_at_all: 5

    # Input term has to be exact match in text.
    case_sensitive: true
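
To make the two settings above concrete, here is a minimal sketch of how a global replacement limit and a case-sensitivity switch could interact. It is not Anchorman's implementation: `mark_terms` is a hypothetical helper, the square brackets stand in for the real replacement, and only the parameter names `replaces_at_all` and `case_sensitive` are taken from the configuration shown above.

```python
import re


def mark_terms(text, terms, replaces_at_all=5, case_sensitive=True):
    """Wrap at most `replaces_at_all` term occurrences in square brackets.

    Hypothetical helper for illustration only; not part of Anchorman.
    """
    if not terms or replaces_at_all <= 0:
        # re.sub treats count=0 as "unlimited", so handle the limit here.
        return text
    flags = 0 if case_sensitive else re.IGNORECASE
    # Longest terms first, so a longer term wins over a shorter prefix.
    alternatives = sorted(terms, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in alternatives) + r")\b",
        flags,
    )
    # `count` caps the total number of replacements across all terms.
    return pattern.sub(
        lambda m: "[" + m.group(0) + "]", text, count=replaces_at_all
    )


print(mark_terms("Fox and fox and FOX", ["fox"]))
print(mark_terms("Fox and fox and FOX", ["fox"],
                 replaces_at_all=2, case_sensitive=False))
```

With `case_sensitive` left on, only the exact spelling `fox` is marked; switching it off and lowering the limit marks the first two occurrences regardless of case.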

## Features

  • replacement rules
  • consider text units in the rules (e.g. paragraphs)
  • replace only n occurrences of the same item
  • specify restricted_areas for linking by tag: a, img
  • sort elements by value before applying them
  • return applied elements
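
The restricted-areas idea from the list above can be pictured with a small sketch built on Python's standard `html.parser`. This is an illustration under assumptions, not Anchorman's code: `RestrictedAreaMarker` and `mark_outside` are hypothetical names, and the `<b>` wrapper stands in for whatever replacement is configured.

```python
from html.parser import HTMLParser

# Elements without a closing tag must not open a restricted area.
VOID_TAGS = {"img", "br", "hr", "input", "meta", "link"}


class RestrictedAreaMarker(HTMLParser):
    """Re-emit HTML, marking a term only outside restricted elements.

    Hypothetical sketch: comments and doctype declarations are dropped.
    """

    def __init__(self, term, restricted_tags=("a",)):
        super().__init__(convert_charrefs=False)
        self.term = term
        self.restricted_tags = set(restricted_tags)
        self.depth = 0  # > 0 while inside a restricted element
        self.out = []

    def handle_starttag(self, tag, attrs):
        if tag in self.restricted_tags and tag not in VOID_TAGS:
            self.depth += 1
        # Start tags are copied verbatim, so attributes are never touched.
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in self.restricted_tags and self.depth > 0:
            self.depth -= 1
        self.out.append("</%s>" % tag)

    def handle_entityref(self, name):
        self.out.append("&%s;" % name)

    def handle_charref(self, name):
        self.out.append("&#%s;" % name)

    def handle_data(self, data):
        if self.depth == 0:
            data = data.replace(self.term, "<b>%s</b>" % self.term)
        self.out.append(data)


def mark_outside(html, term, restricted_tags=("a",)):
    parser = RestrictedAreaMarker(term, restricted_tags)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


print(mark_outside('The fox and <a href="/x">a fox</a>.', "fox"))
```

The term inside the existing link is left alone, which is the point of a restricted area: already linked text is not linked again.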

## Usage

    >>> from anchorman import annotate
    >>> text = 'The quick brown fox jumps over the lazy dog.'
    >>> elements = [{'fox': {'value': '/wiki/fox', 'data-type': 'animal'}}]
    >>> print(annotate(text, elements))
    The quick brown <a href="/wiki/fox" data-type="animal">fox</a> jumps over the lazy dog.
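
To show how an entry in `elements` relates to the markup in the output above, here is a small self-contained sketch. It is not taken from Anchorman: `render_anchor` is a hypothetical helper, and the mapping of the `value` key to `href` is an assumption read off the example output.

```python
from html import escape


def render_anchor(element):
    """Build an <a> tag from a single-key dict shaped like an `elements` entry.

    Hypothetical helper for illustration only; not part of Anchorman.
    """
    # Each element is expected to hold exactly one term.
    (term, attributes), = element.items()
    parts = []
    for name, value in attributes.items():
        # Assumption: the 'value' key becomes the link target.
        attr_name = "href" if name == "value" else name
        parts.append('%s="%s"' % (attr_name, escape(str(value), quote=True)))
    return "<a %s>%s</a>" % (" ".join(parts), escape(term))


print(render_anchor({"fox": {"value": "/wiki/fox", "data-type": "animal"}}))
```

All remaining keys, such as `data-type`, are passed through as attributes in the order they were given.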

## Installation

To install Anchorman, simply:

    pip install anchorman

## Credits and contributions

We published this on GitHub and PyPI to share our solution with you. Feedback and contributions are welcome.

Thanks to @tarnacious for inspiration and first steps.

## Todo

  • check whether positions already exist in the input, to save extra processing
  • html.parser vs lxml in bs4: benchmarks and drawbacks


Stay tuned.

Files for anchorman, version 0.5.3: anchorman-0.5.3.tar.gz (9.1 kB, source distribution)
