Anchorman takes a list of terms and a text. It finds the terms in this text and replaces them with another representation.
Project description
# Welcome to Anchorman
Turn your text into [hypertext](http://en.wikipedia.org/wiki/Hypertext) and enrich the content. Anchorman finds terms in text and replaces them with another representation.
The replacement is rule-based. Each term is checked against the rules and will be applied if valid.
# How many items will be marked at all in the text. replaces_at_all: 5
# Input term has to be exact match in text. case_sensitive: true
## Features
replacement rules
consider text units in the rules (e.g. paragraphs)
replace only n items of the same item
specify restricted_areas for linking by tag: a, img
sort elements by value before apply them
return applied elements
## Usage
>>> from anchorman import annotate >>> text = 'The quick brown fox jumps over the lazy dog.' >>> elements = [{'fox': {'value': '/wiki/fox', 'data-type': 'animal'}}] >>> print annotate(text, elements) 'The quick brown <a href="/wiki/fox" data-type="animal">fox</a> jumps over the lazy dog.'
## Installation
To install Anchorman, simply:
pip install anchorman
## Credits and contributions
We published this at github and pypi to provide our solution to you. Pleased for feedback and contributions.
Thanks [@tarnacious](https://github.com/tarnacious) for inspiration and first steps.
## Todo
check if position exist in input and save extra processing
html.parser vs lxml in bs4 - benchmarks and drawbacks
<img src=”https://raw.githubusercontent.com/rebeling/anchorman/master/docs/anchorman.png” width=”200”>
Stay tuned.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file anchorman-0.5.3.tar.gz
.
File metadata
- Download URL: anchorman-0.5.3.tar.gz
- Upload date:
- Size: 9.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 09844b1c8614c6173026d541fb77113178ef87acde37a4edd3f5facb2c0e2dd0 |
|
MD5 | 6d680560caa55703d1ea154a7b94e083 |
|
BLAKE2b-256 | 94dec8b5c361684bf827758ef325acf4a83f712a6d824a00660e02f0863f8b65 |