Skip to main content
Avatar for ktadmin from gravatar.com

ktadmin

Username    ktadminkt
Date joined   Joined

82 projects

pyprocessors-rf_consolidate

Last released

RFConsolidate annotations coming from different annotators

pyformatters-xml-rf

Last released

Groupe RF XML formatter

pyprocessors-afp_keywords

Last released

Processor based on AFP keywords extraction

pysegmenters-blingfire

Last released

Segmenter based on BlingFire

pyprocessors-iptc_mapper

Last released

Sherpa IPTC category mapper

pyconverters-whisperx

Last released

WhisperX converter for audio transcription with speaker diarization support.

pyconverters-cairn_xml

Last released

Cairn.info XML converter

pyprocessors-afp_entities

Last released

AFPEntities annotations coming from different annotators

pyannotators-entityfishing

Last released

Annotator based on entity-fishing

pyprocessors-segment_renseignor

Last released

Create segments from annotations based on Renseignor document structure

pyprocessors-deepl

Last released

DeepL processor plugin for pymultirole

pyconverters-openai_audio

Last released

OpenAIAudio converter

pysegmenters-syntok

Last released

syntok segmenter

pysegmenters-rules_segmenter

Last released

Rule-based segmenter

pysegmenters-pysdb

Last released

Rule-based segmenter

pysegmenters-md_splitter

Last released

Markdown splitter segmenter

pyprocessors-tag2segment

Last released

Create segments from annotations

pyprocessors-standoff2inline

Last released

Sherpa transform annotations to categories processor

pyprocessors-reconciliation

Last released

Sherpa reconciliation processor

pyprocessors-pseudonimizer

Last released

Processor based on Presidio anonymizer

pyprocessors-openai_completion

Last released

OpenAICompletion processor

pyprocessors-nameparser

Last released

Processor based on Nameparser

pyprocessors-document_fingerprint

Last released

Sherpa Consolidation processor

pyprocessors-chunk_sentences

Last released

Sherpa sentence chunking processor

pyprocessors-categories_from_annotations

Last released

Sherpa transform annotations to categories processor

pyprocessors-capitalizer

Last released

Replace document text with capitalized annotations

pyformatters-tabular

Last released

Tabular formatter for Sherpa

pyconverters-pyword

Last released

Convert DOCX to Markdown using mammoth

pyconverters-pypowerpoint

Last released

Convert PPTX to text using python-pptx

pyconverters-pyexcel

Last released

Convert XLSX to 1-segment per row document

pyconverters-pubmedfetcher

Last released

Fetch and convert Pubmed articles

pyconverters-paddleocr

Last released

Convert PDF to structured text using PaddleOCR

pyconverters-openai_vision

Last released

OpenAIVision converter

pyconverters-mistralocr

Last released

Convert PDF to structured text using MistralOCR

pyconverters-inscriptis

Last released

Convert HTML to text using inscriptis

pyannotators-spacyner

Last released

Annotator based on Spacy NER

pyannotators-spacymatcher

Last released

SpacyMatcher annotator using the spacy rule-matching engine

pyannotators-patterns

Last released

Annotator based on Presidio pattern recognizer

pyannotators-duckling

Last released

Annotator based on Facebook Duckling

pyannotators-acronyms

Last released

Annotator based on Facebook's Acronyms

pyconverters-grobid

Last released

Convert PDF to structured text using Grobid

pymultirole-plugins

Last released

Sherpa multirole plugins

pyconverters-newsml

Last released

NewsML converter (AFP news)

pyprocessors-opennre

Last released

Processor based on Huggingface transformers zero-shot classification pipeline

pyprocessors-afp_sports

Last released

Sherpa Consolidation processor

pyconverters-isako

Last released

Convert PDF to structured text using Isako

pyprocessors-consolidate

Last released

Sherpa Consolidation processor

pyprocessors-generative_augmenter

Last released

GenerativeazA$$ processor

pyprocessors-bel_entities

Last released

Sherpa Consolidation processor

pyprocessors-xcago_reconciliation

Last released

Sherpa xcago_reconciliation processor

pyformatters-bel_table

Last released

Sherpa BELTable formatter

pyconverters-xcago

Last released

X-CAGO converter.

pyformatters-afp_quality

Last released

Sherpa AFP Quality formatter

pyimporters-skos

Last released

Sherpa knowledge import plugins

pyimporters-json

Last released

Sherpa knowledge import plugins

pyimporters-csv

Last released

Sherpa knowledge import plugins

pyimporters-skos-rf

Last released

Sherpa knowledge import plugins

pyimporters-mesh

Last released

Sherpa knowledge import plugins

pyimporters-obo

Last released

Sherpa knowledge import plugins

pyimporters-plugins

Last released

Sherpa knowledge import plugins

pyconverters-ocrmypdf

Last released

Convert OCRized PDF to text using [OCRmyPDF]

pyprocessors-escrim_reconciliation

Last released

Sherpa ESCRIM processor

pyprocessors-rf_resegment

Last released

Sherpa sentence chunking processor

pyprocessors-restore_punctuation

Last released

Sherpa sentence chunking processor

pyprocessors-q_and_a

Last released

Processor based on Huggingface transformers Q&A pipeline

pyprocessors-readinggrid

Last released

Processor that generate a focussed reading-grid (keep only the sentences containing annotation)

pyprocessors-normalizer

Last released

Sherpa git initnormalizer

pyprocessors-mazars_table

Last released

Sherpa Mazars normalizer

pyformatters-textranksummarizer

Last released

Formatter/processor based on TextRank

pyformatters-summarizer

Last released

Formatter based on Huggingface transformers summarization pipeline

pyconverters-speech

Last released

Speech recognition converter based on Huggingface pipeline

pyconverters-deeptranscript

Last released

DeepTranscript converter.

pyannotators-trankitner

Last released

Annotator based on Trankit NER

pyannotators-trfclassifier

Last released

Classifier based on Huggingface Text Classification pipeline

pyannotators-zeroshotclassifier

Last released

Annotator based on Huggingface transformers zero-shot classification pipeline

pyprocessors-similar_segments

Last released

Similar segments processor

pyprocessors-search_segments

Last released

Similar segments processor

pyformatters-consolidate

Last released

Sherpa Consolidation formatter

pysegmenters-rules

Last released

Rule-based segmenter

pyprocessors-silero_te

Last released

text repunctuation and recapitalization for

pysegmenters-spacyrules

Last released

Segmenter based on Spacy

pyimporters-dummy

Last released

Sherpa knowledge import plugins

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page