Skip to main content
Avatar for ktadmin from gravatar.com

ktadmin

Username    ktadminkt
Date joined   Joined

85 projects

pyprocessors-glotlid

Last released

Sherpa Consolidation processor

pyprocessors-afp_entities

Last released

Sherpa Consolidation processor

pyconverters-inscriptis

Last released

Convert HTML to text using inscriptis

pyprocessors-chunk_sentences

Last released

Sherpa sentence chunking processor

pysegmenters-syntok

Last released

syntok segmenter

pysegmenters-rules_segmenter

Last released

Rule-based segmenter

pysegmenters-pysdb

Last released

Rule-based segmenter

pysegmenters-md_splitter

Last released

Markdown splitter segmenter

pysegmenters-blingfire

Last released

Segmenter based on BlingFire

pyprocessors-tag2segment

Last released

Create segments from annotations

pyconverters-mineru

Last released

Convert PDF to structured text using MinerU

pyprocessors-standoff2inline

Last released

Sherpa transform annotations to categories processor

pyprocessors-segment_renseignor

Last released

Create segments from annotations based on Renseignor document structure

pyprocessors-rf_consolidate

Last released

RFConsolidate annotations coming from different annotators

pyprocessors-reconciliation

Last released

Sherpa reconciliation processor

pyprocessors-pseudonimizer

Last released

Processor based on Presidio anonymizer

pyprocessors-openai_completion

Last released

OpenAICompletion processor

pyprocessors-nameparser

Last released

Processor based on Nameparser

pyprocessors-iptc_mapper

Last released

Sherpa IPTC category mapper

pyprocessors-document_fingerprint

Last released

Sherpa Consolidation processor

pyprocessors-deepl

Last released

DeepL processor plugin for pymultirole

pyprocessors-consolidate

Last released

Sherpa Consolidation processor

pyprocessors-categories_from_annotations

Last released

Sherpa transform annotations to categories processor

pyprocessors-afp_keywords

Last released

Processor based on AFP keywords extraction

pyprocessors-capitalizer

Last released

Replace document text with capitalized annotations

pyformatters-tabular

Last released

Tabular formatter for Sherpa

pyformatters-xml-rf

Last released

Groupe RF XML formatter

pyformatters-json

Last released

Json formatter for Sherpa

pyformatters-afp_quality

Last released

Sherpa AFP Quality formatter

pyconverters-whisperx

Last released

WhisperX converter for audio transcription with speaker diarization support.

pyconverters-pyword

Last released

Convert DOCX to Markdown using mammoth

pyconverters-pypowerpoint

Last released

Convert PPTX to text using python-pptx

pyconverters-pyexcel

Last released

Convert XLSX to 1-segment per row document

pyconverters-pubmedfetcher

Last released

Fetch and convert Pubmed articles

pyconverters-paddleocr

Last released

Convert PDF to structured text using PaddleOCR

pyconverters-openai_vision

Last released

OpenAIVision converter

pyconverters-openai_audio

Last released

OpenAIAudio converter

pyconverters-newsml

Last released

NewsML converter (AFP news)

pyconverters-mistralocr

Last released

Convert PDF to structured text using MistralOCR

pyconverters-grobid

Last released

Convert PDF to structured text using Grobid

pyconverters-cairn_xml

Last released

Cairn.info XML converter

pyannotators-spacyner

Last released

Annotator based on Spacy NER

pyannotators-spacymatcher

Last released

SpacyMatcher annotator using the spacy rule-matching engine

pyannotators-patterns

Last released

Annotator based on Presidio pattern recognizer

pyannotators-entityfishing

Last released

Annotator based on entity-fishing

pyannotators-duckling

Last released

Annotator based on Facebook Duckling

pyannotators-acronyms

Last released

Annotator based on Facebook's Acronyms

pymultirole-plugins

Last released

Sherpa multirole plugins

pyimporters-skos-rf

Last released

Sherpa SKOS-RF knowledge import plugin

pyimporters-skos

Last released

Sherpa SKOS knowledge import plugin

pyimporters-obo

Last released

Sherpa OBO knowledge import plugin

pyimporters-mesh

Last released

Sherpa MeSH knowledge import plugin

pyimporters-json

Last released

Sherpa JSON knowledge import plugin

pyimporters-csv

Last released

Sherpa CSV/Excel/TXT knowledge import plugin

pyimporters-plugins

Last released

Sherpa knowledge import plugins

pyprocessors-opennre

Last released

Processor based on Huggingface transformers zero-shot classification pipeline

pyprocessors-generative_augmenter

Last released

GenerativeazA$$ processor

pyformatters-bel_table

Last released

Sherpa BELTable formatter

pyprocessors-afp_sports

Last released

Sherpa Consolidation processor

pyprocessors-xcago_reconciliation

Last released

Sherpa xcago_reconciliation processor

pyprocessors-bel_entities

Last released

Sherpa Consolidation processor

pyconverters-isako

Last released

Convert PDF to structured text using Isako

pyconverters-xcago

Last released

X-CAGO converter.

pyconverters-ocrmypdf

Last released

Convert OCRized PDF to text using [OCRmyPDF]

pyprocessors-rf_resegment

Last released

Sherpa sentence chunking processor

pyprocessors-restore_punctuation

Last released

Sherpa sentence chunking processor

pyprocessors-q_and_a

Last released

Processor based on Huggingface transformers Q&A pipeline

pyprocessors-readinggrid

Last released

Processor that generate a focussed reading-grid (keep only the sentences containing annotation)

pyprocessors-normalizer

Last released

Sherpa git initnormalizer

pyprocessors-mazars_table

Last released

Sherpa Mazars normalizer

pyformatters-textranksummarizer

Last released

Formatter/processor based on TextRank

pyformatters-summarizer

Last released

Formatter based on Huggingface transformers summarization pipeline

pyconverters-speech

Last released

Speech recognition converter based on Huggingface pipeline

pyconverters-deeptranscript

Last released

DeepTranscript converter.

pyannotators-trankitner

Last released

Annotator based on Trankit NER

pyannotators-trfclassifier

Last released

Classifier based on Huggingface Text Classification pipeline

pyannotators-zeroshotclassifier

Last released

Annotator based on Huggingface transformers zero-shot classification pipeline

pyprocessors-similar_segments

Last released

Similar segments processor

pyprocessors-search_segments

Last released

Similar segments processor

pyformatters-consolidate

Last released

Sherpa Consolidation formatter

pysegmenters-rules

Last released

Rule-based segmenter

pyprocessors-silero_te

Last released

text repunctuation and recapitalization for

pysegmenters-spacyrules

Last released

Segmenter based on Spacy

pyimporters-dummy

Last released

Sherpa knowledge import plugins

1 archived project

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page