Format transformer tool for doccano

doccano-transformer

Doccano Transformer converts a dataset exported from doccano into the format expected by your machine learning library.

Supported formats

Doccano Transformer supports the following formats:

  • CoNLL 2003
  • spaCy

Install

Install doccano-transformer with pip:

pip install doccano-transformer
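
To confirm that the install succeeded, one option is to ask Python's standard `importlib.metadata` module for the installed version. This is a minimal sketch, not part of doccano-transformer itself; the helper name `installed_version` is hypothetical, and it needs Python 3.8 or later.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


# Prints the version string if the package is installed, otherwise None.
print(installed_version("doccano-transformer"))
```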

Examples

Named Entity Recognition

The following formats are supported:

  • CoNLL 2003
  • spaCy
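
from doccano_transformer.datasets import NERDataset
from doccano_transformer.utils import read_jsonl

# Load the JSONL file exported from doccano as an NER dataset.
dataset = read_jsonl(filepath='example.jsonl', dataset=NERDataset, encoding='utf-8')
# Each converter takes a tokenizer: a callable that splits a string into tokens.
dataset.to_conll2003(tokenizer=str.split)
dataset.to_spacy(tokenizer=str.split)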
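
To illustrate what such a conversion involves, the sketch below maps character-offset labels onto whitespace tokens and emits one token per line with a BIO tag. It is a standalone illustration and not the library's implementation: the record layout (a `text` field plus `labels` as `[start, end, label]` triples) is an assumption about the exported JSONL, the helper names `token_spans` and `to_bio_lines` are hypothetical, and the two-column output is a simplification, since real CoNLL 2003 files also carry part-of-speech and chunk columns.

```python
import json
from typing import Callable, List, Tuple


def token_spans(text: str, tokenizer: Callable[[str], List[str]]) -> List[Tuple[str, int, int]]:
    """Locate each token in the text and return (token, start, end) offsets."""
    spans = []
    cursor = 0
    for token in tokenizer(text):
        start = text.index(token, cursor)  # search from the end of the previous token
        end = start + len(token)
        spans.append((token, start, end))
        cursor = end
    return spans


def to_bio_lines(record: dict, tokenizer: Callable[[str], List[str]]) -> List[str]:
    """Tag each token B-/I-/O according to the character-offset labels."""
    lines = []
    for token, start, end in token_spans(record["text"], tokenizer):
        tag = "O"
        for label_start, label_end, label in record["labels"]:
            # A token inside a labeled span is B- if it opens the span, else I-.
            if start >= label_start and end <= label_end:
                tag = ("B-" if start == label_start else "I-") + label
                break
        lines.append(f"{token} {tag}")
    return lines


# Hypothetical record in the assumed export layout.
line = '{"text": "Barack Obama visited Paris", "labels": [[0, 12, "PER"], [21, 26, "LOC"]]}'
record = json.loads(line)
for row in to_bio_lines(record, str.split):
    print(row)
```

Passing the tokenizer as a callable, as the examples above do with `str.split`, keeps the tagging logic independent of how the text is split.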

Contribution

We encourage you to contribute to doccano transformer! Please check out the Contributing to doccano transformer guide for guidelines about how to proceed.

License

MIT

Download files

Source Distribution

doccano-transformer-1.0.2.tar.gz (28.2 kB)

Built Distribution

doccano_transformer-1.0.2-py3-none-any.whl (6.4 kB, Python 3)
