Skip to main content

Annotation meets Large Language Models.

Project description

doccano-mini

doccano-mini is a few-shot annotation tool to assist the development of applications with Large language models (LLMs). Once you annotate a few text, you can solve your task (e.g. text classification) with LLMs via LangChain.

At this time, the following tasks are supported:

  • Text classification
  • Question answering
  • Summarization
  • Paraphrasing
  • Named Entity Recognition
  • Task Free

Note: This is an experimental project.

Installation

pip install doccano-mini

Usage

For this example, we will be using OpenAI’s APIs, so we need to set the environment variable in the terminal.

export OPENAI_API_KEY="..."

Then, we can run the server.

doccano-mini

Now, we can open the browser and go to http://localhost:8501/ to see the interface.

Step1: Annotate a few text

In this step, we will annotate a few text. We can add a new text by clicking the + button. Try it out by double-clicking on any cell. You'll notice you can edit all cell values.

Step1

The editor also supports pasting in tabular data from Google Sheets, Excel, and many other similar tools.

Copy and Paste

Step2: Test your task

In this step, we will test your task. We can enter a new test to the text box and click the Predict button. Then, we can see the result of the test.

“Step2”

Step3: Download the config

In this step, we will download the LangChain's config. We can click the Download button to download it. After loading the config file, we can predict a label for the new text.

from langchain.chains import load_chain

chain = load_chain("chain.yaml")
chain.run("YOUR TEXT")

Development

poetry install
streamlit run doccano_mini/home.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

doccano_mini-0.0.10.tar.gz (12.9 kB view details)

Uploaded Source

Built Distribution

doccano_mini-0.0.10-py3-none-any.whl (20.7 kB view details)

Uploaded Python 3

File details

Details for the file doccano_mini-0.0.10.tar.gz.

File metadata

  • Download URL: doccano_mini-0.0.10.tar.gz
  • Upload date:
  • Size: 12.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for doccano_mini-0.0.10.tar.gz
Algorithm Hash digest
SHA256 4f56240e56b7e76afdaf6188c13de0d3ca8980328a14cb57448511f603e8da5f
MD5 a3fbe218f1ff14f7de02a74838fde02f
BLAKE2b-256 37d26a41f8761b5a961b269cd8aeffb43705f51a11e46cdd28ddcfdcfab8276a

See more details on using hashes here.

File details

Details for the file doccano_mini-0.0.10-py3-none-any.whl.

File metadata

File hashes

Hashes for doccano_mini-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 bc39c2e3ff44b1b7b3227dd7ef824c680030727809d8691774d7bfa2de49b500
MD5 408df380cf18b2df31bfe6125f1cebd0
BLAKE2b-256 b3c74c20efc74a155b90cb99b8decfa8c58815486359ffc6e6c2e8892e07ca08

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page