Skip to main content

Annotation meets Large Language Models.

Project description

doccano-mini

doccano-mini is a few-shot annotation tool to assist the development of applications with Large language models (LLMs). Once you annotate a few text, you can solve your task (e.g. text classification) with LLMs via LangChain.

At this time, the following tasks are supported:

  • Text classification
  • Question answering
  • Summarization
  • Paraphrasing
  • Named Entity Recognition
  • Task Free

Note: This is an experimental project.

Installation

pip install doccano-mini

Usage

For this example, we will be using OpenAI’s APIs, so we need to set the environment variable in the terminal.

export OPENAI_API_KEY="..."

Then, we can run the server.

doccano-mini

Now, we can open the browser and go to http://localhost:8501/ to see the interface.

Step1: Annotate a few text

In this step, we will annotate a few text. We can add a new text by clicking the + button. Try it out by double-clicking on any cell. You'll notice you can edit all cell values.

Step1

The editor also supports pasting in tabular data from Google Sheets, Excel, and many other similar tools.

Copy and Paste

Step2: Test your task

In this step, we will test your task. We can enter a new test to the text box and click the Predict button. Then, we can see the result of the test.

“Step2”

Step3: Download the config

In this step, we will download the LangChain's config. We can click the Download button to download it. After loading the config file, we can predict a label for the new text.

from langchain.chains import load_chain

chain = load_chain("chain.yaml")
chain.run("YOUR TEXT")

Development

poetry install
streamlit run doccano_mini/home.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

doccano_mini-0.0.10.tar.gz (12.9 kB view hashes)

Uploaded Source

Built Distribution

doccano_mini-0.0.10-py3-none-any.whl (20.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page