Annotation meets Large Language Models.
Project description
doccano-mini
doccano-mini is a few-shot annotation tool to assist the development of applications with Large language models (LLMs). Once you annotate a few text, you can solve your task (e.g. text classification) with LLMs via LangChain.
At this time, the following tasks are supported:
- Text classification
- Question answering
- Summarization
- Paraphrasing
- Named Entity Recognition
- Task Free
Note: This is an experimental project.
Installation
pip install doccano-mini
Usage
For this example, we will be using OpenAI’s APIs, so we need to set the environment variable in the terminal.
export OPENAI_API_KEY="..."
Then, we can run the server.
doccano-mini
Now, we can open the browser and go to http://localhost:8501/
to see the interface.
Step1: Annotate a few text
In this step, we will annotate a few text. We can add a new text by clicking the +
button. Try it out by double-clicking on any cell. You'll notice you can edit all cell values.
The editor also supports pasting in tabular data from Google Sheets, Excel, and many other similar tools.
Step2: Test your task
In this step, we will test your task. We can enter a new test to the text box and click the Predict
button. Then, we can see the result of the test.
Step3: Download the config
In this step, we will download the LangChain's config. We can click the Download
button to download it. After loading the config file, we can predict a label for the new text.
from langchain.chains import load_chain
chain = load_chain("chain.yaml")
chain.run("YOUR TEXT")
Development
poetry install
streamlit run doccano_mini/home.py
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for doccano_mini-0.0.10-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | bc39c2e3ff44b1b7b3227dd7ef824c680030727809d8691774d7bfa2de49b500 |
|
MD5 | 408df380cf18b2df31bfe6125f1cebd0 |
|
BLAKE2b-256 | b3c74c20efc74a155b90cb99b8decfa8c58815486359ffc6e6c2e8892e07ca08 |