Low-resource context relation sampler: extracts contexts with relations for fact-checking and fine-tuning your LLMs, powered by AREkit

Project description

arekit-ss 0.23.1

arekit-ss [AREkit double "s"] -- an extension for instant object-pair context sampling from the AREkit collection of data sources.

For custom text sampling, please see the ARElight project.

Mind the case (issue #18): switching to another language may change the amount of extracted data, because the terms_per_context parameter crops each context to a fixed number of words.
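
The effect can be illustrated with a toy sketch. It assumes that cropping simply keeps the first terms_per_context whitespace-separated words; the actual cropping strategy in AREkit may differ (for example, a window around the object pair). The helper names crop_terms and pair_survives are hypothetical and not part of arekit-ss.

```python
def crop_terms(text, terms_per_context):
    """Keep only the first `terms_per_context` whitespace-separated words.

    Assumption: a simplified stand-in for the real cropping logic.
    """
    return text.split()[:terms_per_context]


def pair_survives(text, source, target, terms_per_context):
    """Check whether both objects of the pair are still present after cropping."""
    kept = crop_terms(text, terms_per_context)
    return source in kept and target in kept


# The same sentence in two languages: the English version is one word longer.
ru = "Россия ввела санкции против США"
en = "Russia imposed sanctions against the USA"

print(pair_survives(ru, "Россия", "США", terms_per_context=5))  # True
print(pair_survives(en, "Russia", "USA", terms_per_context=5))  # False
```

With the same limit of 5 words, the pair is kept in the original sentence but lost in the translated one, so the number of extracted samples can change with the language.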

Installation

Install dependencies:

pip install git+https://github.com/nicolay-r/arekit-ss.git@0.23.1

Download the AREkit-related data required by the sources:

python -m arekit.download_data

Usage

Example of composing prompts:

python -m arekit_ss.sample --writer csv --source rusentrel --sampler prompt \
  --prompt "For text: '{text}', the attitude between '{s_val}' and '{t_val}' is: '{label_val}'" \
  --dest_lang en --docs_limit 1 --text_parser lm --output_dir 'out_rusentrel_prompt'
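
To show what the template in --prompt describes, here is a minimal sketch that fills the same placeholders with Python's str.format. It assumes that {s_val} and {t_val} stand for the source and target object values and {label_val} for the relation label, as their names suggest; how arekit-ss performs the substitution internally is not shown here. The function render_prompt and the sample values are hypothetical.

```python
# The template string is copied from the command above.
PROMPT = ("For text: '{text}', the attitude between "
          "'{s_val}' and '{t_val}' is: '{label_val}'")


def render_prompt(template, text, s_val, t_val, label_val):
    """Fill the named placeholders of the template for a single sample."""
    return template.format(text=text, s_val=s_val, t_val=t_val,
                           label_val=label_val)


# Hypothetical sample values, for illustration only.
sample = render_prompt(
    PROMPT,
    text="Russia imposed sanctions against the USA",
    s_val="Russia",
    t_val="USA",
    label_val="negative",
)
print(sample)
```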

Samplers

  • nn -- for CNN/LSTM architectures, including frame annotation from RuAttitudes.
  • bert -- BERT-based, single-input sequence.
  • prompt -- prompt-based sampler for ChatGPT and related conversational systems [prompt engineering guide]

Writers

Powered by

Download files

Source Distribution

arekit_ss-0.23.1.tar.gz (46.9 kB)

Built Distribution

arekit_ss-0.23.1-py3-none-any.whl (48.7 kB)
