Skip to main content

YiVal is an open-source project designed to revolutionize the way developers and researchers evaluate and refine AI models.

Project description

🧚🏻‍️ YiVal

Website · Producthunt · Documentation

⚡ Build any Generative AI application with evaluation and improvement ⚡

👉 Follow us: Twitter | Discord

Downloads License: MIT GitHub star chart Dependency Status Open Issues

🤔 What is YiVal?

YiVal is an GenAI-Ops framework that allows you to iteratively tune your Generative AI model metadata, params, prompts and retrieval configs all at once with your preferred choices of test dataset generation, evaluation algorithms and improvement strategies.

Check out our quickstart guide!

📣 What's Next?

Expected Features in Sep

  • Add ROUGE and BERTScore evaluators
  • Add support to midjourney
  • Add support to LLaMA2-70B, LLaMA2-7B, Falcon-40B,
  • Support LoRA fine-tune to open source models

🚀 Features

🔧 Experiment Mode: 🤖 Agent Mode (Auto-prompting):
Workflow Define your AI/ML application ➡️ Define test dataset ➡️ Evaluate 🔄 Improve ➡️ Prompt related artifacts built ✅ Define your AI/ML application ➡️ Auto-prompting ➡️ Prompt related artifacts built ✅
Features 🌟 Streamlined prompt development process
🌟 Support for multimedia and multimodel
🌟 Support CSV upload and GPT4 generated test data
🌟 Dashboard tracking latency, price and evaluator results
🌟 Human(RLHF) and algorithm based improvers
🌟 Service with detailed web view
🌟 Customizable evaluators and improvers
🌟 Non-code experience of Gen-AI application build
🌟 Witness your Gen-AI application born and improve with just one click
Demos - Animal story with MidJourney 🐯 Open In Colab
- Model Comparison in QA ability 🌟Open In Colab
- Startup Company Headline Generation Bot🔥 Open In Colab
- Automate Prompt Generation with Retrieval Methods🔥 Open In Colab

Model Support matrix

We support 100+ LLM ( gpt-4 , gpt-3.5-turbo , llama e.g.).

Different Model sources can be viewed as follow

Model llm-Evaluate Human-Evaluate Variation Generate Custom func
OpenAI
Azure
TogetherAI
Cohere
Huggingface
Anthropic
MidJourney

To support different models in custom func(e.g. Model Comparison) , follow our example

To support different models in evaluators and generators , check our config

Installation

pip install yival

Demo

Multi-model Mode

Yival has multimodal capabilities and can handle generated images in AIGC really well.

Find more information in the Animal story demo we provided.

yival run demo/configs/animal_story.yml

pic

Basic Interactive Mode

To get started with a demo for basic interactive mode of YiVal, run the following command:

yival demo --basic_interactive

Once started, navigate to the following address in your web browser:

http://127.0.0.1:8073/interactive

Click to view the screenshot

Screenshot 2023-08-17 at 10 55 31 PM

For more details on this demo, check out the Basic Interactive Mode Demo.

Question Answering with expected result evaluator

yival demo --qa_expected_results

Once started, navigate to the following address in your web browser: http://127.0.0.1:8073/

Click to view the screenshot Screenshot 2023-08-18 at 1 11 44 AM

For more details, check out the Question Answering with expected result evaluator.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yival-0.1.0a4.tar.gz (89.9 kB view hashes)

Uploaded Source

Built Distribution

yival-0.1.0a4-py3-none-any.whl (116.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page