The AIMon SDK that is used to interact with the AIMon API and the product.

Project description

🎉Welcome to AIMon Rely

AIMon Rely is a state-of-the-art system consisting of multiple models for detecting LLM quality issues during offline evaluations and continuous production monitoring. We offer hallucination metrics that is fast, reliable and cost-effective. We also support additional metrics such as completeness, conciseness and toxicity.

Read our blog post for more details.

✨ Join our community on Slack or reach out to us at info@aimon.ai to get your API key.

Metrics Supported

These are the quality metrics that are currently available via the API. Some of them are in progress and will be available in a future release.

Metric	Status
Model Hallucination (Passage and Sentence Level)	✓
Completeness	✓
Conciseness	✓
Toxicity	✓
Semantic Similarity	⌛
Sentiment	⌛
Coherence	⌛
Sensitive Data (PII/PHI/PCI)	⌛

Product

Follow these steps to use the product:

Step 1: Get access to the beta product by joining the wait list on our website or by requesting it on Slack or sending an email to info@aimon.ai
Step 2: Install the AIMon SDK by running pip install aimon in your terminal.
Step 3: Refer to the sample notebook for an example of how to instrument an LLM application using our SDK.

API

Steps to use the API:

Step 1: Get your API key by requesting it on our Slack or sending an email to info@aimon.ai
Step 2: You can try the API using either of these methods
- [OPTION 1] Try the simple langchain summarization application that is augmented with AIMon Rely to detect hallucinations at the sentence level.
  - Step 1: Run pip install -r examples/requirements.txt && pip install aimon
  - Step 2: Run streamlit run examples/langchain_summarization_app.py
- [OPTION 2] Download the Postman collection specified below to access the API
  - Model Hallucination (Passage and Sentence Level): Postman Collection

Sandbox

You can play with a Sandbox that is available on our website.

Benchmarks

Hallucination Detection

To demonstrate the effectiveness of our system, we benchmarked it against popular industry benchmarks for the hallucination detection task. The table below shows our results.

A few key takeaways:

✅ AIMon Rely is 10x cheaper than GPT-4 Turbo.

✅ AIMon Rely is 4x faster than GPT-4 Turbo.

✅ AIMon Rely provides the convenience of a fully hosted API that includes baked-in explainability.

✅ Support for a context length of up to 32,000 tokens (with plans to further expand this in the near future).

Overall, AIMon Rely is 10 times cheaper, 4 times faster and close to or even better than GPT-4 on the benchmarks making it a suitable choice for both offline and online detection of hallucinations.

Completeness, Conciseness Detection

There is a lack of industry standard benchmark datasets for these metrics. We will be publishing an evaluation dataset soon. Stay Tuned! ⌛

Pricing

Please reach out to info@aimon.ai for pricing details related to the product and the API.

Future Work

We are working on additional metrics as detailed in the table above.
In addition, we are working on something awesome to make the offline evaluation and continuous model quality monitoring experience more seamless.

Join our Slack for the latest updates and discussions on generative AI reliability.

Project details

Release history Release notifications | RSS feed

0.7.3

Sep 28, 2024

0.7.2

Sep 5, 2024

0.7.1

Aug 18, 2024

0.7.0

Aug 11, 2024

0.6.1

Aug 4, 2024

0.6.0

Jul 30, 2024

0.5.0

Jul 29, 2024

0.4.0

Jul 9, 2024

0.3.1

Jun 21, 2024

0.3.0

Jun 21, 2024

0.2.0

Jun 4, 2024

This version

0.1.0

May 22, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aimon-0.1.0.tar.gz (13.5 kB view hashes)

Uploaded May 22, 2024 Source

Built Distribution

aimon-0.1.0-py3-none-any.whl (15.3 kB view hashes)

Uploaded May 22, 2024 Python 3

Hashes for aimon-0.1.0.tar.gz

Hashes for aimon-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`660a56532b8e363cbcb127048812cdb47ee603a6c964ca8d702512546372d1ab`
MD5	`61bf813e94121d044019dec607538825`
BLAKE2b-256	`961c7153b2e9b20c800ebe8462abd9161017aec0784cbe2c7c34d59414c93454`

Hashes for aimon-0.1.0-py3-none-any.whl

Hashes for aimon-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`929577d28ffe7bcc489f81e43d6ffb94f7c631856b32a238da48016be6e1c3f4`
MD5	`0e0a734fdd2eff087e2b7f4badf36cff`
BLAKE2b-256	`3dd41441a2d25f321093ee2e3380bf31bbb3a24e91197d1cd31c44fac9bcdc6a`