CLI for Polvo - Test embedding models on your data

Polvo CLI

๐Ÿ™ Polvo CLI - Find the best embedding model for your data

A command-line interface for testing and evaluating embedding models on your datasets using the Polvo API.

Installation

From PyPI

pip install polvo-cli

Development Installation

git clone <repository-url>
cd polvo-cli
pip install -e .

Quick Start

  1. Check API health:

    polvo health
    
  2. List available models:

    polvo models
    
  3. Test models on your dataset:

    polvo test data.csv --model minilm --model mpnet
    

Commands

polvo test

Test embedding models on your dataset.

polvo test <file> [OPTIONS]

# Examples:
polvo test data.csv --model minilm --model mpnet --model openai-small
polvo test data.json --output csv > results.csv
polvo test data.txt --model minilm --api-url https://api.usepolvo.com

Arguments:

  • file: Dataset file (CSV, JSON, or TXT)

Options:

  • --model, -m: Model to test; repeat the flag to test several models (default: minilm, mpnet)
  • --column, -c: Column name for CSV files
  • --output, -o: Output format: table, json, csv (default: table)
  • --api-url: API URL (default: http://localhost:8000)
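When polvo test is driven from a script, the options above map onto an argument list. The following is a minimal Python sketch that uses only the flags documented in this section; the helper name build_test_command is hypothetical and is not part of polvo-cli.

```python
from typing import List, Optional, Sequence


def build_test_command(
    file: str,
    models: Sequence[str] = (),
    column: Optional[str] = None,
    output: Optional[str] = None,
    api_url: Optional[str] = None,
) -> List[str]:
    """Assemble an argument list for `polvo test` (hypothetical helper)."""
    cmd = ["polvo", "test", file]
    # --model may be repeated, once per model to evaluate.
    for model in models:
        cmd += ["--model", model]
    if column is not None:
        cmd += ["--column", column]
    if output is not None:
        # Only the three documented output formats are accepted here.
        if output not in ("table", "json", "csv"):
            raise ValueError(f"unsupported output format: {output}")
        cmd += ["--output", output]
    if api_url is not None:
        cmd += ["--api-url", api_url]
    return cmd


print(build_test_command("data.csv", models=["minilm", "mpnet"], output="json"))
```

Omitted options are left out of the argument list, so the CLI's own defaults apply.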

polvo models

List available embedding models.

polvo models [OPTIONS]

# Example:
polvo models --api-url https://api.usepolvo.com

Options:

  • --api-url: API URL

polvo health

Check API health status.

polvo health [OPTIONS]

# Example:
polvo health --api-url https://api.usepolvo.com

Options:

  • --api-url: API URL

polvo version

Show CLI version.

polvo version

Configuration

Environment Variables

Set the default API URL with the POLVO_API_URL environment variable:

export POLVO_API_URL="https://api.usepolvo.com"

Or create a .env file in your working directory:

POLVO_API_URL=https://api.usepolvo.com
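The sketch below illustrates one plausible way such a setting can be resolved. The precedence shown (an explicit --api-url value first, then POLVO_API_URL, then the documented default) is an assumption made for illustration, not a description of how polvo-cli is implemented, and resolve_api_url is a hypothetical name.

```python
import os
from typing import Mapping, Optional

# Default documented for the --api-url option.
DEFAULT_API_URL = "http://localhost:8000"


def resolve_api_url(
    flag_value: Optional[str] = None,
    environ: Optional[Mapping[str, str]] = None,
) -> str:
    """Pick an API URL.

    ASSUMED precedence: command-line value, then POLVO_API_URL, then default.
    """
    env = os.environ if environ is None else environ
    if flag_value:
        return flag_value
    # An unset or empty variable falls through to the default.
    return env.get("POLVO_API_URL") or DEFAULT_API_URL
```

Passing the environment as a parameter keeps the function easy to test without touching the real process environment.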

Output Formats

Table Format (Default)

Beautiful table output with recommendations:

โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”“
โ”ƒ                Embedding Model Evaluation Results                โ”ƒ
โ”กโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ฉ
โ”‚ Model   โ”‚ Retrieval โ”‚ Clustering โ”‚ Speed (ms) โ”‚ Cost/1K โ”‚ Dims โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚ minilm  โ”‚      0.85 โ”‚       0.72 โ”‚        120 โ”‚    Free โ”‚  384 โ”‚
โ”‚ mpnet   โ”‚      0.88 โ”‚       0.78 โ”‚        180 โ”‚    Free โ”‚  768 โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Recommendations:
  mpnet offers the best balance of quality and speed
  minilm is fastest for large-scale applications

Best model: mpnet

JSON Format

Structured JSON output for programmatic use:

polvo test data.csv --output json

CSV Format

CSV output for further analysis:

polvo test data.csv --output csv > results.csv
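Once results are exported, they can be loaded with Python's standard csv module. This is a minimal sketch: the README does not document the CSV header names, so the sample data below assumes they mirror the table headers (Model, Retrieval, Clustering), and both helper names are hypothetical.

```python
import csv
import io
from typing import Dict, List


def read_results(csv_text: str) -> List[Dict[str, str]]:
    """Parse CSV text into a list of row dictionaries keyed by header."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def best_by(rows: List[Dict[str, str]], column: str) -> Dict[str, str]:
    """Return the row with the highest numeric value in `column`."""
    return max(rows, key=lambda row: float(row[column]))


# Illustrative data only: header names are ASSUMED to match the table output.
sample = "Model,Retrieval,Clustering\nminilm,0.85,0.72\nmpnet,0.88,0.78\n"
print(best_by(read_results(sample), "Retrieval")["Model"])
```

The column to rank by is passed in by the caller, so the helpers do not depend on any particular header naming.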

Supported File Formats

  • CSV: Comma-separated values (specify column with --column)
  • JSON: JSON arrays or objects
  • TXT: Plain text files (one text per line)
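For a quick trial run, the three formats can be generated from the same list of texts. The sketch below is an assumption-laden example: the CSV column name text is hypothetical (pass whatever name you use to --column), the JSON file is written as a plain array of strings because the README does not specify a schema for JSON objects, and write_sample_datasets is not part of polvo-cli.

```python
import csv
import json
from pathlib import Path
from typing import List


def write_sample_datasets(texts: List[str], directory: str = ".") -> List[Path]:
    """Write the same texts as data.csv, data.json and data.txt."""
    out = Path(directory)
    out.mkdir(parents=True, exist_ok=True)

    # CSV: one header row, then one text per row ("text" is a hypothetical name).
    csv_path = out / "data.csv"
    with csv_path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["text"])
        writer.writerows([t] for t in texts)

    # JSON: a plain array of strings (ASSUMED shape).
    json_path = out / "data.json"
    json_path.write_text(json.dumps(texts, ensure_ascii=False, indent=2), encoding="utf-8")

    # TXT: one text per line, so whitespace inside a text is collapsed to spaces.
    txt_path = out / "data.txt"
    lines = [" ".join(t.split()) for t in texts]
    txt_path.write_text("\n".join(lines) + "\n", encoding="utf-8")

    return [csv_path, json_path, txt_path]
```

With these files in place, a CSV run would look like polvo test data.csv --column text.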

Error Handling

The CLI provides clear error messages and appropriate exit codes:

  • 0: Success
  • 1: General error (file not found, API error, etc.)
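A calling script can branch on these exit codes. The following is a minimal sketch using Python's subprocess module; run_cli is a hypothetical helper, and since the README does not say whether error messages go to stdout or stderr, both streams are captured.

```python
import subprocess
from typing import List, Tuple


def run_cli(args: List[str]) -> Tuple[bool, str, str]:
    """Run a command and return (succeeded, stdout, stderr).

    `succeeded` is True only when the process exits with code 0.
    """
    try:
        completed = subprocess.run(args, capture_output=True, text=True)
    except FileNotFoundError:
        # The executable itself is missing, e.g. the CLI is not installed.
        return False, "", f"command not found: {args[0]}"
    return completed.returncode == 0, completed.stdout, completed.stderr
```

For example, ok, out, err = run_cli(["polvo", "health"]) lets a script stop early when ok is False, before starting a longer polvo test run.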

Examples

Test Multiple Models on a Dataset

polvo test customer_reviews.csv \
  --model minilm \
  --model mpnet \
  --model openai-small \
  --output table

Export Results for Analysis

polvo test data.csv --model minilm --model mpnet --output csv > evaluation_results.csv

Check Remote API

polvo health --api-url https://api.usepolvo.com

Development

Running Tests

pytest tests/

Code Formatting

black src/
ruff check src/

Support

For issues and feature requests, please visit the GitHub repository.
