Skip to main content

Hierarchical document intelligence without vectors

Project description

Vectorless

Reasoning-native Document Intelligence Engine

PyPI PyPI Downloads Crates.io Crates.io Downloads Docs License

Vectorless is a reasoning-native document intelligence engine written in Rust — no vector database, no embeddings, no similarity search. It transforms documents into hierarchical semantic trees and uses LLMs to navigate the structure, retrieving the most relevant content through deep contextual understanding instead of vector math.

Quick Start

Install

pip install vectorless

Index and Query

import asyncio
from vectorless import Engine, IndexContext

async def main():
    # Create engine — api_key and model are required
    engine = Engine(
        workspace="./data",
        api_key="sk-...",
        model="gpt-4o",
    )

    # Index a document (PDF or Markdown)
    result = await engine.index(IndexContext.from_file("./report.pdf"))
    doc_id = result.doc_id

    # Query
    result = await engine.query(doc_id, "What is the total revenue?")
    print(result.single().content)

asyncio.run(main())
Rust
[dependencies]
vectorless = "0.1"
use vectorless::client::{EngineBuilder, IndexContext, QueryContext};

#[tokio::main]
async fn main() -> vectorless::Result<()> {
    let engine = EngineBuilder::new()
        .with_workspace("./data")
        .with_key("sk-...")
        .with_model("gpt-4o")
        .build()
        .await?;

    // Index
    let result = engine.index(IndexContext::from_path("./report.pdf")).await?;
    let doc_id = result.doc_id().unwrap();

    // Query
    let result = engine.query(
        QueryContext::new("What is the total revenue?").with_doc_id(doc_id)
    ).await?;
    println!("Answer: {}", result.content);

    Ok(())
}

Examples

See examples for more and stay tuned.

Contributing

Contributions welcome! If you find this useful, please ⭐ the repo — it helps others discover it.

Star History

Star History Chart

License

Apache License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vectorless-0.1.4.tar.gz (372.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vectorless-0.1.4-cp310-cp310-manylinux_2_34_x86_64.whl (4.9 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.34+ x86-64

File details

Details for the file vectorless-0.1.4.tar.gz.

File metadata

  • Download URL: vectorless-0.1.4.tar.gz
  • Upload date:
  • Size: 372.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.12.6

File hashes

Hashes for vectorless-0.1.4.tar.gz
Algorithm Hash digest
SHA256 75f1cce04569e464777849bdab0f57e9605e2fa53a8f5c241ad3a61054ea9291
MD5 a4061185119f37445ab2e819fc228d1f
BLAKE2b-256 0a2d512366c13cbf7a4b53c981e1e1f17623d46b3b2c8f977d997a59df78b0a3

See more details on using hashes here.

File details

Details for the file vectorless-0.1.4-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for vectorless-0.1.4-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 4a61a926b0ead4be501fe0c1c59fcf2c073bbe0a86cc3ba15f7a2a220bd6e227
MD5 65167ad45306c4acdbe7744d76b56ae3
BLAKE2b-256 b3e72c80ef533550fd722902d63f1e50f5b412d24e51a4d842591a61417a62e9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page