Skip to main content

Hierarchical document intelligence without vectors

Project description

Vectorless

Reasoning-native Document Intelligence Engine

PyPI PyPI Downloads Crates.io Crates.io Downloads Docs License

Vectorless is a reasoning-native document intelligence engine written in Rust — no vector database, no embeddings, no similarity search. It transforms documents into hierarchical semantic trees and uses LLMs to navigate the structure, retrieving the most relevant content through deep contextual understanding instead of vector math.

Quick Start

Install

pip install vectorless

Index and Query

import asyncio
from vectorless import Engine, IndexContext

async def main():
    # Create engine — api_key and model are required
    engine = Engine(
        workspace="./data",
        api_key="sk-...",
        model="gpt-4o",
    )

    # Index a document (PDF or Markdown)
    result = await engine.index(IndexContext.from_file("./report.pdf"))
    doc_id = result.doc_id

    # Query
    result = await engine.query(doc_id, "What is the total revenue?")
    print(result.single().content)

asyncio.run(main())
Rust
[dependencies]
vectorless = "0.1"
use vectorless::client::{EngineBuilder, IndexContext, QueryContext};

#[tokio::main]
async fn main() -> vectorless::Result<()> {
    let engine = EngineBuilder::new()
        .with_workspace("./data")
        .with_key("sk-...")
        .with_model("gpt-4o")
        .build()
        .await?;

    // Index
    let result = engine.index(IndexContext::from_path("./report.pdf")).await?;
    let doc_id = result.doc_id().unwrap();

    // Query
    let result = engine.query(
        QueryContext::new("What is the total revenue?").with_doc_id(doc_id)
    ).await?;
    println!("Answer: {}", result.content);

    Ok(())
}

Examples

See examples for more and stay tuned.

Contributing

Contributions welcome! If you find this useful, please ⭐ the repo — it helps others discover it.

Star History

Star History Chart

License

Apache License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vectorless-0.1.3.tar.gz (366.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vectorless-0.1.3-cp310-cp310-manylinux_2_34_x86_64.whl (4.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.34+ x86-64

File details

Details for the file vectorless-0.1.3.tar.gz.

File metadata

  • Download URL: vectorless-0.1.3.tar.gz
  • Upload date:
  • Size: 366.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.12.6

File hashes

Hashes for vectorless-0.1.3.tar.gz
Algorithm Hash digest
SHA256 7867ea8950b6bf9fc34b7861e48209c70180f4a2c448e1e05c4c1b56f1e7ad43
MD5 aee38278b1e8e5659856b0b50752a6e8
BLAKE2b-256 174ba68ca672f4358ef2fc5011ce457e96ec03ea8ffc63b49655484cb1f2a493

See more details on using hashes here.

File details

Details for the file vectorless-0.1.3-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for vectorless-0.1.3-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 fbde8f540cd720f82b3a7a0fc66ea22c4511081c9c4283b9fa47acef20babb0b
MD5 93f33a192e682c4192e405609e28a6e3
BLAKE2b-256 8651de6f94eb18351be7cbf88a2b884676b6c8c3815c9866bb086306af45cd24

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page