
Official Python SDK for Cerevox



Cerevox - The Data Layer 🧠 ⚡

Parse documents with enterprise-grade reliability
AI-powered • Highest Accuracy • Vector DB ready


Official Python SDK for Lexa - Parse documents into structured data

🎯 Perfect for: RAG applications, document analysis, data extraction, and vector database preparation

📦 Installation

pip install cerevox

📋 Requirements

🚀 Quick Start

Basic Usage

from cerevox import Lexa

# Parse a document
client = Lexa(api_key="your-api-key")
documents = client.parse(["document.pdf"])

print(f"Extracted {len(documents[0].content)} characters")
print(f"Found {len(documents[0].tables)} tables")

Async Processing (Recommended)

import asyncio
from cerevox import AsyncLexa

async def main():
    async with AsyncLexa(api_key="your-api-key") as client:
        documents = await client.parse(["document.pdf", "report.docx"])
        
        # Get chunks optimized for vector databases
        chunks = documents.get_all_text_chunks(target_size=500)
        print(f"Ready for embedding: {len(chunks)} chunks")

asyncio.run(main())
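
To make the target_size idea concrete, here is a minimal sketch of size-targeted chunking in plain Python. It is an illustration only and is not how Lexa builds its chunks: the function name chunk_text and the greedy paragraph-packing strategy are assumptions made for this example.

```python
def chunk_text(text: str, target_size: int = 500) -> list[str]:
    """Greedily pack paragraphs into chunks of at most target_size characters.

    Hypothetical helper for illustration; not part of the cerevox package.
    """
    chunks: list[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        # Try to append the paragraph to the chunk being built.
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= target_size:
            current = candidate
            continue
        # The paragraph does not fit: close the current chunk first.
        if current:
            chunks.append(current)
        # Hard-split a paragraph that exceeds the target on its own.
        while len(paragraph) > target_size:
            chunks.append(paragraph[:target_size])
            paragraph = paragraph[target_size:]
        current = paragraph
    if current:
        chunks.append(current)
    return chunks


sample = "a" * 300 + "\n\n" + "b" * 300 + "\n\n" + "c" * 100
pieces = chunk_text(sample, target_size=500)
print([len(piece) for piece in pieces])
```

Keeping paragraphs whole where possible tends to give embeddings more coherent context than cutting at fixed offsets, which is why the sketch only hard-splits paragraphs that exceed the target by themselves.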

✨ Features

🚀 Performance & Scale

  • 10x Faster than traditional solutions
  • Native Async Support with concurrent processing
  • Enterprise-grade reliability with automatic retries
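
The concurrency and retry ideas above can be sketched with the standard library alone. This is a generic pattern, not the SDK's internal implementation: with_retries, parse_many, and the simulated parser are hypothetical names, and the backoff values are arbitrary.

```python
import asyncio


async def with_retries(make_call, attempts: int = 3, base_delay: float = 0.01):
    """Await make_call(), retrying with exponential backoff on ConnectionError."""
    for attempt in range(attempts):
        try:
            return await make_call()
        except ConnectionError:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the error to the caller
            await asyncio.sleep(base_delay * 2 ** attempt)


async def parse_many(paths, parse_one, max_concurrency: int = 4):
    """Run parse_one over all paths concurrently, capped by a semaphore."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(path):
        async with semaphore:
            return await with_retries(lambda: parse_one(path))

    # gather returns results in the same order as the input paths.
    return await asyncio.gather(*(guarded(path) for path in paths))


def make_flaky_parser():
    """Stand-in for a real parse call: fails once per path, then succeeds."""
    seen = set()

    async def fake_parse(path):
        if path not in seen:
            seen.add(path)
            raise ConnectionError("simulated transient failure")
        return f"parsed:{path}"

    return fake_parse


results = asyncio.run(parse_many(["a.pdf", "b.docx"], make_flaky_parser()))
print(results)
```

The semaphore bounds how many requests are in flight at once, so a large batch does not open an unbounded number of connections.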

🧠 AI-Powered Extraction

  • SOTA Accuracy with cutting-edge ML models
  • Advanced Table Extraction preserving structure and formatting
  • 12+ File Formats including PDF, DOCX, PPTX, HTML, and more

🔗 Integration Ready

  • Vector Database Optimized chunks for RAG applications
  • 7+ Cloud Storage integrations (S3, SharePoint, Google Drive, etc.)
  • Framework Agnostic: works with Django, Flask, and FastAPI
  • Rich Metadata extraction including images, formatting, and structure

📋 Examples

Explore comprehensive examples in the examples/ directory:

Example                     Description
lexa_examples.py            Complete SDK functionality demonstration
vector_db_preparation.py    Vector database chunking and integration patterns
async_examples.py           Advanced async processing techniques
document_examples.py        Document analysis and manipulation features
cloud_integrations.py       Cloud storage service integrations

🚀 Run Examples

# Clone and explore
git clone https://github.com/CerevoxAI/cerevox-python.git
cd cerevox-python

export CEREVOX_API_KEY="your-api-key"

# Run demos
python examples/lexa_examples.py          # Basic usage
python examples/vector_db_preparation.py  # Vector DB integration
python examples/async_examples.py         # Async features
python examples/document_examples.py      # Document analysis
python examples/cloud_integrations.py     # Cloud Integrations Coming Soon!
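
In your own scripts, the exported key can be read from the environment instead of being hard-coded. The helper below is a small sketch: load_api_key is a hypothetical name, and whether the SDK picks up CEREVOX_API_KEY by itself is not stated here, so the key is read explicitly.

```python
import os


def load_api_key(env_var: str = "CEREVOX_API_KEY") -> str:
    """Return the API key from the environment, failing early if it is unset.

    Hypothetical helper for illustration; not part of the cerevox package.
    """
    api_key = os.environ.get(env_var, "").strip()
    if not api_key:
        raise RuntimeError(f"Set {env_var} before running this script")
    return api_key
```

The returned value can then be passed to the constructor shown in Quick Start, for example Lexa(api_key=load_api_key()).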

📚 Documentation

📖 Guides & Tutorials

🔗 External Resources

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🆘 Support & Community

📖 Resources

💬 Get Help

🐛 Issues


⭐ Star us on GitHub if Cerevox helped your project!
Made with ❤️ by the Cerevox team
Happy Parsing 🔍 ✨
