Official Python SDK for Cerevox
Project description
Cerevox - The Data Layer 🧠 ⚡
Parse documents with enterprise-grade reliability
AI-powered • Highest Accuracy • Vector DB ready
Official Python SDK for Lexa - Parse documents into structured data
🎯 Perfect for: RAG applications, document analysis, data extraction, and vector database preparation
📦 Installation
pip install cerevox
📋 Requirements
- Python 3.9+
- API key from Cerevox
🚀 Quick Start
Basic Usage
from cerevox import Lexa
# Parse a document
client = Lexa(api_key="your-api-key")
documents = client.parse(["document.pdf"])
print(f"Extracted {len(documents[0].content)} characters")
print(f"Found {len(documents[0].tables)} tables")
Async Processing (Recommended)
import asyncio
from cerevox import AsyncLexa
async def main():
async with AsyncLexa(api_key="your-api-key") as client:
documents = await client.parse(["document.pdf", "report.docx"])
# Get chunks optimized for vector databases
chunks = documents.get_all_text_chunks(target_size=500)
print(f"Ready for embedding: {len(chunks)} chunks")
asyncio.run(main())
✨ Features
🚀 Performance & Scale
- 10x Faster than traditional solutions
- Native Async Support with concurrent processing
- Enterprise-grade reliability with automatic retries
🧠 AI-Powered Extraction
- SOTA Accuracy with cutting-edge ML models
- Advanced Table Extraction preserving structure and formatting
- 12+ File Formats including PDF, DOCX, PPTX, HTML, and more
🔗 Integration Ready
- Vector Database Optimized chunks for RAG applications
- 7+ Cloud Storage integrations (S3, SharePoint, Google Drive, etc.)
- Framework Agnostic works with Django, Flask, FastAPI
- Rich Metadata extraction including images, formatting, and structure
📋 Examples
Explore comprehensive examples in the examples/ directory:
| Example | Description |
|---|---|
lexa_examples.py |
Complete SDK functionality demonstration |
vector_db_preparation.py |
Vector database chunking and integration patterns |
async_examples.py |
Advanced async processing techniques |
document_examples.py |
Document analysis and manipulation features |
cloud_integrations.py |
Cloud storage service integrations |
🚀 Run Examples
# Clone and explore
git clone https://github.com/CerevoxAI/cerevox-python.git
cd cerevox-python
export CEREVOX_API_KEY="your-api-key"
# Run demos
python examples/lexa_examples.py # Basic usage
python examples/vector_db_preparation.py # Vector DB integration
python examples/async_examples.py # Async features
python examples/document_examples.py # Document analysis
python examples/cloud_integrations.py # Cloud Integrations Coming Soon!
📚 Documentation
📖 Guides & Tutorials
- API Reference - Complete API documentation
- Vector Database Integration - RAG and vector DB setup
- Advanced Examples - Real-world usage patterns
- Migration Guide - Migrate from other tools
🔗 External Resources
- Full Documentation - Comprehensive guides
- Interactive API Docs - Try the API
- Discord Community - Get help and discuss
🤝 Contributing
We welcome contributions! Please see our Contributing Guide for details.
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🆘 Support & Community
|
📖 Resources |
💬 Get Help |
🐛 Issues |
⭐ Star us on GitHub if Cerevox helped your project!
Made with ❤️ by the Cerevox team
Happy Parsing 🔍 ✨
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cerevox-0.1.2.tar.gz.
File metadata
- Download URL: cerevox-0.1.2.tar.gz
- Upload date:
- Size: 50.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4d57b54a486efae4470f7b47d7306bac84abe72eaa83f535e76dadd732cc476e
|
|
| MD5 |
807ca5f90b35565f49a45f1cdd7acf5a
|
|
| BLAKE2b-256 |
6762d2528aa3748699102df40bff7ed2f2dd7295b42d4e3865ac7b0e7eddf6f0
|
File details
Details for the file cerevox-0.1.2-py3-none-any.whl.
File metadata
- Download URL: cerevox-0.1.2-py3-none-any.whl
- Upload date:
- Size: 47.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
84d0ff15ca7c63cfb7e02a6583ea7b08b3fa9aa55b0bce303496c064e789f619
|
|
| MD5 |
031c39d804727b138ec8c3278e5f915c
|
|
| BLAKE2b-256 |
579d10c7acc44232ab467807c547ec6ebfce455f32a934d14a07ff0334f626bc
|