Open-source tool for accurate & fast scientific literature data extraction with LLM and human-in-the-loop.
Project description
Extralit
Extract structured data from scientific literature with human validation
Extralit is an open-source platform that transforms how researchers extract structured data from scientific literature. Want to get started? Check out our documentation.
Why use Extralit?
Accelerate Scientific Data Collection
Manual data extraction from research papers is slow and error-prone, often taking 6-12 months for systematic reviews. Extralit combines AI-powered extraction with human validation to reduce this to weeks while maintaining research-grade accuracy.
Take Control of Your Research Data
Most scientific data extraction tools are inflexible black boxes. Extralit is different - it's open source and puts you in control. Define custom extraction schemas, validate results, and integrate with your existing research workflows.
Scale Your Literature Reviews
Whether you're conducting a systematic review, meta-analysis, or building a scientific knowledge base, Extralit helps you efficiently process hundreds of papers. Our platform handles complex tables, figures, and relationships while preserving scientific rigor.
🏘️ Community
We're an open-source project built for researchers, by researchers. Here's how to get involved:
- Slack Community: Connect with other researchers and developers
- Documentation: Learn how to use and contribute to Extralit
- Roadmap: See what we're building and share your ideas
Real-World Impact
Extralit is already accelerating research at leading institutions:
- Gates Foundation: Reduced systematic review time for malaria intervention studies from 6 months to 6 weeks
- Life Science Research: Streamlined extraction of clinical trial endpoints, genetic markers, and intervention protocols
- Meta-Analysis: Enabled rapid synthesis of evidence across hundreds of papers while maintaining rigorous validation
👨💻 Getting Started
Installation
Install Extralit using pip:
pip install extralit
Initialize the client:
import extralit as ex
client = ex.Extralit(
api_url="https://your-deployment-url",
api_key="your-api-key"
)
Create an extraction schema
Define what data you want to extract:
TBD
Add documents and start extraction
TBD
Need more help? Check out our detailed tutorials.
🥇 Contributors
Want to contribute? Great! Check out our contribution guide or join our Slack community.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file extralit-0.6.1.tar.gz.
File metadata
- Download URL: extralit-0.6.1.tar.gz
- Upload date:
- Size: 185.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.25.9 CPython/3.9.23 Linux/6.11.0-1018-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c527c8c578403aafd8b6d922b394375d43d75c2e44cce645ac5b1b5d5f20b818
|
|
| MD5 |
78138f8b966c4c0386dc2f622026e4a1
|
|
| BLAKE2b-256 |
0e56b4dec5cf67d28fee745b81d881e38795b7909d2014931b5666a4f9765d75
|
File details
Details for the file extralit-0.6.1-py3-none-any.whl.
File metadata
- Download URL: extralit-0.6.1-py3-none-any.whl
- Upload date:
- Size: 248.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.25.9 CPython/3.9.23 Linux/6.11.0-1018-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5ddb682fd706812504fa4a8a9b6bc6e4a8150b7573ded1ba2a733c66acb0eb84
|
|
| MD5 |
5bd5c8175db384eb5ab797dacbee493c
|
|
| BLAKE2b-256 |
77984893e9bc7eca1e2a9bfceddddc6f42e090e1a509ad45843d2fad9129124b
|