Skip to main content

Open-source tool for accurate & fast scientific literature data extraction with LLM and human-in-the-loop.

Project description

Extralit
Extralit

Extract structured data from scientific literature with human validation

CI Codecov Downloads

Extralit is an open-source platform that transforms how researchers extract structured data from scientific literature. Want to get started? Check out our documentation.

Why use Extralit?

Accelerate Scientific Data Collection

Manual data extraction from research papers is slow and error-prone, often taking 6-12 months for systematic reviews. Extralit combines AI-powered extraction with human validation to reduce this to weeks while maintaining research-grade accuracy.

Take Control of Your Research Data

Most scientific data extraction tools are inflexible black boxes. Extralit is different - it's open source and puts you in control. Define custom extraction schemas, validate results, and integrate with your existing research workflows.

Scale Your Literature Reviews

Whether you're conducting a systematic review, meta-analysis, or building a scientific knowledge base, Extralit helps you efficiently process hundreds of papers. Our platform handles complex tables, figures, and relationships while preserving scientific rigor.

🏘️ Community

We're an open-source project built for researchers, by researchers. Here's how to get involved:

  • Slack Community: Connect with other researchers and developers
  • Documentation: Learn how to use and contribute to Extralit
  • Roadmap: See what we're building and share your ideas

Real-World Impact

Extralit is already accelerating research at leading institutions:

  • Gates Foundation: Reduced systematic review time for malaria intervention studies from 6 months to 6 weeks
  • Life Science Research: Streamlined extraction of clinical trial endpoints, genetic markers, and intervention protocols
  • Meta-Analysis: Enabled rapid synthesis of evidence across hundreds of papers while maintaining rigorous validation

👨‍💻 Getting Started

Installation

Install Extralit using pip:

pip install extralit

Initialize the client:

import extralit as ex

client = ex.Extralit(
    api_url="https://your-deployment-url",
    api_key="your-api-key"
)

Create an extraction schema

Define what data you want to extract:

TBD

Add documents and start extraction

TBD

Need more help? Check out our detailed tutorials.

🥇 Contributors

Want to contribute? Great! Check out our contribution guide or join our Slack community.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extralit-0.6.1.tar.gz (185.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

extralit-0.6.1-py3-none-any.whl (248.7 kB view details)

Uploaded Python 3

File details

Details for the file extralit-0.6.1.tar.gz.

File metadata

  • Download URL: extralit-0.6.1.tar.gz
  • Upload date:
  • Size: 185.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.25.9 CPython/3.9.23 Linux/6.11.0-1018-azure

File hashes

Hashes for extralit-0.6.1.tar.gz
Algorithm Hash digest
SHA256 c527c8c578403aafd8b6d922b394375d43d75c2e44cce645ac5b1b5d5f20b818
MD5 78138f8b966c4c0386dc2f622026e4a1
BLAKE2b-256 0e56b4dec5cf67d28fee745b81d881e38795b7909d2014931b5666a4f9765d75

See more details on using hashes here.

File details

Details for the file extralit-0.6.1-py3-none-any.whl.

File metadata

  • Download URL: extralit-0.6.1-py3-none-any.whl
  • Upload date:
  • Size: 248.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.25.9 CPython/3.9.23 Linux/6.11.0-1018-azure

File hashes

Hashes for extralit-0.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5ddb682fd706812504fa4a8a9b6bc6e4a8150b7573ded1ba2a733c66acb0eb84
MD5 5bd5c8175db384eb5ab797dacbee493c
BLAKE2b-256 77984893e9bc7eca1e2a9bfceddddc6f42e090e1a509ad45843d2fad9129124b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page