Open-source tool for accurate, fast extraction of data from scientific literature, combining LLMs with human-in-the-loop validation.

Extralit

Extract structured data from scientific literature with human validation

Extralit is an open-source platform that transforms how researchers extract structured data from scientific literature. Want to get started? Check out our documentation.

Why use Extralit?

Accelerate Scientific Data Collection

Manual data extraction from research papers is slow and error-prone, often taking 6-12 months for systematic reviews. Extralit combines AI-powered extraction with human validation to reduce this to weeks while maintaining research-grade accuracy.

Take Control of Your Research Data

Most scientific data extraction tools are inflexible black boxes. Extralit is different: it is open source and puts you in control. Define custom extraction schemas, validate results, and integrate with your existing research workflows.

Scale Your Literature Reviews

Whether you're conducting a systematic review, meta-analysis, or building a scientific knowledge base, Extralit helps you efficiently process hundreds of papers. Our platform handles complex tables, figures, and relationships while preserving scientific rigor.

🏘️ Community

We're an open-source project built for researchers, by researchers. Here's how to get involved:

  • Slack Community: Connect with other researchers and developers
  • Documentation: Learn how to use and contribute to Extralit
  • Roadmap: See what we're building and share your ideas

Real-World Impact

Extralit is already accelerating research at leading institutions:

  • Gates Foundation: Reduced systematic review time for malaria intervention studies from 6 months to 6 weeks
  • Life Science Research: Streamlined extraction of clinical trial endpoints, genetic markers, and intervention protocols
  • Meta-Analysis: Enabled rapid synthesis of evidence across hundreds of papers while maintaining rigorous validation

👨‍💻 Getting Started

Installation

Install Extralit using pip:

pip install extralit

Initialize the client:

import extralit as ex

# Replace both placeholders with the values for your own deployment.
client = ex.Extralit(
    api_url="https://your-deployment-url",
    api_key="your-api-key",
)
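
If you would rather not hard-code credentials, one option is to read them from the environment before constructing the client. The sketch below is only an illustration: the helper name `client_settings_from_env` and the variable names `EXTRALIT_API_URL` and `EXTRALIT_API_KEY` are assumptions made for this example, not names confirmed by Extralit.

```python
import os
from typing import Dict


def client_settings_from_env() -> Dict[str, str]:
    """Collect connection settings from the environment, failing early if one is missing.

    The environment variable names are hypothetical, chosen for this example.
    """
    settings = {}
    for kwarg, env_name in (
        ("api_url", "EXTRALIT_API_URL"),
        ("api_key", "EXTRALIT_API_KEY"),
    ):
        value = os.environ.get(env_name)
        if not value:
            raise RuntimeError(f"Set the {env_name} environment variable")
        settings[kwarg] = value
    return settings


# Usage, with the package installed and both variables set:
#   import extralit as ex
#   client = ex.Extralit(**client_settings_from_env())
```

The returned dictionary uses the same keyword names as the client call shown above, so it can be unpacked directly into it.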

Create an extraction schema

Define what data you want to extract:

TBD
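
Until this section is filled in, the idea of an extraction schema can be illustrated with plain Python. The sketch below does not use Extralit's API: `FieldSpec`, `STUDY_SCHEMA`, `validate_record`, and the field names are all hypothetical. It only shows the general pattern of declaring the fields you expect and checking an extracted record against them, so that failures can be routed to a human reviewer.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class FieldSpec:
    """One field of a hypothetical extraction schema."""
    name: str
    dtype: type
    required: bool = True


# Illustrative schema for intervention studies; the field names are made up.
STUDY_SCHEMA: List[FieldSpec] = [
    FieldSpec("study_id", str),
    FieldSpec("intervention", str),
    FieldSpec("sample_size", int),
    FieldSpec("effect_size", float, required=False),
]


def validate_record(schema: List[FieldSpec], record: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the record passes."""
    problems = []
    for spec in schema:
        value = record.get(spec.name)
        if value is None:
            # Optional fields may be absent; required ones are reported.
            if spec.required:
                problems.append(f"missing required field: {spec.name}")
            continue
        if not isinstance(value, spec.dtype):
            problems.append(
                f"{spec.name}: expected {spec.dtype.__name__}, "
                f"got {type(value).__name__}"
            )
    # Report anything the schema does not know about.
    known = {spec.name for spec in schema}
    for name in sorted(set(record) - known):
        problems.append(f"unexpected field: {name}")
    return problems


extracted = {"study_id": "s1", "sample_size": "120"}
for problem in validate_record(STUDY_SCHEMA, extracted):
    print(problem)
```

A record that produces an empty list can be accepted automatically, while one with problems is a natural candidate for manual review.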

Add documents and start extraction

TBD

Need more help? Check out our detailed tutorials.

🥇 Contributors

Want to contribute? Great! Check out our contribution guide or join our Slack community.
