Extract structured claims from text

Claim Extractor

Extract LinkedClaims from text using LLMs.

Quick Start

from claim_extractor import ClaimExtractor

# Initialize
extractor = ClaimExtractor()

# Optionally, add extra system instructions and override the default message prompt
extractor = ClaimExtractor(
    extra_system_instructions="Only look for claims about islands",
    message_prompt=(
        "The following narrative may or may not have claims in it, include any "
        "claims about islands and especially trees on islands. Otherwise return "
        "empty array if not found. Here is the text {text}"
    ),
)

# Extract claims from text
text = "John Smith was the CEO of TechCorp from 2020 to 2023 and increased revenue by 40%."
claims = extractor.extract_claims(text)

# Returns:
# [
#   {
#     "subject": "urn:person:John_Smith",
#     "claim": "controlled",
#     "object": "urn:company:TechCorp",
#     "effectiveDate": 2020,
#     "statement": "John Smith was the CEO of TechCorp from 2020 to 2023",
#     "howKnown": "DOCUMENT",
#   },
#   {
#     "subject": "urn:person:John_Smith",
#     "claim": "impact:revenue",
#     "object": "urn:company:TechCorp",
#     "amt": 1.4,
#     "effectiveDate": 2023,
#     "statement": "John Smith increased revenue of TechCorp by 40% from 2020 to 2023"
#   }
# ]
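
The subject and object values in the sample output look like three-part identifiers of the form urn:<kind>:<name>. The following is a minimal sketch for splitting such an identifier into its parts. It assumes that layout holds for every claim, which the sample suggests but does not guarantee; parse_claim_urn and ParsedUrn are hypothetical helpers, not part of the package.

```python
from typing import NamedTuple


class ParsedUrn(NamedTuple):
    kind: str  # e.g. "person" or "company"
    name: str  # e.g. "John Smith"


def parse_claim_urn(urn: str) -> ParsedUrn:
    """Split a 'urn:<kind>:<name>' identifier into its parts.

    Assumes the three-part layout seen in the sample output;
    anything else raises ValueError.
    """
    prefix, _, rest = urn.partition(":")
    kind, _, name = rest.partition(":")
    if prefix != "urn" or not kind or not name:
        raise ValueError(f"unexpected identifier format: {urn!r}")
    # Underscores in the name are shown as spaces for display.
    return ParsedUrn(kind=kind, name=name.replace("_", " "))


# Values copied from the first claim in the sample output.
sample_claim = {
    "subject": "urn:person:John_Smith",
    "claim": "controlled",
    "object": "urn:company:TechCorp",
}

subject = parse_claim_urn(sample_claim["subject"])
obj = parse_claim_urn(sample_claim["object"])
print(f"{subject.name} ({subject.kind}) -> {sample_claim['claim']} -> {obj.name} ({obj.kind})")
# prints: John Smith (person) -> controlled -> TechCorp (company)
```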

Installation

From PyPI

pip install linked-claims-extractor

Development Setup

# Clone the repository
git clone https://github.com/Cooperation-org/linked-claims-extractor.git
cd linked-claims-extractor

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install in development mode
pip install -e .

# Install build tools (optional, for publishing)
pip install build twine

For publishing instructions, see PUBLISH.md.

Configuration

Set the environment variable:

export ANTHROPIC_API_KEY=your-key

Or create a .env file:

ANTHROPIC_API_KEY=your-key
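
A missing key usually surfaces only when the first request fails, so it can help to check for it before creating the extractor. The sketch below uses only the standard library. get_anthropic_key is a hypothetical helper, not part of the package, and it reads the process environment only: whether a .env file has already been loaded at that point is an assumption about your setup.

```python
import os
from typing import Mapping, Optional


def get_anthropic_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return ANTHROPIC_API_KEY from the given mapping (default: os.environ).

    Raises RuntimeError with a readable message when the key is
    missing or blank.
    """
    env = os.environ if env is None else env
    key = env.get("ANTHROPIC_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "ANTHROPIC_API_KEY is not set; export it or add it to a .env file"
        )
    return key


# Demonstration with an explicit mapping instead of the real environment.
print(get_anthropic_key({"ANTHROPIC_API_KEY": "your-key"}))
# prints: your-key
```

In a real script, calling get_anthropic_key() with no argument before ClaimExtractor() fails early with a clear message.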

Usage

from claim_extractor import ClaimExtractor

# Basic usage
extractor = ClaimExtractor()
claims = extractor.extract_claims("Your text here...")

# Extract from URL
claims = extractor.extract_claims_from_url("https://example.com/article")
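
Once claims are extracted, a common next step is to organize them. The sketch below groups claims by subject. It assumes the result is a list of dicts shaped like the Quick Start output, and it uses sample data copied from there so it runs without an API key; group_claims_by_subject is a hypothetical helper, not part of the package.

```python
from collections import defaultdict
from typing import Any, Dict, List

Claim = Dict[str, Any]


def group_claims_by_subject(claims: List[Claim]) -> Dict[str, List[Claim]]:
    """Group claim dicts by their "subject" value, preserving input order."""
    grouped: Dict[str, List[Claim]] = defaultdict(list)
    for claim in claims:
        # Claims without a subject are collected under a placeholder key.
        grouped[claim.get("subject", "unknown")].append(claim)
    return dict(grouped)


# Sample data copied from the Quick Start output, standing in for
# the result of extractor.extract_claims(...).
sample_claims: List[Claim] = [
    {
        "subject": "urn:person:John_Smith",
        "claim": "controlled",
        "object": "urn:company:TechCorp",
        "effectiveDate": 2020,
    },
    {
        "subject": "urn:person:John_Smith",
        "claim": "impact:revenue",
        "object": "urn:company:TechCorp",
        "amt": 1.4,
        "effectiveDate": 2023,
    },
]

for subject, subject_claims in group_claims_by_subject(sample_claims).items():
    labels = ", ".join(c["claim"] for c in subject_claims)
    print(f"{subject}: {labels}")
# prints: urn:person:John_Smith: controlled, impact:revenue
```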
