Skip to main content

An intelligent document analysis library for Python.

Project description

DocLM - Intelligent Document Analysis

DocLM is a Python library that uses advanced AI to analyze any PDF document and generate comprehensive summary reports with smart analytics.

Features

  • Multi-Format Support: Analyze PDF, DOCX, TXT, and more.
  • Smart Summarization: AI-powered summaries that capture the essence of your documents.
  • Detailed Analytics: Get key insights and statistics from your content.
  • Google Colab Ready: Seamlessly integrate with Google Colab for cloud-based analysis.

How to Get Your API Token

  1. Run the backend server: python app.py.
  2. Open your browser to http://127.0.0.1:5000.
  3. Click "Get Started" and fill out the signup form.
  4. Your unique API token will be displayed and emailed to you.

Installation

pip install DocLM

Quick Start

from DocLM import DocumentAnalyzer

# Initialize the analyzer with your token
analyzer = DocumentAnalyzer(api_token="your_token_here")

# Analyze any PDF document
# Make sure you have a file named 'document.pdf' in the same directory
try:
    report = analyzer.analyze("document.pdf")

    # Get the smart analytics report
    print("Summary:")
    print(report.summary)
    print("\nKey Insights:")
    for insight in report.key_insights:
        print(f"- {insight}")

except FileNotFoundError:
    print("Error: 'document.pdf' not found. Please create a sample PDF to analyze.")
except Exception as e:
    print(f"An error occurred: {e}")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

doclm-1.0.0.tar.gz (2.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

doclm-1.0.0-py3-none-any.whl (1.9 kB view details)

Uploaded Python 3

File details

Details for the file doclm-1.0.0.tar.gz.

File metadata

  • Download URL: doclm-1.0.0.tar.gz
  • Upload date:
  • Size: 2.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for doclm-1.0.0.tar.gz
Algorithm Hash digest
SHA256 8e055edb7d704fde684a0dbb497910b4f1bfc19bf14dd382ef9d24b06545690d
MD5 4e53c089522cf681ee491dd70f17637e
BLAKE2b-256 1dd46810109b791f31238600dd84924bac8d4397b4feb7141ec3076618c7641d

See more details on using hashes here.

File details

Details for the file doclm-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: doclm-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 1.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for doclm-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 436631c84d1d04ebf53326cc45abc5f3868758ae11496a5e181eab2c9dfc8ec2
MD5 97b81ee1e075c0ad15c5a9fa2fbd2337
BLAKE2b-256 b19ce3833fa308c380a6ad24e547c2546cd7e21eb4f283395aad8cfd7af80675

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page