Skip to main content

Python script to analyze the similarity of two PDFs

Project description

Bluewave

This is a Python script to analyze the similarity of two PDFs.

Usage

To run this, an example would be:

from bluewave.compare_pdfs import compare_pdf_files

filenames = ["file1.pdf", "file2.pdf"]
result = compare_pdf_files(filenames,
                           methods=False,
                           pretty_print=False,
                           verbose=True,
                           regen_cache=True,
                           sidecar_only=False,
                           no_importance=False)

To run with AWS connection

filenames = ["sample_files/sample_file_1.pdf", "sample_files/sample_file_2.pdf"]
aws_config = {
    "profile_name": "default",
    "pdf_bucket": "bluewave2023",
    "cache_bucket": "bluewave2023-cache",
    "pg_username": "postgres",
    "pg_password": "gizmo1228",
    "pg_host": "bluewave.chwv9x0hbowb.us-east-2.rds.amazonaws.com",
    "pg_port": 5432,
    "pg_db": "bluewave",
    "pg_table": "results"
}

result = compare_pdf_files(filenames,
                           methods=False,
                           pretty_print=False,
                           verbose=True,
                           regen_cache=True,
                           sidecar_only=False,
                           no_importance=False,
                           aws_config=aws_config)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bluewave-0.0.8.tar.gz (41.9 MB view details)

Uploaded Source

Built Distribution

bluewave-0.0.8-py3-none-any.whl (29.9 kB view details)

Uploaded Python 3

File details

Details for the file bluewave-0.0.8.tar.gz.

File metadata

  • Download URL: bluewave-0.0.8.tar.gz
  • Upload date:
  • Size: 41.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.9.6 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.10.1 urllib3/1.26.15 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.8

File hashes

Hashes for bluewave-0.0.8.tar.gz
Algorithm Hash digest
SHA256 4f4d7f4d26ea79a749db01c1d429228e81d9843f142a657078c2dc059883be81
MD5 8dfff504d0dbb921adbbebd512e8b9e8
BLAKE2b-256 81715765d0d9c1017ecc4708f40843affecfd937691471602aaf8b5a1715f465

See more details on using hashes here.

File details

Details for the file bluewave-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: bluewave-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 29.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.9.6 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.10.1 urllib3/1.26.15 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.8

File hashes

Hashes for bluewave-0.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 b3b418f01a9134a52171b26562a1baaf0a9b168728799e90151d3570d7070534
MD5 0d7e50471729b89131023c68cb609de1
BLAKE2b-256 ec12f97737ed42bcb3da762d7a8f6b55767e1ff8f84d2fff0de878d887b19536

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page