A unified library to load any document type effortlessly into LangChain for generative AI.

Project description

Unified Document Loader

A Python library that detects a file's type automatically and loads it with the matching LangChain document loader.

Installation

pip install docsuite

Usage

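from docsuite import UnifiedDocumentLoader
# Assumption: UnsupportedFileTypeError is exported by the docsuite package;
# adjust this import if the exception lives in a submodule.
from docsuite import UnsupportedFileTypeError

file_path = 'example.pdf'  # Replace with your file path
loader = UnifiedDocumentLoader(file_path)

try:
    # Detect the file type and load it with the matching LangChain loader
    documents = loader.load()
    print(documents)
except UnsupportedFileTypeError as e:
    # Raised when the file type is not supported
    print(e)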
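
Once loading succeeds, a quick preview of each document is often more useful than printing the whole list. The sketch below assumes that `loader.load()` returns LangChain-style document objects exposing `page_content` and `metadata`; the `Document` stand-in and the `preview` helper are hypothetical and are not part of docsuite.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    """Stand-in with the two attributes a LangChain Document exposes."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def preview(documents, width=60):
    """Return one short summary line per document (hypothetical helper)."""
    lines = []
    for index, doc in enumerate(documents):
        source = doc.metadata.get("source", "unknown")
        # Collapse runs of whitespace, then truncate to `width` characters.
        text = " ".join(doc.page_content.split())[:width]
        lines.append(f"[{index}] {source}: {text}")
    return lines


docs = [Document("Hello   world\nfrom page one", {"source": "example.pdf"})]
for line in preview(docs):
    print(line)
```

With a real loader, `docs` would be replaced by the list returned from `loader.load()`, provided the assumption about the returned objects holds.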

Features

- Automatically detects document types (e.g., TXT, PDF, CSV).
- Leverages LangChain's robust document loaders.
- Easy-to-use and extensible.
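
To illustrate what automatic type detection can look like, here is a minimal sketch that picks a loader by file extension. It is not docsuite's actual implementation: the `LOADER_NAMES` mapping, the `detect_loader_name` function, and the local `UnsupportedFileTypeError` class are assumptions made for this example. The loader names refer to classes in LangChain's community document loaders but are kept as plain strings so the sketch runs without LangChain installed.

```python
from pathlib import Path


class UnsupportedFileTypeError(ValueError):
    """Hypothetical stand-in for the exception used in the usage example."""


# Hypothetical mapping from file extension to a LangChain loader class name.
LOADER_NAMES = {
    ".txt": "TextLoader",
    ".pdf": "PyPDFLoader",
    ".csv": "CSVLoader",
}


def detect_loader_name(file_path: str) -> str:
    """Return the loader name for file_path based on its extension."""
    suffix = Path(file_path).suffix.lower()  # case-insensitive match
    try:
        return LOADER_NAMES[suffix]
    except KeyError:
        shown = suffix or "(no extension)"
        raise UnsupportedFileTypeError(f"Unsupported file type: {shown}") from None


print(detect_loader_name("example.pdf"))
```

Supporting a new format in a design like this only requires one more entry in the mapping, which is one way a loader of this kind can stay extensible.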

License

docsuite is released under the MIT License. You are free to use, modify, and distribute the code for both commercial and non-commercial purposes.


Download files

Source Distribution

docsuite-0.1.1.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

docsuite-0.1.1-py3-none-any.whl (4.2 kB view details)

Uploaded Python 3

File details

Details for the file docsuite-0.1.1.tar.gz.

File metadata

  • Download URL: docsuite-0.1.1.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.4

File hashes

Hashes for docsuite-0.1.1.tar.gz
Algorithm     Hash digest
SHA256        d9188360e6a7f7ab4b3832e45f07006ac610866b326f194c0337885629022c13
MD5           c76beaf399d7744210876d5bec3d7a41
BLAKE2b-256   2b2b66377a76d26e9092c051425fb0d5fb5d0b20ea3e32919b6fe0bc1ebeaf85

File details

Details for the file docsuite-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: docsuite-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 4.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.4

File hashes

Hashes for docsuite-0.1.1-py3-none-any.whl
Algorithm     Hash digest
SHA256        9a56b6fce437f57ef9fe2b1aa7be7501f8e6b32c4e6f037b12826ef884b98704
MD5           bfdb63bf7b72bfac9d3291a9901e0811
BLAKE2b-256   dcad31a4e936e4375131fa0a52178e324837e0e7a33de27d3ffaf1f2836f65cf
