Skip to main content

This package consumes one or more Spanish constitution PDFs and then processes them to generate embedding vectors. The vectors are generated with OpenAI service and PineCone is used to store and retrieve embedding vectors.

Project description

Document ingestor

This package consumes one or more spanish constitution pdf and then processes it to generate the embedding vectors. The vectors are generated with OpenAI service and PineCone is used to store and retrieve embedding vectors.

Requirements

pymupdf==1.23.8

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

document_ingestor-0.1.1.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

document_ingestor-0.1.1-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file document_ingestor-0.1.1.tar.gz.

File metadata

  • Download URL: document_ingestor-0.1.1.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.1

File hashes

Hashes for document_ingestor-0.1.1.tar.gz
Algorithm Hash digest
SHA256 116ea93b06e373caaf84fcce0139687b4be672b54de969f591da206e84585f75
MD5 861dbbc3de66b2afce1dac48c9c0592b
BLAKE2b-256 a46cd9964d55f2216afa74ac75cd5e9efc11ce6f203e83716edf1c0d88d5e5b5

See more details on using hashes here.

File details

Details for the file document_ingestor-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for document_ingestor-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e440985ff1e1aa1d0847716a356f79cfa5466573e6718898c7f5c685a43932de
MD5 ea646fc5cf734fdd355189b3c28d7a8a
BLAKE2b-256 fe39ebd5e8f3269b7bbc8dc704fa24e620765a7990a62ad89a92cf343bbff9a8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page