An integration package created by the company LOGYCA to interact with ChatGPT and analyze documents, files and other functionality of the OpenAI library.

These details have not been verified by PyPI

Project links

Homepage

Project description

LOGYCA public libraries

About us

LOGYCA public libraries: To interact with ChatGPT and analyze documents, files and other functionality of the OpenAI library.

Source code | Package (PyPI) | Samples

To interact with the examples, keep the following in mind

FastAPI example. Through Swagger, you can:

https://github.com/logyca/python-libraries/tree/main/logyca-ai/samples/fastapi_async
Use the example endpoints to obtain the input schemas for the post method and interact with the available parameters.
Endpoint publishing is asynchronous of openai SDK.
The model currently used is ChatGPT-4o, no other models have been tested so far.
Currently the formats supported to receive files and extract the text to interact with artificial intelligence are: txt, csv, pdf, images, Microsoft (docx, xlsx).

Script example. Through of code, you can:

https://github.com/logyca/python-libraries/tree/main/logyca-ai/samples/script_app_sync
Examples shared with the example written in FastAPI.
The examples use synchronous functionality of openai SDK.
The model used is ChatGPT-4o for testing.

Environment variables documentation for example: fastapi_async

The examples are built in the Microsoft Azure OpenAI environment, and the variables to use are the following:

.env.sample

# Environment variables documentation:

# API_KEY:
# The general API key used for authentication with services. This key is typically used for accessing cloud-based or other API-driven platforms. Replace '***' with the actual key.

# AZURE_OPENAI_DEPLOYMENT:
# The name or identifier of the OpenAI deployment within Azure. This defines the specific model version and configuration you are using in Azure OpenAI Service. Set this to the name of the deployed model, such as 'chatgpt3.5-turbo-1106'.

# AZURE_OPENAI_ENDPOINT:
# The base URL of the Azure OpenAI Service endpoint. This is the URL where API requests are sent, typically formatted like 'https://<your-endpoint>.openai.azure.com/'.

# AZURE_OPENAI_MODEL_NAME:
# The name of the specific OpenAI model being used in Azure, for example, 'gpt-35-turbo'. This identifies which model variant will be used for processing requests.

# AZURE_OPENAI_MODEL_VERSION:
# The version of the OpenAI model deployed in Azure. This typically reflects updates or optimizations to the model, such as '1106' to indicate a version from November 6th.

# OPENAI_API_KEY:
# The API key provided by OpenAI directly (not through Azure). This is used to authenticate and access OpenAI services outside of Azure.

# OPENAI_API_VERSION:
# The version of the OpenAI API being used. This specifies the version of the API and its capabilities, for example, '2023-03-15-preview'. It dictates the available features and request format.

API_KEY=***
AZURE_OPENAI_DEPLOYMENT=***
AZURE_OPENAI_ENDPOINT=***
AZURE_OPENAI_MODEL_NAME=***
AZURE_OPENAI_MODEL_VERSION=***
OPENAI_API_KEY=***
OPENAI_API_VERSION=***

# Example
# API_KEY=CUSTOM_ABC
# AZURE_OPENAI_DEPLOYMENT=chat4omni
# AZURE_OPENAI_ENDPOINT=azurenameforendpoint
# AZURE_OPENAI_MODEL_NAME=gpt-4o
# AZURE_OPENAI_MODEL_VERSION=2024-05-13
# OPENAI_API_KEY=AZURE_ABC
# OPENAI_API_VERSION=2024-07-01-preview

OCR engine to extract images.

Tesseract is an optical character recognition engine for various operating systems. It is free software, released under the Apache License. Originally developed by Hewlett-Packard as proprietary software in the 1980s, it was released as open source in 2005 and development was sponsored by Google in 2006

Install

(Source Code) https://tesseract-ocr.github.io/tessdoc/Downloads.html
(Windows Binaries) https://github.com/UB-Mannheim/tesseract/wiki
(Linux/Docker) apt-get -y install tesseract-ocr

Example for simple conversation.

{
  "system": "Voy a definirte tu personalidad, contexto y proposito.\nActua como un experto en venta de frutas.\nSe muy positivo.\nTrata a las personas de usted, nunca tutees sin importar como te escriban.",
  "messages": [
    {
      "additional_content": "",
      "type": "text",
      "user": "Dime 5 frutas amarillas"
    },
    {
      "assistant": "\nÂ¡Claro! AquÃ te van 5 frutas amarillas:\n\n1. PlÃ¡tano\n2. PiÃ±a\n3. Mango\n4. MelÃ³n\n5. Papaya\n"
    },
    {
      "additional_content": "",
      "type": "text",
      "user": "Dame los nombres en ingles."
    }
  ]
}

Example for image conversation.

Using public published URL for image

{
  "system": "Actua como una maquina lectora de imagenes.\nDevuelve la informaciÃ³n sin lenguaje natural, sÃ³lo responde lo que se estÃ¡ solicitando.\nEl dispositivo que va a interactuar contigo es una api, y necesita la informaciÃ³n sin markdown u otros caracteres especiales.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "https://raw.githubusercontent.com/logyca/python-libraries/main/logyca-ai/logyca_ai/assets_for_examples/file_or_documents/image.png",
        "image_format": "image_url",
        "image_resolution": "auto"
      },
      "type": "image_url",
      "user": "Extrae el texto que recibas en la imagen y devuelvelo en formato json."
    }
  ]
}

Using image content in base64

{
  "system": "Actua como una maquina lectora de imagenes.\nDevuelve la informaciÃ³n sin lenguaje natural, sÃ³lo responde lo que se estÃ¡ solicitando.\nEl dispositivo que va a interactuar contigo es una api, y necesita la informaciÃ³n sin markdown u otros caracteres especiales.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "<base64 image png content>",
        "image_format": "png",
        "image_resolution": "auto"
      },
      "type": "image_base64",
      "user": "Extrae el texto que recibas en la imagen y devuelvelo en formato json."
    }
  ]
}

Example for pdf conversation.

Using public published URL for pdf

{
  "system": "No uses lenguaje natural para la respuesta.\nDame la informaciÃ³n que puedas extraer de la imagen en formato JSON.\nSolo devuelve la informaciÃ³n, no formatees con caracteres adicionales la respuesta.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "https://raw.githubusercontent.com/logyca/python-libraries/main/logyca-ai/logyca_ai/assets_for_examples/file_or_documents/pdf.pdf",
        "pdf_format": "pdf_url"
      },
      "type": "pdf_url",
      "user": "Dame los siguientes datos: Expediente, radicaciÃ³n, Fecha, Numero de registro, Vigencia."
    }
  ]
}

Using pdf content in base64

{
  "system": "No uses lenguaje natural para la respuesta.\nDame la informaciÃ³n que puedas extraer de la imagen en formato JSON.\nSolo devuelve la informaciÃ³n, no formatees con caracteres adicionales la respuesta.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "<base64 pdf content>",
        "pdf_format": "pdf"
      },
      "type": "pdf_base64",
      "user": "Dame los siguientes datos: Expediente, radicaciÃ³n, Fecha, Numero de registro, Vigencia."
    }
  ]
}

Example for plain_text conversation.

Using public published URL for plain_text

{
  "system": "No uses lenguaje natural para la respuesta.\n                Dame la informaciÃ³n que puedas extraer en formato JSON.\n                Solo devuelve la informaciÃ³n, no formatees con caracteres adicionales la respuesta.\n                Te voy a enviar un texto que representa informaciÃ³n en formato csv.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "https://raw.githubusercontent.com/logyca/python-libraries/main/logyca-ai/logyca_ai/assets_for_examples/file_or_documents/plain_text.csv",
        "file_format": "plain_text_url"
      },
      "type": "plain_text_url",
      "user": "Dame los siguientes datos de la primera fila del documento: Expediente, radicaciÃ³n, Fecha, Numero de registro, Vigencia.\n                A partir de la fila 2 del documento, suma los valores de la columna Valores_A.\n                A partir de la fila 2 del documento, Suma los valores de la columna Valores_B."
    }
  ]
}

Using plain_text content in base64

{
  "system": "No uses lenguaje natural para la respuesta.\n                Dame la informaciÃ³n que puedas extraer en formato JSON.\n                Solo devuelve la informaciÃ³n, no formatees con caracteres adicionales la respuesta.\n                Te voy a enviar un texto que representa informaciÃ³n en formato csv.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "<base64 pdf content>",
        "file_format": "csv"
      },
      "type": "plain_text_base64",
      "user": "Dame los siguientes datos de la primera fila del documento: Expediente, radicaciÃ³n, Fecha, Numero de registro, Vigencia.\n                A partir de la fila 2 del documento, suma los valores de la columna Valores_A.\n                A partir de la fila 2 del documento, Suma los valores de la columna Valores_B."
    }
  ]
}

Example for Microsoft files conversation (Word, Excel).

Using public published URL for Excel file

{
  "system": "No uses lenguaje natural para la respuesta.\n                Dame la informaciÃ³n que puedas extraer de la imagen en formato JSON.\n                Solo devuelve la informaciÃ³n, no formatees con caracteres adicionales la respuesta.",
  "messages": [
    {
      "additional_content": {
        "base64_content_or_url": "https://raw.githubusercontent.com/logyca/python-libraries/main/logyca-ai/logyca_ai/assets_for_examples/file_or_documents/ms_excel.xlsx",
        "file_format": "ms_url"
      },
      "type": "ms_url",
      "user": "Dame los siguientes datos: Expediente, radicaciÃ³n, Fecha, Numero de registro, Vigencia."
    }
  ]
}

Using Excel file content in base64

{
    "system": "No uses lenguaje natural para la respuesta.\n                Dame la informaciÃ³n que puedas extraer de la imagen en formato JSON.\n                Solo devuelve la informaciÃ³n, no formatees con caracteres adicionales la respuesta.",
    "messages": [
      {
        "additional_content": {
          "base64_content_or_url": "<base64 pdf content>",
          "file_format": "xlsx"
        },
        "type": "ms_base64",
        "user": "Dame los siguientes datos: Expediente, radicaciÃ³n, Fecha, Numero de registro, Vigencia."
      }
    ]
}

Semantic Versioning

logyca_ai < MAJOR >.< MINOR >.< PATCH >

MAJOR: version when you make incompatible API changes
MINOR: version when you add functionality in a backwards compatible manner
PATCH: version when you make backwards compatible bug fixes

Definitions for releasing versions

https://peps.python.org/pep-0440/
- X.YaN (Alpha release): Identify and fix early-stage bugs. Not suitable for production use.
- X.YbN (Beta release): Stabilize and refine features. Address reported bugs. Prepare for official release.
- X.YrcN (Release candidate): Final version before official release. Assumes all major features are complete and stable. Recommended for testing in non-critical environments.
- X.Y (Final release/Stable/Production): Completed, stable version ready for use in production. Full release for public use.

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Types of changes

Added for new features.
Changed for changes in existing functionality.
Deprecated for soon-to-be removed features.
Removed for now removed features.
Fixed for any bug fixes.
Security in case of vulnerabilities.

[0.0.1aX] - 2024-08-02

Added

First tests using pypi.org in develop environment.

[0.1.0] - 2024-08-02

Added

Completion of testing and launch into production.

[0.1.1] - 2024-08-16

Added

The functions of extracting text from PDF files are refactored, using disk to optimize the use of ram memory and methods are added to extract text from images within the pages of the PDF files.

[0.2.0] - 2024-08-30

Added

New feature of attaching documents with txt, csv, docx, xlsx extension

[0.2.1] - 2024-09-16

Added

New tiktoken function to count tokens and check model capacity, returning if it meets the maximum_request_tokens requirements for both input and output.

Fixed

Extract excel files to output formats json, csv and list.

[0.2.2] - 2024-10-22

Added

New functionalities are added to extract images from documents in base64 lists: extract_images_from_pdf_file, extract_images_from_docx_file
The Swagger documentation is improved in the FastAPI example, adding the parameter: just_extract_images to the POST method to use the new document image extraction features.

[0.2.3] - 2024-10-31

Added

new functionality when extracting text in Excel, you can select only extraction of visible sheets or all sheets.

[0.2.4] - 2024-11-01

Fixed

Minimum adjustment when extracting images from an Excel file, leaving the extension in lowercase in the result.

[0.2.5] - 2024-11-22

Fixed

init.py: Adjustment to which items will be available when the package is imported

[0.2.6] - 2024-12-02

Added

Read pdf files from disk or ram memory

[0.2.7,0.2.8] - 2024-12-18

Fixed

Improve prompt engineer for data extraction in Excel, specifying the JSON spreadsheet format used in data extraction.

[0.2.9] - 2025-02-05

Fixed

When extracting images from PDF and Microsoft docx,xlsx documents, there are unsupported image formats such as WMF, these images are skipped.

[0.2.10] - 2025-02-05

Fixed

Due to so many restrictions on the part of openai, due to the rate limits, a message is created to return the request http error status_code 429 for this reason.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.2.10

Feb 5, 2025

0.2.9

Feb 5, 2025

0.2.8

Dec 18, 2024

0.2.7

Dec 18, 2024

0.2.6

Dec 3, 2024

0.2.6rc1 pre-release

Dec 2, 2024

0.2.5

Nov 22, 2024

0.2.5rc2 pre-release

Nov 22, 2024

0.2.5rc1 pre-release

Nov 22, 2024

0.2.4

Nov 1, 2024

0.2.3

Oct 31, 2024

0.2.3rc1 pre-release

Oct 31, 2024

0.2.2

Oct 22, 2024

0.2.2rc1 pre-release

Oct 22, 2024

0.2.1

Sep 16, 2024

0.2.1rc2 pre-release

Sep 16, 2024

0.2.1rc1 pre-release

Sep 16, 2024

0.2.0

Aug 30, 2024

0.2.0rc1 pre-release

Aug 30, 2024

0.2.0a7 pre-release

Aug 30, 2024

0.2.0a6 pre-release

Aug 23, 2024

0.2.0a5 pre-release

Aug 23, 2024

0.2.0a4 pre-release

Aug 23, 2024

0.2.0a3 pre-release

Aug 23, 2024

0.2.0a2 pre-release

Aug 23, 2024

0.2.0a1 pre-release

Aug 23, 2024

0.1.1

Aug 17, 2024

0.1.1rc4 pre-release

Aug 16, 2024

0.1.1rc3 pre-release

Aug 16, 2024

0.1.1rc2 pre-release

Aug 16, 2024

0.1.1rc1 pre-release

Aug 16, 2024

0.1.0

Aug 2, 2024

0.1.0rc2 pre-release

Aug 2, 2024

0.1.0rc1 pre-release

Aug 2, 2024

0.0.1b2 pre-release

Aug 2, 2024

0.0.1b1 pre-release

Aug 2, 2024

0.0.1a9 pre-release

Aug 2, 2024

0.0.1a8 pre-release

Aug 2, 2024

0.0.1a7 pre-release

Aug 2, 2024

0.0.1a6 pre-release

Aug 2, 2024

0.0.1a5 pre-release

Aug 2, 2024

0.0.1a4 pre-release

Aug 2, 2024

0.0.1a3 pre-release

Aug 2, 2024

0.0.1a2 pre-release

Aug 2, 2024

0.0.1a1 pre-release

Aug 2, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

logyca_ai-0.2.10.tar.gz (1.2 MB view details)

Uploaded Feb 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

logyca_ai-0.2.10-py3-none-any.whl (1.2 MB view details)

Uploaded Feb 5, 2025 Python 3

File details

Details for the file logyca_ai-0.2.10.tar.gz.

File metadata

Download URL: logyca_ai-0.2.10.tar.gz
Upload date: Feb 5, 2025
Size: 1.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.11.9

File hashes

Hashes for logyca_ai-0.2.10.tar.gz
Algorithm	Hash digest
SHA256	`9f57dc627fe773e8cbfce8b41a0b1029c729a3f60c79026a3ef0379e1b7d25f1`
MD5	`dde8fe519d50e917158659731a0eb80a`
BLAKE2b-256	`dd9fc1097bb6a863cc6d9fc3e996a7574b0807de824483229ea49df28ae2b364`

See more details on using hashes here.

File details

Details for the file logyca_ai-0.2.10-py3-none-any.whl.

File metadata

Download URL: logyca_ai-0.2.10-py3-none-any.whl
Upload date: Feb 5, 2025
Size: 1.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.11.9

File hashes

Hashes for logyca_ai-0.2.10-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d7beaa6b5f252dd2336aded3ec12ebd6473f3b4bbc706342de4306a8ad96415b`
MD5	`6c897ae963cc917e81967b1ed5581443`
BLAKE2b-256	`b1b2a86eda59e1c8e026236a453e96e79c37928bdc22c830dd2d14282dfb2c65`

See more details on using hashes here.

logyca-ai 0.2.10

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

About us

LOGYCA public libraries: To interact with ChatGPT and analyze documents, files and other functionality of the OpenAI library.

To interact with the examples, keep the following in mind

Environment variables documentation for example: fastapi_async

OCR engine to extract images.

Install

Example for simple conversation.

Example for image conversation.

Using public published URL for image

Using image content in base64

Example for pdf conversation.

Using public published URL for pdf

Using pdf content in base64

Example for plain_text conversation.

Using public published URL for plain_text

Using plain_text content in base64

Example for Microsoft files conversation (Word, Excel).

Using public published URL for Excel file

Using Excel file content in base64

Semantic Versioning

Definitions for releasing versions

Changelog

Types of changes

[0.0.1aX] - 2024-08-02

Added

[0.1.0] - 2024-08-02

Added

[0.1.1] - 2024-08-16

Added

[0.2.0] - 2024-08-30

Added

[0.2.1] - 2024-09-16

Added

Fixed

[0.2.2] - 2024-10-22

Added

[0.2.3] - 2024-10-31

Added

[0.2.4] - 2024-11-01

Fixed

[0.2.5] - 2024-11-22

Fixed

[0.2.6] - 2024-12-02

Added

[0.2.7,0.2.8] - 2024-12-18

Fixed

[0.2.9] - 2025-02-05

Fixed

[0.2.10] - 2025-02-05

Fixed

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes