etoile_pixtral_description

etoile_pixtral_description is a Python package that extracts detailed descriptions from images using a vision-capable language model (Mistral's Pixtral in the examples below). It focuses on concrete descriptions of the visible content on web pages and other digital media, which makes it useful for content analysis, accessibility improvements, and SEO work.

Installation

To install etoile_pixtral_description, use pip:

pip install etoile_pixtral_description

Usage

The package provides a function describe_image that generates a detailed description of the content found within an image. It uses the ScreenContentSimplified model for structured output.

Setting Up the Model

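from langchain_mistralai import ChatMistralAI

# Pixtral is Mistral's vision-capable model; this is the model ID used in this README.
CVISION_MODEL = "pixtral-12b-2409"

llm = ChatMistralAI(
    model=CVISION_MODEL,
    temperature=0,  # deterministic output
    max_retries=2,  # retry failed API calls up to twice
)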
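
ChatMistralAI reads its API key from the MISTRAL_API_KEY environment variable when no key is passed explicitly. The following is a minimal sketch of a pre-flight check, assuming that variable is how you supply credentials. `require_api_key` is a hypothetical helper for illustration and is not part of this package.

```python
import os


def require_api_key(env=None, name="MISTRAL_API_KEY"):
    """Return the API key from the environment, or raise a clear error.

    Hypothetical helper: `env` defaults to os.environ and can be replaced
    with a plain dict for testing.
    """
    env = os.environ if env is None else env
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(f"Set {name} before creating the ChatMistralAI client.")
    return key
```

Calling `require_api_key()` before constructing `llm` turns a missing key into an immediate, readable error rather than a failure on the first request.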

Describing an Image

from etoile_pixtral_description import describe_image

# `screenshot_link` should be a string containing the URL of the image you want to describe.
screenshot_link = "https://example.com/path/to/screenshot.jpg"
result = describe_image(llm, screenshot_link, verbose=True)
print(result)
# `result` is assumed to expose a `description` property holding the detailed description.

This function returns a structured response with a detailed description of each content characteristic on the screen, which is useful for various applications, including digital asset management and accessibility compliance.
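
Because the example above only assumes that the result exposes a `description` property, calling code may want to read it defensively. The following is a minimal sketch under that assumption. `extract_description` and `describe_many` are hypothetical helpers, not part of the package, and `describe` stands for any callable with the same shape as `describe_image(llm, link)`.

```python
def extract_description(result):
    """Return the `description` of a result, falling back to str(result).

    Assumption: the result is either an object with a `description`
    attribute or a dict with a "description" key.
    """
    description = getattr(result, "description", None)
    if description is None and isinstance(result, dict):
        description = result.get("description")
    return str(result) if description is None else str(description)


def describe_many(describe, llm, links):
    """Call describe(llm, link) for each link.

    Returns a dict mapping each link to its description, or to None
    when the call for that link raised an exception.
    """
    descriptions = {}
    for link in links:
        try:
            descriptions[link] = extract_description(describe(llm, link))
        except Exception as exc:  # keep going if one image fails
            print(f"Skipping {link}: {exc}")
            descriptions[link] = None
    return descriptions
```

With the setup above, this could be used as `describe_many(describe_image, llm, [screenshot_link])`. Passing the describe function in as an argument keeps the helper easy to test with a stub in place of a real API call.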

Features

  • Provides detailed, concrete descriptions of images to support understanding and analysis.
  • Uses a vision-capable language model (Pixtral through ChatMistralAI in the example above) to interpret and describe visual content.
  • Covers both the semantics of textual content and the aesthetics of visual elements.

Contributing

Contributions, issues, and feature requests are welcome! Feel free to check the issues page.

License

etoile_pixtral_description is licensed under the MIT License.

Acknowledgements

This package was developed by Evgenii (Eugene) Evstafev as part of the Mistral Étoile suite during the London Hackathon. It aims to provide tools that assist in the detailed analysis of digital content for various professional fields.

