Analyze video/image with machine learning methods, exif data, and other file based information.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Media Analyzer

Media Analyzer is a Python library designed to analyze media files, providing insights into their content and metadata. It supports various functionalities, including image classification, captioning, optical character recognition (OCR), and facial recognition.

Features

GPS: Gather GPS coordinates from exif, and reverse geocode to get the country, province and city.
Exif: Extract exif data/metadata from photos, videos, gifs, etc.
Weather: Get weather info at the time the photo/video was taken.
- temperature
- dewpoint
- relative humidity
- precipitation
- wind gust
- pressure
- sun hours
- condition

Quality Detection: Detect objectively measurable quality of images:
- Sharpness
- Noise
- Exposure
- Dynamic range
- Color clipping (white/black levels)
- A composite score is generated, so photos can be ranked by quality.

Image Classification: Identify objects, scene type, activities, animals, and events present in images.
Image Captioning: Generate descriptive captions for images using models like BLIP or LLM-based captioners.
Embedding: Generate clip embeddings for images and text. Can be used to cluster images or search semantically through images.
Optical Character Recognition (OCR): Extract text from images to identify documents, receipts, menus, and more.
LLM: Get detailed image summary, more indepth than just a caption, using an LLM. Can also be used to generate a summary of a document shown in a photo or video.
Facial Recognition: Detect faces in images and provide details such as age, sex, bounding box, and facial landmarks. Includes an embedding of the face, which can be used for clustering.
Datetime Taken: Photo and video files are messy and have unreliable datetime tags. This packages uses six different methods with varying priority to get the datetime a photo is taken, including the timezone if possible.
Data Url: Generate data url for tiny preload thumbnail.

Installation

To install Media Analyzer, use pip:

pip install media-analyzer

Requirements

You must have the following in PATH.

ExifTool: https://exiftool.org/
Tesseract OCR: https://tesseract-ocr.github.io/tessdoc/Installation.html

Examples

Example output of the main analyze function can be viewed at example_output.json. Further example code is available at /examples.

Usage

Here's a basic example of how to use Media Analyzer:

from media_analyzer import MediaAnalyzer
from pathlib import Path

analyzer = MediaAnalyzer()
media_file = Path("image.jpg")
result = analyzer.photo(media_file)

print(result)

Disable analysis modules

The analysis is done based on modules, the following modules are available and enabled by default (click the links to see the possible modules).

File-based Modules

Visual Modules

Modules can be turned off by changing the config provided to the MediaAnalyzer class:

from media_analyzer import MediaAnalyzer, AnalyzerSettings, FileModule, VisualModule
from pathlib import Path

config = AnalyzerSettings(
    enabled_file_modules={FileModule.EXIF},  # Only do exif data analysis on file
    enabled_visual_modules={VisualModule.CAPTION},  # Only do caption module as visual module
)
analyzer = MediaAnalyzer(config=config)
media_file = Path(__file__).parents[1] / "tests/assets/tent.jpg"
result = analyzer.photo(media_file)

Configuration

The AnalyzerSettings class allows you to customize various aspects of the analysis:

media_languages: List of languages for OCR to consider.
theme_color_variant: The color variant used for the generated theme.
captions_provider: The provider for image captioning (e.g., 'BLIP', 'LLM').
llm_provider: The provider for the large language model (LLM), which can be used for summaries and captions.
enable_text_summary: Enable or disable text summarization.
enable_document_summary: Enable or disable document summarization.
document_detection_threshold: Confidence threshold for document detection.
face_detection_threshold: Confidence threshold for face detection.
enabled_file_modules: List of file modules to enable (e.g., exif data, gps, weather detection).
enabled_visual_modules: List of visual modules to enable (e.g., 'classification', 'captioning', 'ocr', 'facial_recognition').

Full docs can be found at https://ruurdbijlsma.github.io/media-analyzer.

Attribution

Meteostat for weather info: https://dev.meteostat.net/python/
Reverse geocoding data from geonames: https://download.geonames.org/
ExifTool for exif and similar data: https://exiftool.org/

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

RuteNL

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.1

Oct 9, 2025

0.4.0

Oct 9, 2025

0.3.7

Mar 27, 2025

0.3.6

Mar 26, 2025

0.3.5

Mar 25, 2025

0.3.4

Mar 23, 2025

0.3.3

Feb 26, 2025

0.3.2

Feb 5, 2025

0.3.1

Jan 24, 2025

This version

0.3.0

Jan 24, 2025

0.2.3

Jan 22, 2025

0.2.2

Jan 21, 2025

0.2.1

Jan 20, 2025

0.2.0

Jan 20, 2025

0.1.4

Jan 19, 2025

0.1.3

Jan 19, 2025

0.1.2

Jan 18, 2025

0.1.1

Jan 18, 2025

0.1.0

Jan 18, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

media_analyzer-0.3.0.tar.gz (104.2 MB view details)

Uploaded Jan 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

media_analyzer-0.3.0-py3-none-any.whl (71.7 kB view details)

Uploaded Jan 24, 2025 Python 3

File details

Details for the file media_analyzer-0.3.0.tar.gz.

File metadata

Download URL: media_analyzer-0.3.0.tar.gz
Upload date: Jan 24, 2025
Size: 104.2 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for media_analyzer-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`c23f61212bc4d9ac65e78a5a09a1f6f3700ecab3cf119df6b179de75e7324186`
MD5	`963c1c36501fe234682c9fa8718a9894`
BLAKE2b-256	`6186f62b63f7a10ddd09aaf66ee4d07adcbdbd17f56795a46dc8fb5832ecb9e9`

See more details on using hashes here.

Provenance

The following attestation bundles were made for media_analyzer-0.3.0.tar.gz:

Publisher: publish-to-pypi.yml on RuurdBijlsma/media-analyzer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: media_analyzer-0.3.0.tar.gz
- Subject digest: c23f61212bc4d9ac65e78a5a09a1f6f3700ecab3cf119df6b179de75e7324186
- Sigstore transparency entry: 165343350
- Sigstore integration time: Jan 24, 2025
Source repository:
- Permalink: RuurdBijlsma/media-analyzer@9a4e8e18b0a863bae9c65022fcc7919835aec518
- Branch / Tag: refs/tags/0.3.0
- Owner: https://github.com/RuurdBijlsma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@9a4e8e18b0a863bae9c65022fcc7919835aec518
- Trigger Event: push

File details

Details for the file media_analyzer-0.3.0-py3-none-any.whl.

File metadata

Download URL: media_analyzer-0.3.0-py3-none-any.whl
Upload date: Jan 24, 2025
Size: 71.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for media_analyzer-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c72d9d5be32dc92f97cdb88cb8603c221c310f3cf1b5451d62d8b9988d1c5ade`
MD5	`301ea6bba8dfeb966b902f25d5064f9c`
BLAKE2b-256	`ddde5c0b8f19d6afd0ae4beb0b6159304aafd94edf4d687f90065e71e54b2c6b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for media_analyzer-0.3.0-py3-none-any.whl:

Publisher: publish-to-pypi.yml on RuurdBijlsma/media-analyzer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: media_analyzer-0.3.0-py3-none-any.whl
- Subject digest: c72d9d5be32dc92f97cdb88cb8603c221c310f3cf1b5451d62d8b9988d1c5ade
- Sigstore transparency entry: 165343351
- Sigstore integration time: Jan 24, 2025
Source repository:
- Permalink: RuurdBijlsma/media-analyzer@9a4e8e18b0a863bae9c65022fcc7919835aec518
- Branch / Tag: refs/tags/0.3.0
- Owner: https://github.com/RuurdBijlsma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@9a4e8e18b0a863bae9c65022fcc7919835aec518
- Trigger Event: push

media-analyzer 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

Media Analyzer

Features

Installation

Requirements

Examples

Usage

Disable analysis modules

File-based Modules

Visual Modules

Configuration

Attribution

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance