Performs tagging of image and videos based on various taggers
Project description
Media Tagger
Problem statement
When analyzing large amount of creatives of any nature (being images and videos) it might be challenging to quickly and reliably understand their content and gain insights.
Solution
media-tagger
performs tagging of image and videos based on various taggers
- simply provide a path to your media files and
media-tagger
will do the rest.
Deliverable (implementation)
media-tagger
is implemented as a:
- library - Use it in your projects with a help of
media_tagging.tagger.create_tagger
function. - CLI tool -
media-tagger
tool is available to be used in the terminal. - HTTP endpoint -
media-tagger
can be easily exposed as HTTP endpoint. - Langchain tool - integrated
media-tagger
into your Langchain applications.
Deployment
Prerequisites
- Python 3.11+
- A GCP project with billing account attached
- Video Intelligence API and Vision API enabled.
- API key to access to access Google Gemini.
-
Once you created API key export it as an environmental variable
export GOOGLE_API_KEY=<YOUR_API_KEY_HERE>
-
Installation
Install media-tagger
with pip install media-tagging[all]
command.
Alternatively you can install subsets of media-tagging
library:
media-tagging[api]
- tagging videos and images with Google Cloud APIs.media-tagging[image-api]
- only for tagging images.media-tagging[video-api]
- only for tagging videos.
media-tagging[llm]
- tagging videos and images with LLMs.media-tagging[base-llm]
- only for tagging images with llms.media-tagging[google-genai]
- only for tagging images via Gemini.media-tagging[google-vertexai]
- only for tagging videos via Gemini.
Usage
This section is focused on using
media-tagger
as a CLI tool. Check library, http endpoint, langchain tool sections to learn more.
Once media-tagger
is installed you can call it:
media-tagger --media-path MEDIA_PATH --tagger TAGGER_TYPE --writer WRITER_TYPE
where:
- MEDIA_PATH - comma-separated names of files for tagging (can be urls).
- TAGGER_TYPE - name of tagger, supported options:
vision-api
- tags images based on Google Cloud Vision API,video-api
for videos based on Google Cloud Video Intelligence APIgemini-image
- Uses Gemini to tags images. Add--tagger.n_tags=<N_TAGS>
parameter to control number of tags returned by tagger.gemini-structured-image
- Uses Gemini to find certain tags in the images. Add--tagger.tags='tag1, tag2, ..., tagN
parameter to find certain tags in the image.gemini-description-image
- Provides brief description of the image,
- WRITER_TYPE - name of writer, one of
csv
,json
By default script will create a single file with tagging results for each media_path.
If you want to combine results into a single file add --output OUTPUT_NAME
flag (without extension, i.e. --output tagging_sample
.
Disclaimer
This is not an officially supported Google product.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file media_tagging-0.5.2.tar.gz
.
File metadata
- Download URL: media_tagging-0.5.2.tar.gz
- Upload date:
- Size: 20.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9ca6de2216435b58d48438f688149059d204100edfab52f8c75df144e7b9a224 |
|
MD5 | 416ad4cc51496a6fb203c45867078794 |
|
BLAKE2b-256 | 994e28a57f806ad2d7cb96e5782c08b7fb8600ad3b3c5c5a9d9bd0138fc7cc3f |
File details
Details for the file media_tagging-0.5.2-py3-none-any.whl
.
File metadata
- Download URL: media_tagging-0.5.2-py3-none-any.whl
- Upload date:
- Size: 28.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 28f8aecf75ac4f41ad003f4a2307f85f0b3ff50f41412a872e843ce8d0040a30 |
|
MD5 | 154b9e9e4abba7e7665ee163fa227011 |
|
BLAKE2b-256 | ddb1e202029dc84f9a3afdfb4ff3b5c895b20b9e8bc23e24727d3ac37ee26acf |