Python package for easily interfacing with MediaCatch APIs and data formats.
Project description
MediaCatch
Python package for easily interfacing with MediaCatch APIs and data formats. Documentation can be found at api.mediacatch.io/mediacatch/docs/.
Requirements
- Python >= 3.10
- MediaCatch API key (contact support@medicatch.io)
Installation
Install with pip
pip install mediacatch
Getting Started
Firstly, add your MediaCatch API key to your environment variables
export MEDIACATCH_API_KEY=<your-api-key-here>
Then you can start using the command line interface
mediacatch --help
usage: mediacatch <command> [<args>]
MediaCatch CLI tool
positional arguments:
{speech,vision,viz} mediacatch command helpers
speech CLI tool to run inference with MediaCatch Speech API
vision CLI tool to run inference with MediaCatch Vision API
viz CLI tool to visualize the results of MediaCatch
options:
-h, --help show this help message and exit
Upload a file to MediaCatch Speech API and get the results
mediacatch speech path/to/file --save-result path/to/save/result.json
# or to see options
mediacatch speech --help
Upload a file to MediaCatch vision API and get the results
mediacatch vision path/to/file ocr --save-result path/to/save/result.json
# or to see options
mediacatch vision --help
Or import as a module
from mediacatch.vision import upload, wait_for_result
file_id = upload(
fpath='path/to/file',
type='ocr',
# Optional parameters with their default values
fps=1, # Frames per second for video processing. Defaults to 1.
tolerance=10, # Tolerance for text detection. Defaults to 10.
min_bbox_iou=0.5 # Minimum bounding box intersection over union for text detection. Defaults to 0.5.
min_levenshtein_ratio=0.75 # Minimum Levenshtein ratio for merging text detection (more info here: https://rapidfuzz.github.io/Levenshtein/levenshtein.html#ratio). Defaults to 0.75.
moving_threshold=50, # If merged text detections center moves more pixels than this threshold, it will be considered moving text. Defaults to 50.
max_text_length=3, # If text length is less than this value, use max_text_confidence as confidence threshold. Defaults to 3.
min_text_confidence=0.5, # Confidence threshold for text detection (if text length is greater than max_text_length). Defaults to 0.5.
max_text_confidence=0.8, # Confidence threshold for text detection (if text length is less than max_text_length). Defaults to 0.8.
max_height_width_ratio=2.0, # Discard detection if height/width ratio is greater than this value. Defaults to 2.0.
get_detection_histogram=False, # If true, get histogram of detection. Defaults to False.
detection_histogram_bins=8, # Number of bins for histogram calculation. Defaults to 8.
)
result = wait_for_result(file_id)
Visualize the results from MediaCatch
mediacatch viz ocr path/to/file path/to/result.json path/to/save.mp4
# or to see options
mediacatch viz --help
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
mediacatch-0.2.2.tar.gz
(221.2 kB
view details)
Built Distribution
mediacatch-0.2.2-py3-none-any.whl
(180.6 kB
view details)
File details
Details for the file mediacatch-0.2.2.tar.gz
.
File metadata
- Download URL: mediacatch-0.2.2.tar.gz
- Upload date:
- Size: 221.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 11e66f32b830e4abc03adb80293d23eac7cd635d98be0782b5ff2392827bf741 |
|
MD5 | f41cf58b4bd7bf14d37b16348e4dee34 |
|
BLAKE2b-256 | 4037dfa6a878895f0377dbc5a4a85fa83c66e4d09276187d6aaca7989b341328 |
File details
Details for the file mediacatch-0.2.2-py3-none-any.whl
.
File metadata
- Download URL: mediacatch-0.2.2-py3-none-any.whl
- Upload date:
- Size: 180.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 55549e856e75a86fc703875928c510ebf26a82df3f14140775d32ce02d9b2698 |
|
MD5 | 8113b7928bc713575e1baa318f63f4f6 |
|
BLAKE2b-256 | 471620644c6bfc3378ccfb66cafe5197204a0b3d67b34d5d0cdc5cdc17a9fb5b |