Skip to main content

Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.

Project description

SMQTK - Core

A light-weight, non-intrusive framework for developing interfaces that have built-in implementation discovery and factory construction from a simple configuration structure.

While anything may make use of this library, this was originally developed as a foundation for a suite of packages that predominantly support AI and Machine Learning use-cases:

  • Scalable data structure interfaces and implementations, with a focus on those relevant for machine learning like descriptors, classifications, and object detections.

  • Interfaces and implementations of machine learning algorithms with a focus on media-based functionality.


Libraries

Some above-mentioned packages supporting AI/ML topics include the following:

  • SMQTK-Dataprovider provides abstractions around data storage and retrieval.

  • SMQTK-Image-IO provides interfaces and implementations around image reading and writing using abstractions defined in SMQTK-Dataprovider.

  • SMQTK-Descriptors provides algorithms and data structures around computing descriptor vectors from different kinds of input data.

  • SMQTK-Classifier provides interfaces and implementations around black-box classification.

  • SMQTK-Detection provides interfaces and support for black-box object detection.

  • SMQTK-Indexing provides interfaces and implementations for efficient, large-scale indexing of descriptor vectors. The sources of such descriptor vectors may come from a multitude of sources, such as hours of video archives. Some provided implementation plugins include Locality-sensitive Hashing (LSH) and FAIR's FAISS library.

  • SMQTK-Relevancy provides interfaces and implementations for ranking datasets using human-in-the-loop feedback. This is a primary component for Interactive Query Refinement (IQR) systems that makes use of human feedback.

  • SMQTK-IQR provides classes and utilities to perform the Interactive Query Refinement (IQR) process. This package also includes a web API exposing the use of these tools as well as an example web UI service to demonstrate the capability. These services are additionally containerized to provide some portability of these services.

These packages are related as follows:

Dependency Graph

This looks a lot like KWIVER! Why use this instead?

KWIVER is another open source package that similarly holds modularity, plugins and configurability at its core.

The SMQTK-* suite of functionality exists separately from KWIVER for a few reasons (for now):

  • History
    • The origins of KWIVER and SMQTK were initiated at roughly the same time and were never resolved into the same thing because...
  • Language
    • KWIVER has historically been predominantly C++ while SMQTK-* is (mostly) pure python. (see note below)
  • Configuration UX
    • SMQTK takes an "add on" approach to configurability: concrete implementations have parameterized constructors and should be usable after construction like a "normal" object. Configuration semantics are derived from introspection of, and explicitly related to, the constructor. KWIVER takes an alternative approach where construction is generally empty and configuration setting is a required separate step via a custom object (ConfigBlock).
  • Pythonic Plugin Support
    • Plugins are exposed via standard package entrypoints.

If I'm using python, does that mean that SMQTK is always the better choice?

At this point, not necessarily. While this used to be true for a number of years due to SMQTK being the toolkit with python support. This is becoming more blurry KWIVER's continuously improving python binding support.

Building Documentation

Documentation is hosted on ReadTheDocs.io here.

You can also build the sphinx documentation locally for the most up-to-date reference:

# Install dependencies
poetry install
# Navigate to the documentation root.
cd docs
# Build the docs.
poetry run make html
# Open in your favorite browser!
firefox _build/html/index.html

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smqtk_core-0.20.0.tar.gz (19.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

smqtk_core-0.20.0-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file smqtk_core-0.20.0.tar.gz.

File metadata

  • Download URL: smqtk_core-0.20.0.tar.gz
  • Upload date:
  • Size: 19.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.10.18 Linux/6.11.0-1018-azure

File hashes

Hashes for smqtk_core-0.20.0.tar.gz
Algorithm Hash digest
SHA256 64e2992ed4923f7f0cfe2111edc93dfd54913800eb4cbdd1f77e9c90d310686f
MD5 2a27c2f7d071367f85a61db38a084ef7
BLAKE2b-256 f6154e9ff0dcf54f641b97fcc71e5297c40dfa1cde279ee71be14da08e3a5e0b

See more details on using hashes here.

File details

Details for the file smqtk_core-0.20.0-py3-none-any.whl.

File metadata

  • Download URL: smqtk_core-0.20.0-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.10.18 Linux/6.11.0-1018-azure

File hashes

Hashes for smqtk_core-0.20.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4ff0d4bbb7ed03018c432a5cbb43e97e43b45c1752c61affe7748e20bc768fe0
MD5 6a9aca5d0ca2a573b670b6d1343a3d1e
BLAKE2b-256 d29097ee25620a905d64d2674c366e71ac5c73d7aad7717324100a957a6393f7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page