Skip to main content

Model for use with Autodistill

Project description

Autodistill Gemini Module

This repository contains the code supporting the Gemini base model for use with Autodistill.

Gemini, developed by Google, is a multimodal computer vision model that allows you to ask questions about images. You can use Gemini with Autodistill for image classification.

You can combine Gemini with other base models to label regions of an object. For example, you can use Grounding DINO to identify abstract objects (i.e. a vinyl record) then Gemini to classify the object (i.e. say which of five vinyl records the region represents). Read the Autodistill Combine Models guide for more information.

[!NOTE] Using this project will incur billing charges for API calls to the Gemini API. Refer to the Google Cloud pricing page for more information and to calculate your expected pricing. This package makes one API call per image you want to label.

Read the full Autodistill documentation.

Installation

To use Gemini with autodistill, you need to install the following dependency:

pip3 install autodistill-gemini

Quickstart

from autodistill_gemini import Gemini

# define an ontology to map class names to our Gemini prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = Gemini(
    ontology=CaptionOntology(
        {
            "person": "person",
            "a forklift": "forklift"
        }
    ),
    gcp_region="us-central1",
    gcp_project="project-name",
)

# run inference on an image
result = base_model.predict("image.jpg")

print(result)

# label a folder of images
base_model.label("./context_images", extension=".jpeg")

License

This project is licensed under an MIT license.

🏆 Contributing

We love your input! Please see the core Autodistill contributing guide to get started. Thank you 🙏 to all our contributors!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autodistill-gemini-0.1.0.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

autodistill_gemini-0.1.0-py3-none-any.whl (4.5 kB view details)

Uploaded Python 3

File details

Details for the file autodistill-gemini-0.1.0.tar.gz.

File metadata

  • Download URL: autodistill-gemini-0.1.0.tar.gz
  • Upload date:
  • Size: 4.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for autodistill-gemini-0.1.0.tar.gz
Algorithm Hash digest
SHA256 d56c4fce0a01718eccc3bd35e8c243aade2e777cf40ed74ff90d0b92b7823b4b
MD5 9423a464a5acfd8b78ac2d6f2ede234d
BLAKE2b-256 7d1964178760bd2e19641c76b2fc764ee2e05bbc3baa8cf44d46389d0e7d6259

See more details on using hashes here.

File details

Details for the file autodistill_gemini-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for autodistill_gemini-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 70b9754dfc8d19749a7f49fd4cfbe00a698902428ab0f1e3a2cc4e60acd743c0
MD5 40950db8a856b61dde467ccff1a2fee7
BLAKE2b-256 919dec48d70bd928d0473644bdc9829e9c956452558050ff87914b26d73b0a04

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page