GPT-4V model for use with Autodistill
Autodistill GPT-4V Module
This repository contains the code supporting the GPT-4V base model for use with Autodistill.
GPT-4V, developed by OpenAI, is a multimodal language model. With GPT-4V, you can ask questions about images in natural language. The autodistill-gpt-4v module enables you to classify images using GPT-4V.
This model uses the gpt-4-vision-preview API announced by OpenAI on November 6th, 2023.
[!NOTE]
Using this project will incur billing charges for API calls to the OpenAI GPT-4 Vision API. Refer to the OpenAI pricing page for more information and to estimate your expected costs. This package makes one API call per image you want to label.
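Because the package makes one API call per image, a rough cost estimate only needs the number of images you plan to label. The following is a minimal sketch, not part of this package: `count_images`, `estimate_cost`, and `PRICE_PER_CALL_USD` are hypothetical names, the price is a placeholder rather than an OpenAI figure, and the sketch assumes only files directly inside the folder with an exactly matching extension are labeled.

```python
from pathlib import Path


def count_images(folder: str, extension: str = ".jpeg") -> int:
    """Count files directly inside `folder` whose suffix equals `extension`."""
    return sum(
        1
        for path in Path(folder).iterdir()
        if path.is_file() and path.suffix == extension
    )


def estimate_cost(num_images: int, price_per_call_usd: float) -> float:
    """One API call per image, so the cost grows linearly with the image count."""
    return num_images * price_per_call_usd


if __name__ == "__main__":
    # Placeholder value only: look up the real figure on the OpenAI pricing page.
    PRICE_PER_CALL_USD = 0.01
    folder = Path("./context_images")
    if folder.is_dir():
        n = count_images(str(folder))
        cost = estimate_cost(n, PRICE_PER_CALL_USD)
        print(f"{n} images -> {n} API calls, roughly ${cost:.2f}")
```

Running this before labeling gives an order-of-magnitude figure; actual charges depend on OpenAI's current pricing and on image size.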
Read the full Autodistill documentation.
Read the GPT-4V Autodistill documentation.
Installation
To use GPT-4V with autodistill, you need to install the following dependency:
pip3 install autodistill-gpt-4v
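To confirm that the install succeeded, you can query the installed distribution's version with the Python standard library. This is a small sketch; `installed_version` is a hypothetical helper name, not something this package provides.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is missing."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


# Prints a version string if the package is installed, otherwise None.
print(installed_version("autodistill-gpt-4v"))
```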
Quickstart
from autodistill.detection import CaptionOntology
from autodistill_gpt_4v import GPT4V

# define an ontology to map class names to our GPT-4V prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = GPT4V(
    ontology=CaptionOntology(
        {
            "person": "person",
            "a forklift": "forklift"
        }
    )
)

# label every .jpeg image in the folder (one API call per image)
base_model.label("./context_images", extension=".jpeg")
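To make the {caption: class} format concrete, the sketch below walks through the same mapping with a plain Python dictionary. It only illustrates the idea and does not reproduce how `CaptionOntology` is implemented; `caption_to_class` is a hypothetical helper.

```python
# Same mapping as in the Quickstart:
# key = caption sent to the base model, value = label saved in the annotations.
ontology = {
    "person": "person",
    "a forklift": "forklift",
}


def caption_to_class(caption: str, mapping: dict) -> str:
    """Translate a caption chosen for an image into the label to store."""
    return mapping[caption]


prompts = list(ontology.keys())    # what the model is asked about
classes = list(ontology.values())  # what ends up in the dataset

print(prompts)
print(classes)
print(caption_to_class("a forklift", ontology))
```

Keeping captions and classes separate lets you phrase the prompt naturally ("a forklift") while storing a clean label ("forklift") in the generated annotations.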
License
This project is licensed under an MIT license.
🏆 Contributing
We love your input! Please see the core Autodistill contributing guide to get started. Thank you 🙏 to all our contributors!
Download files
Built Distribution
Hashes for autodistill_gpt_4v-0.1.4-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 18fcb809720e5d89f654fa6916af2c07c004d27a37a3bbeaf57562f2044c0a41
MD5 | 31a306773b5faa250791ff30426f4299
BLAKE2b-256 | 348d2e6f05eea1dbf5275e4a439557545b5b981a4b3b6949dc82e6e94a44f421