Templates for GroundingDINO with HuggingFace
Project description
Sinapsis Hugging Face Grounding DINO
Templates for seamless integration with Grounding DINO models
🐍 Installation • 📦 Features • ▶️ Example Usage • 🌐 Webapp • 📙 Documentation • 🔍 License
🐍 Installation
Install using your package manager of choice. We encourage the use of uv
Example with uv:
uv pip install sinapsis-huggingface-grounding-dino --extra-index-url https://pypi.sinapsis.tech
or with raw pip:
pip install sinapsis-huggingface-grounding-dino --extra-index-url https://pypi.sinapsis.tech
[!IMPORTANT] Templates may require extra optional dependencies. For development, we recommend installing the package with all the optional dependencies:
with uv:
uv pip install sinapsis-huggingface-grounding-dino[all] --extra-index-url https://pypi.sinapsis.tech
or with raw pip:
pip install sinapsis-huggingface-grounding-dino[all] --extra-index-url https://pypi.sinapsis.tech
📦 Features
The templates in this package include functionality for different Grounding DINO-based tasks:
- GroundingDINO: Detect objects with bounding boxes based on text prompts (zero-shot object detection).
- GroundingDINOClassification: Classify images using predefined classes or text prompts, handling as many classes as possible within token limits.
- GroundingDINOFineTuning: Fine-tune Grounding DINO checkpoints on custom datasets.
▶️ Example Usage
Below is an example YAML configuration for zero-shot object detection using Grounding DINO.
Config
agent:
name: grounding_dino_detection
templates:
- template_name: InputTemplate
class_name: InputTemplate
attributes: {}
- template_name: FolderImageDatasetCV2
class_name: FolderImageDatasetCV2
template_input: InputTemplate
attributes:
data_dir: my_dataset
- template_name: GroundingDINO
class_name: GroundingDINO
template_input: FolderImageDatasetCV2
attributes:
model_path: IDEA-Research/grounding-dino-base
inference_mode: zero_shot
text_input: a person.
device: cuda
threshold: 0.2
text_threshold: 0.3
- template_name: BBoxDrawer
class_name: BBoxDrawer
template_input: GroundingDINO
attributes:
overwrite: true
randomized_color: false
- template_name: ImageSaver
class_name: ImageSaver
template_input: BBoxDrawer
attributes:
save_dir: ./output_dir
extension: png
[!IMPORTANT] The FolderImageDatasetCV2, BBoxDrawer and ImageSaver templates correspond to the sinapsis-data-readers, sinapsis-data-visualization and sinapsis-data-writers packages respectively. If you want to use the example, please make sure you install these packages.
To run the config, use the CLI:
sinapsis run name_of_config.yml
🌐 Webapp
Th Sinapsis web applications provide an interactive way to explore and experiment with AI models. They allow users to generate outputs, test different inputs, and visualize results in real time, making it easy to experience the capabilities of each model. Below are the available webapps and instructions to launch them.
[!IMPORTANT] To run any of the apps, you first need to clone this repo:
git clone git@github.com:Sinapsis-ai/sinapsis-huggingface.git
cd sinapsis-huggingface
[!NOTE] If you'd like to enable external app sharing in Gradio,
export GRADIO_SHARE_APP=True.
[!NOTE] Agent configuration can be changed through the AGENT_CONFIG_PATH env var. You can check the available configurations in each package configs folder.
🐳 Build with Docker
IMPORTANT The docker image depends on the sinapsis-nvidia:base image. To build it, refer to the [official sinapsis documentation](https://
- Build the sinapsis-huggingface image:
docker compose -f docker/compose.yaml build
- Start the container:
docker compose -f docker/compose_vision.yaml up sinapsis-huggingface-vision-gradio -d
- Check the status:
docker logs -f sinapsis-huggingface-vision-gradio
- The logs will display the URL to access the webapp, e.g.,:
Running on local URL: http://127.0.0.1:7860
NOTE: The local URL can be different, please check the logs
- To stop the app:
docker compose -f docker/compose_vision.yaml down
📦 UV
- Create the virtual environment and sync the dependencies:
uv sync --frozen
- Install the dependencies:
uv pip install sinapsis-huggingface[all] --extra-index-url https://pypi.sinapsis.tech
- Run the webapp.
uv run webapps/vision_demo.py
- The terminal will display the URL to access the webapp, e.g., :
Running on local URL: http://127.0.0.1:7860
📙 Documentation
Documentation is available on the sinapsis website
Tutorials for different projects within sinapsis are available at sinapsis tutorials page
🔍 License
This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.
For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sinapsis_huggingface_grounding_dino-0.1.0.tar.gz.
File metadata
- Download URL: sinapsis_huggingface_grounding_dino-0.1.0.tar.gz
- Upload date:
- Size: 47.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aebb9189c41283ebe1fddb6ba4b90f8bda0b90499cc1f18ef1fdbb01d278ecb0
|
|
| MD5 |
ce3dfe31beeb16fc70414d0a08d61cc4
|
|
| BLAKE2b-256 |
6f3d62efd193554c51601556460cab34d890459c350932556cba64915f93444d
|
File details
Details for the file sinapsis_huggingface_grounding_dino-0.1.0-py3-none-any.whl.
File metadata
- Download URL: sinapsis_huggingface_grounding_dino-0.1.0-py3-none-any.whl
- Upload date:
- Size: 36.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
427d7ad3a07bbf84920fba0229aecb808f90c9223fbb3881e7adc56869fef60a
|
|
| MD5 |
c6094b708733359354c4922b3cbbc97d
|
|
| BLAKE2b-256 |
5428960b4dc89f042e26458829eb05c642e0e6a5599748ceaf0a7bd80b7c37dc
|