Skip to main content

Extract foreground from images or video via text prompt

Project description

ezsam (easy segment anything model)

A command line and gui tool to segment images and video via text prompts.

Input images and videos, describe the subjects or objects you want to keep, and output new images and videos with the background removed.

Check out the docs! ezsam.org

Why?

Meta's Segment Anything is a powerful tool for separating parts of images, but requires coordinate prompts—either bounding boxes or points. And manual prompt generation is tedious for large collections of still images or video.

In contrast, text-based prompts describing the object(s) in the foreground to segment can be constant. Inspired by Grounded-Segment-Anything, this project tries to package a simpler to use tool.

If you're not interested in text-based prompts with Segment Anything, check out rembg.

How does it work?

The foreground is selected using text prompts to GroundingDINO to detect objects. Image segments are generated using Segment Anything or Segment Anything HQ (SAM-HQ).

Quick start

# Ubuntu 22.04, Python 3.9 - 3.11
pip install ezsam
sudo apt install ffmpeg imagemagick
ezsam --help
ezsam-gui

For more detailed info, see the documentation site here: ezsam.org

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ezsam-0.3.0.tar.gz (76.0 kB view hashes)

Uploaded Source

Built Distribution

ezsam-0.3.0-py3-none-any.whl (34.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page