Skip to main content

Extract foreground from images or video via text prompt

Project description

ezsam (easy segment anything model)

A command line and gui tool to segment images and video via text prompts.

Input images and videos, describe the subjects or objects you want to keep, and output new images and videos with the background removed.

Check out the docs! ezsam.org

A large table with plates of Thanksgiving food including a whole turkey on a conveyor-style lazy susan. A whole cooked turkey flying through the void.

Why?

Meta's Segment Anything is a powerful tool for separating parts of images, but requires coordinate prompts—either bounding boxes or points. And manual prompt generation is tedious for large collections of still images or video.

In contrast, text-based prompts describing the object(s) in the foreground to segment can be constant. Inspired by Grounded-Segment-Anything, this project tries to package a simpler to use tool.

If you're not interested in text-based prompts with Segment Anything, check out rembg.

How does it work?

The foreground is selected using text prompts to GroundingDINO to detect objects. Image segments are generated using Segment Anything or Segment Anything HQ (SAM-HQ).

Quick start

# Ubuntu 22.04, Python 3.9 - 3.11
pip install ezsam
sudo apt install ffmpeg imagemagick
ezsam --help
ezsam-gui

For more detailed info, see the documentation site here: ezsam.org

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ezsam-0.5.0.tar.gz (65.7 kB view details)

Uploaded Source

Built Distribution

ezsam-0.5.0-py3-none-any.whl (35.8 kB view details)

Uploaded Python 3

File details

Details for the file ezsam-0.5.0.tar.gz.

File metadata

  • Download URL: ezsam-0.5.0.tar.gz
  • Upload date:
  • Size: 65.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for ezsam-0.5.0.tar.gz
Algorithm Hash digest
SHA256 2d754bfef2e024e563db9d3a316a69fc262cb4ce474a6fd7ee02a2ad53364dcb
MD5 02ff954775e6092172df3e7e33a2aec8
BLAKE2b-256 8ed5df56f7d0c5101e1612fb08b650662f66659fb06868fea0ca4cfac95e919a

See more details on using hashes here.

File details

Details for the file ezsam-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: ezsam-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 35.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for ezsam-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ec03f96085616821f2632ee784b97ede1b65dcc340c576bfe2b9e579c1f52622
MD5 d090ed97bce5f5d79c7f45dd189cee9d
BLAKE2b-256 fa1f825f904da27551ae6f1e0307a67b29423c296482a14fbabc9b87f33c4bd8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page