Video deduplicator utility for Hydrus Network
Project description
Hydrus Video Deduplicator
Hydrus Video Deduplicator finds potential duplicate videos through the Hydrus API
How It Works:
The deduplicator works by comparing videos similarity by their perceptual hash.
Potential duplicates can be processed through the Hydrus duplicates processing page just like images.
You can choose to process only a subset of videos with --query
using Hydrus tags, e.g. --query="character:edward"
will only process videos with the tag character:edward
.
For more information check out the wiki and the FAQ
Installation:
Dependencies:
- Python >=3.10
python3 -m pip install hydrusvideodeduplicator
Usage:
python3 -m hydrusvideodeduplicator --api-key="<your key>"
For full list of options see --help
or the usage page.
TODO:
- Option to rollback and remove potential duplicates
- OR predicates for --query
- Parallelize hashing and duplicate search
- Automatically generate access key with Hydrus API
- Docker container
- Upload Docker container to Docker Hub (GitHub Action)
- Pure Python port of vpdq
- Windows compatibility without WSL or Docker
Contact:
Create an issue on GitHub for any problems/concerns. Provide as much detail as possible in your issue.
Message @applenanner on the Hydrus Discord for other general questions/concerns
Attribution:
Hydrus Network (DWTFYWTPL)
Hydrus API Library (GNU AGPLv3) by cryzed
pdq (BSD) by Meta
vpdq (BSD) by Meta
Big Buck Bunny, Sintel (CC BY 3.0) clips by Blender Foundation
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file hydrusvideodeduplicator-0.4.2.tar.gz
.
File metadata
- Download URL: hydrusvideodeduplicator-0.4.2.tar.gz
- Upload date:
- Size: 47.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cf826838724bc10db14d3c2119bb6ccfeaaf1185709051a47d581fc60ce9c3af |
|
MD5 | fb659d800979c0b0c01a4a312df470ee |
|
BLAKE2b-256 | 1d78ed4152104129cd055ab1e503ffc2e9fe19ad7146cccc63b041789ae93701 |
File details
Details for the file hydrusvideodeduplicator-0.4.2-py3-none-any.whl
.
File metadata
- Download URL: hydrusvideodeduplicator-0.4.2-py3-none-any.whl
- Upload date:
- Size: 53.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7b29dacb8dacb88ce1357d9ca45011137729971a3b8e6468345607db9f3d9a22 |
|
MD5 | dddaac2cef34f262a940e8373b901546 |
|
BLAKE2b-256 | adb8a90957602e4c1e218abc438785e83f68c955724934cc61e2b58b1aa34e63 |