dedup
Compare the images in two image lists and find the duplicates.
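The output files described further down store a phash (perceptual hash) for every image, and similar images have hashes that differ in only a few bits. The sketch below shows that general idea on plain integer hashes. It is an illustration only: the function names and the `max_distance` threshold are hypothetical, and this README does not state which matching rule dedup actually applies.

```python
def hamming_distance(hash_a: int, hash_b: int) -> int:
    """Count the bits that differ between two integer hashes."""
    return bin(hash_a ^ hash_b).count("1")


def find_similar(hashes1, hashes2, max_distance=4):
    """Map each path in hashes1 to the paths in hashes2 whose hash is close.

    hashes1 and hashes2 are dicts of {image_path: integer_hash}.
    max_distance is a hypothetical threshold, not a documented dedup setting.
    """
    matches = {}
    for path1, hash1 in hashes1.items():
        close = [
            path2
            for path2, hash2 in hashes2.items()
            if hamming_distance(hash1, hash2) <= max_distance
        ]
        if close:
            matches[path1] = close
    return matches
```

A smaller threshold reports only near-identical images, while a larger one also catches resized or recompressed copies at the cost of more false matches.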
Install
- Clone the repo
- Install with
python setup.py install
Usage
- Use it from the command line
dedup path1 path2 save_dir [--log]
The comparison result will be saved as dedup.json in the specified save_dir.
Example
Input
path1
imglist1.txt
/mnt/Storage/test1/000001.jpg
/mnt/Storage/test1/000002.jpg
path2
imglist2.txt
/mnt/Storage/test2/000001.jpg
/mnt/Storage/test2/000002.jpg
/mnt/Storage/test2/000003.jpg
save_dir
/mnt/Storage
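An image list like the ones above is a plain text file with one absolute image path per line. A minimal sketch for generating such a list from a folder follows. The helper name `write_image_list` and the set of accepted file extensions are assumptions made for illustration and are not part of dedup.

```python
from pathlib import Path

# Assumed set of image extensions; adjust it to match your data.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def write_image_list(image_dir, list_path):
    """Write the absolute paths of the images in image_dir, one per line."""
    paths = sorted(
        str(p.resolve())  # resolve() returns an absolute path
        for p in Path(image_dir).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    Path(list_path).write_text("".join(f"{p}\n" for p in paths))
    return paths
```

For example, `write_image_list("/mnt/Storage/test1", "imglist1.txt")` would produce a file in the same shape as the imglist1.txt shown above.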
Output
imglist1_phash.hdf5
Stores the phashes of the images in imglist1 in .hdf5 format.
imglist2_phash.hdf5
Stores the phashes of the images in imglist2 in .hdf5 format.
dedup.json
Stores the comparison results in JSON format. An image in list1 is mapped to one or more images in list2 when they are similar.
{
"/mnt/Storage/test1/000001.jpg":[
"/mnt/Storage/test2/000001.jpg"
],
"/mnt/Storage/test1/000002.jpg":[
"/mnt/Storage/test2/000002.jpg"
]
}
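Because dedup.json is an ordinary JSON object that maps each path from list1 to a list of paths from list2, it can be post-processed with the standard library. The sketch below loads the file and flattens it into pairs. The helper names `load_duplicates` and `duplicate_pairs` are hypothetical and are not provided by dedup.

```python
import json
from pathlib import Path


def load_duplicates(save_dir):
    """Load dedup.json from save_dir as a dict of {list1_path: [list2_paths]}."""
    with open(Path(save_dir) / "dedup.json", encoding="utf-8") as f:
        return json.load(f)


def duplicate_pairs(results):
    """Flatten the mapping into (list1_path, list2_path) tuples."""
    return [(path1, path2) for path1, matches in results.items() for path2 in matches]
```

With the example above, `duplicate_pairs(load_duplicates("/mnt/Storage"))` would yield one tuple per matched image pair, which is convenient for reviewing or deleting duplicates.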
Parameters
path1
Path of the first image list.
Image paths in the list should be absolute paths.
path2
Path of the second image list.
Image paths in the list should be absolute paths.
save_dir
Path of the folder where the results are saved.
--log
[optional]
With this option, the program prints a detailed log for each thread and each duplicate image pair. This can flood the screen with messages, so it is recommended to redirect the output to a log file.