Skip to main content

No project description provided

Project description

[toc]

dedup

Compare images in two image lists and find dulplicate ones.

Install

  • Clone the repo
  • Install with
python setup.py install

Usage

  • Use it in command line
dedup path1 path2 save_dir [--log]

Compare result will be sved in dedup.json in the specified save_dir

Example

Input

  • path1
    imglist1.txt
/mnt/Storage/test1/000001.jpg
/mnt/Storage/test1/000002.jpg
  • path2
    imglist2.txt
/mnt/Storage/test2/000001.jpg
/mnt/Storage/test2/000002.jpg
/mnt/Storage/test2/000003.jpg
  • save_dir
/mnt/Storage

Output

  • imglist1_phash.hdf5
    Store the phashes of images in imglist1 in .hdf5 format.
  • imglist2_phash.hdf5
    Store the phashes of images in imglist2 in .hdf5 format.
  • dedup.json
    Store the comparision results in json format. Each image in list1 will correspond to one or more images in list2 if they are similar.
{
    "/mnt/Storage/test1/000001.jpg":[
        "/mnt/Storage/test2/000001.jpg"
    ],
    "/mnt/Storage/test1/000002.jpg":[
        "/mnt/Storage/test2/000002.jpg"
    ]
}

Parameters

  • path1
    Path of the first image list.
    Image paths in the list should be Absolute path.
  • path2
    Path of the second image list.
    Image paths in the list should be Absolute path.
  • save_dir
    Folder path to save the results.
  • --log[optional]
    With this option added, the program will print detail log for each thread and each dulplicate image pair. This may cause message flush on screen so it is recommended to pipe it in to a log file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ddup-0.0.1.tar.gz (6.3 kB view hashes)

Uploaded Source

Built Distribution

ddup-0.0.1-py2.py3-none-any.whl (6.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page