URLs Deduplication Tool.

UDdup - URLs Deduplication Tool

The tool takes a list of URLs and removes "duplicate" pages, meaning URLs whose patterns are probably repetitive and point to the same web template.

For example:

All of the above probably point to the same product "template". It should therefore be enough for our various scanners to scan only some of these URLs.

The result of the above after UDdup should be:

Why do I need it?

Mostly for a better (automated) reconnaissance process, with less noise for both the tester and the target.
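
To make the idea of template-level duplicates concrete, here is a minimal Python sketch. It is not UDdup's actual algorithm: the heuristic (treating purely numeric path segments as interchangeable IDs), the function names url_pattern and dedupe, and the sample URLs are all assumptions made up for illustration.

```python
import re
from urllib.parse import urlsplit

# Hypothetical heuristic: purely numeric path segments are treated as IDs.
_NUMERIC_SEGMENT = re.compile(r"^\d+$")


def url_pattern(url: str) -> str:
    """Return a coarse pattern key: host plus path with numeric segments masked."""
    parts = urlsplit(url)
    segments = [
        "{id}" if _NUMERIC_SEGMENT.match(segment) else segment
        for segment in parts.path.split("/")
    ]
    return parts.netloc + "/".join(segments)


def dedupe(urls):
    """Keep the first URL seen for each pattern, preserving input order."""
    seen = set()
    kept = []
    for url in urls:
        key = url_pattern(url)
        if key not in seen:
            seen.add(key)
            kept.append(url)
    return kept


# Made-up URLs, for illustration only.
sample = [
    "https://www.example.com/product/123",
    "https://www.example.com/product/456",
    "https://www.example.com/about",
    "https://www.example.com/product/789",
]

print(dedupe(sample))
```

With this heuristic, the three product URLs collapse into a single representative while the unrelated page is kept. A real tool would likely need more rules (query strings, slugs, file extensions), which this sketch deliberately leaves out.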


Take a look at demo.txt, the raw URLs file, and at demo-results.txt, the output it produces.

Installation


With pip (Recommended)

pip install uddup

Manual (from code)

# Clone the repository.
git clone

# Install the Python requirements.
cd uddup
pip install -r requirements.txt

Usage


uddup -u demo.txt -o ./demo-result.txt

More Usage Options

uddup -h

Short Form   Long Form       Description
-h           --help          Show this help message and exit
-u           --urls          File with a list of URLs
-o           --output        Save results to a file
-s           --silent        Print only the result URLs
-fp          --filter-path   Filter paths by a given regex

Filter Paths by Regex

This option filters paths by a custom pattern. For example, to filter all paths that start with /product, run:

# Single Regex
uddup -u demo.txt -fp "^product"



Advanced Regex with multiple path filters

uddup -u demo.txt -fp "(^product)|(^category)"
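
To preview which paths the two patterns above would select, they can be tried with Python's re module. This is only a sketch under assumptions: it supposes that the path is matched without its leading slash (as the ^product pattern suggests) and that matching behaves like re.search. The helper name split_by_pattern and the sample paths are hypothetical, and the sketch does not reproduce what UDdup does with the matching URLs.

```python
import re


def split_by_pattern(paths, pattern):
    """Split paths into (matched, unmatched) lists using re.search."""
    regex = re.compile(pattern)
    matched = [path for path in paths if regex.search(path)]
    unmatched = [path for path in paths if not regex.search(path)]
    return matched, unmatched


# Made-up paths, written without the leading slash (an assumption).
paths = ["product/123", "category/shoes", "about", "blog/product-news"]

# Single regex: only paths that start with "product".
single_matched, single_rest = split_by_pattern(paths, r"^product")

# Multiple path filters combined with alternation.
multi_matched, multi_rest = split_by_pattern(paths, r"(^product)|(^category)")

print(single_matched)
print(multi_matched)
```

Because of the ^ anchor, a path such as blog/product-news is not selected by either pattern even though it contains the word "product".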


Feel free to fork the repository and submit pull requests.


Create new GitHub issue

Want to say thanks? :) Message me on LinkedIn




Files for uddup, version 0.9.3

Filename                        Size     File type   Python version
uddup-0.9.3.tar.gz              4.9 kB   Source      None
uddup-0.9.3-py3-none-any.whl    5.8 kB   Wheel       py3
