Source information extractor
Project description
PTINSEARCHER
Source information extractor
ptinsearcher is a tool for extracting information from provided sources (websites, files). This tool allows dumping of HTML comments, e-mail addresses, phone numbers, IP addresses, subdomains, HTML forms, links and metadata of documents.
Installation
pip install ptinsearcher
Add to PATH
If you cannot invoke the script in your terminal, its probably because its not in your PATH. Fix it by running commands below.
Add to PATH for Bash
echo "export PATH=\"`python3 -m site --user-base`/bin:\$PATH\"" >> ~/.bashrc
source ~/.bashrc
Add to PATH for ZSH
echo "export PATH=\"`python3 -m site --user-base`/bin:\$PATH\"" >> ~/.zshrc
source ~/.zshrc
Usage examples
ptinsearcher -u https://www.example.com/
ptinsearcher -u https://www.example.com/ --extract E # Dump emails
ptinsearcher -u https://www.example.com/ --extract UQX # Dump internal URLs, internal URLs w/ parameters, external URLs
ptinsearcher -f url_list.txt
ptinsearcher -f url_list.txt --grouping
ptinsearcher -f url_list.txt --grouping-complete
Options
-u --url <url> Test URL
-f --file <file> Load URL list from file
-d --domain <domain> Domain - merge domain with filepath. Use when wordlist contains filepaths (e.g. /index.php)
-e --extract <extract> Set specific data to extract:
A Extract All (extract everything; default option)
E Extract Emails
S Extract Subdomains
C Extract Comments
F Extract Forms
I Extract IP addresses
P Extract Phone numbers
U Extract Internal urls
Q Extract Internal urls with parameters
X Extract External urls
M Extract Metadata
-g --grouping Group findings from multiple sources into one table
-gc --grouping-complete Group and merge findings from multiple sources into one result
-gp --group-parameters Group URL parameters
-wp --without-parameters Without URL parameters
-op --output-parts Save each extract-type to separate file
-o --output <output> Save output to file
-p --proxy <proxy> Set proxy (e.g. http://127.0.0.1:8080)
-T --timeout <timeout> Set timeout
-c --cookie <cookie=value> Set cookie
-ua --user-agent <user-agent> Set User-Agent
-H --headers <header:value> Set custom header(s)
-r --redirects Follow redirects (default False)
-C --cache Cache requests (load from tmp in future)
-v --version Show script version and exit
-h --help Show this help message and exit
-j --json Output in JSON format
Extract arguments
Specify which data to extract from source
A - grab all (default)
E - Emails
S - Subdomains
C - Comments
F - Forms
I - IP addresses
U - Internal URLs
Q - Internal URLs with parameters
X - External URLs
P - Phone numbers
M - Metadata
Dependencies
We use ExifTool to extract metadata.
requests
bs4
lxml
pyexiftool
tldextract
magic
ptlibs
Version History
1.0.3 - 1.0.4
- <file> error fix
- content-type error fix
1.0.1 - 1.0.2
- tldextract version fix
1.0.0
- Script logically divided into relevant submodules
- Code refactorization
- Updated for latest ptlibs
0.0.5 - 0.0.7
- Improved stability
- Updated help message
- Replaced extract parameter for comment extraction from 'H' to 'C'
- Fixed grouping, spacing, json output
0.0.1 - 0.0.4
- Alpha releases
License
Copyright (c) 2023 Penterep Security s.r.o.
ptinsearcher is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
ptinsearcher is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with ptinsearcher. If not, see https://www.gnu.org/licenses/.
Warning
You are only allowed to run the tool against the websites which you have been given permission to pentest. We do not accept any responsibility for any damage/harm that this application causes to your computer, or your network. Penterep is not responsible for any illegal or malicious use of this code. Be Ethical!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ptinsearcher-1.0.4.tar.gz
.
File metadata
- Download URL: ptinsearcher-1.0.4.tar.gz
- Upload date:
- Size: 5.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1f12599103feae13bc701325aff3582212c9617382cd13b7a2edf7426584077c |
|
MD5 | 4a938125405477e1115fe1d008eb62aa |
|
BLAKE2b-256 | d7ffc29d637fdeee43840cad9182e3d3644ba383a941e8d470ea3b59a6edb878 |
File details
Details for the file ptinsearcher-1.0.4-py3-none-any.whl
.
File metadata
- Download URL: ptinsearcher-1.0.4-py3-none-any.whl
- Upload date:
- Size: 6.0 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 549264b981cf52b10ca0d59b4c1efbf001d6998188cc435fe0013befc05e7c61 |
|
MD5 | 82e7ad4cc12654d4543584df3630bdb3 |
|
BLAKE2b-256 | 906337ff4cd1e61a2bb24e59769ef1a8bf44787f0a747bf7f6f0b57d6579bcdd |