MetaFinder - Metadata search through Search Engines
|Supported Python versions| |License|
::
_____ __ ___________ .__ .___
/ \ ____ _/ |_ _____ \_ _____/ |__| ____ __| _/ ____ _______
/ \ / \ _/ __ \ \ __\ \__ \ | __) | | / \ / __ | _/ __ \ \_ __ \
/ Y \ \ ___/ | | / __ \_ | \ | | | | \ / /_/ | \ ___/ | | \/
\____|__ / \___ > |__| (____ / \___ / |__| |___| / \____ | \___ > |__|
\/ \/ \/ \/ \/ \/ \/
|_ Author: @JosueEncinar
|_ Description: Search for documents in a domain through Search Engines. The objective is to extract metadata
|_ Usage: metafinder -d domain.com -l 50 -o /tmp -go -bi
Installation:
::

    > pip3 install metafinder

Upgrades are also available using:
::

    > pip3 install metafinder --upgrade

Usage
CLI
::

    metafinder -d domain.com -l 20 -o folder [-t 10] -go -bi -ba

Parameters:
- d: Specifies the target domain.
- l: Specifies the maximum number of results to search for.
- o: Specifies the path where the report is saved.
- t: Optional. Sets the number of threads (4 by default).
- v: Shows the MetaFinder version.
- go: Optional. Searches Google. (Default)
- bi: Optional. Searches Bing.
- ba: Optional. Searches Baidu. (Experimental)
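
When MetaFinder is driven from a script, the flags listed above can be assembled programmatically. The following is a minimal sketch and not part of MetaFinder itself: ``build_metafinder_command`` and the ``ENGINE_FLAGS`` mapping are hypothetical names introduced here for illustration, and only the flags documented above are used. It assumes the ``metafinder`` executable is available on the PATH.

```python
from typing import List, Optional, Sequence

# Search-engine flags taken from the parameter list above.
# The keys ("google", "bing", "baidu") are names chosen for this sketch.
ENGINE_FLAGS = {"google": "-go", "bing": "-bi", "baidu": "-ba"}


def build_metafinder_command(
    domain: str,
    limit: int,
    output_dir: str,
    threads: Optional[int] = None,
    engines: Sequence[str] = ("google",),
) -> List[str]:
    """Hypothetical helper: build the argument list for the metafinder CLI."""
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    command = ["metafinder", "-d", domain, "-l", str(limit), "-o", output_dir]
    if threads is not None:
        # -t is optional; the CLI uses 4 threads when it is omitted.
        command += ["-t", str(threads)]
    for engine in engines:
        # Raises KeyError for an engine name that is not in ENGINE_FLAGS.
        command.append(ENGINE_FLAGS[engine])
    return command


if __name__ == "__main__":
    # Mirrors the usage line: metafinder -d domain.com -l 20 -o folder -t 10 -go -bi -ba
    argv = build_metafinder_command(
        "domain.com", 20, "folder", threads=10,
        engines=("google", "bing", "baidu"),
    )
    print(" ".join(argv))
```

The returned list can be passed directly to ``subprocess.run(argv, check=True)``, which avoids shell quoting issues with domain names or paths.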
In Code
::

    import metafinder.extractor as metadata_extractor

    documents_limit = 5
    domain = "target_domain"

    # Search Google for documents on the domain and extract their metadata
    result = metadata_extractor.extract_metadata_from_google_search(domain, documents_limit)
    # result = metadata_extractor.extract_metadata_from_bing_search(domain, documents_limit)
    # result = metadata_extractor.extract_metadata_from_baidu_search(domain, documents_limit)

    authors = result.get_authors()
    software = result.get_software()

    # Print the URL and metadata of every document found
    for k, v in result.get_metadata().items():
        print(f"{k}:")
        print(f"|_ URL: {v['url']}")
        for metadata, value in v['metadata'].items():
            print(f"|__ {metadata}: {value}")

    # Extract metadata from a single local document
    document_name = "test.pdf"
    try:
        metadata_file = metadata_extractor.extract_metadata_from_document(document_name)
        for k, v in metadata_file.items():
            print(f"{k}: {v}")
    except FileNotFoundError:
        print("File not found")

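
In the example above, ``get_metadata()`` returns a dictionary that maps each document to its ``url`` and ``metadata``. As a sketch of how that structure could be post-processed, the hypothetical helper below counts how often each value of a single metadata field appears across all documents. The helper name, the sample data, and the field name ``Author`` are assumptions made for illustration; the real field names depend on the documents that are found.

```python
from collections import Counter
from typing import Any, Dict


def count_field_values(documents: Dict[str, Dict[str, Any]], field: str) -> Counter:
    """Hypothetical helper: count the values of one metadata field."""
    counts: Counter = Counter()
    for info in documents.values():
        value = info.get("metadata", {}).get(field)
        if value:  # skip documents where the field is missing or empty
            counts[str(value)] += 1
    return counts


if __name__ == "__main__":
    # Invented sample data with the same shape as result.get_metadata()
    sample = {
        "report.pdf": {"url": "https://example.com/report.pdf",
                       "metadata": {"Author": "alice", "Producer": "ToolA"}},
        "slides.pdf": {"url": "https://example.com/slides.pdf",
                       "metadata": {"Author": "bob"}},
        "notes.pdf": {"url": "https://example.com/notes.pdf",
                      "metadata": {"Author": "alice"}},
    }
    for value, total in count_field_values(sample, "Author").most_common():
        print(f"{value}: {total}")
```

With real results, ``count_field_values(result.get_metadata(), field)`` would be the equivalent call, assuming the dictionary has the shape shown in the loop above.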
Author
This project has been developed by:
- Josué Encinar García -- https://twitter.com/JosueEncinar
Contributors
- Félix Brezo Fernández -- https://twitter.com/febrezo
Disclaimer!
The software is designed to help ensure that the documents we upload to a domain leave no trace in their metadata. The author is not responsible for any illegitimate use.
.. |Supported Python versions| image:: https://img.shields.io/badge/python-3.6+-blue.svg?style=flat-square&logo=python
.. |License| image:: https://img.shields.io/badge/license-GNU-green.svg?style=flat-square&logo=gnu