Website information extractor library

Grabio 🔍

Grabio is a fast and simple Python library to extract detailed information from websites.

It can retrieve IP addresses, SSL details, WHOIS info, server type, page metadata, emails, forms, and more, all with multi-threading for speed and efficiency.
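
The README does not show how Grabio parallelises its lookups. As a rough sketch of the general idea, independent checks can be handed to a thread pool so that slow network calls overlap instead of running one after another. The names `run_lookups` and `resolve_ip` are hypothetical and are not part of Grabio's API.

```python
import socket
from concurrent.futures import ThreadPoolExecutor


def resolve_ip(host):
    """Resolve a hostname to an IPv4 address (performs a DNS query)."""
    return socket.gethostbyname(host)


def run_lookups(lookups, max_workers=8):
    """Run independent zero-argument callables concurrently.

    `lookups` maps a label to a callable. The result maps each label to
    the callable's return value, or to an "error: ..." string if it raised.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit everything first so the calls overlap in time.
        futures = {name: pool.submit(fn) for name, fn in lookups.items()}
        for name, future in futures.items():
            try:
                results[name] = future.result()
            except Exception as exc:  # one failed lookup should not sink the rest
                results[name] = f"error: {exc}"
    return results
```

A call such as `run_lookups({"ip": lambda: resolve_ip("example.com")})` would return a dict with one entry per label. Threads suit this workload because the checks spend most of their time waiting on the network.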


🚀 Features

  • ✅ Extract IP address
  • ✅ WHOIS information
  • ✅ SSL certificate details
  • ✅ Server type detection
  • ✅ Host location (Country, Region, City, Organization)
  • ✅ Website response time
  • ✅ Security headers extraction
  • ✅ CDN detection
  • ✅ Redirect detection
  • ✅ Extract all image URLs
  • ✅ Extract emails from the page
  • ✅ Count forms on the page
  • ✅ Meta description & keywords
  • ✅ Page size in KB
  • ✅ Robots.txt detection
  • ✅ Multi-threading for fast data retrieval
  • ✅ CLI Support
  • ✅ User-Agent randomization
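
To illustrate what the page-level features (image URLs, emails, form count, meta tags, page size) involve, here is a minimal sketch using only the standard library. It is not Grabio's implementation: the README lists beautifulsoup4 as a dependency, so the library presumably parses pages differently. `PageSummary`, `summarize_page`, and the email pattern are assumptions made for this example.

```python
import re
from html.parser import HTMLParser

# Deliberately simple pattern; real-world email matching is messier.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class PageSummary(HTMLParser):
    """Collect image URLs, a form count, and meta description/keywords."""

    def __init__(self):
        super().__init__()
        self.images = []
        self.form_count = 0
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])
        elif tag == "form":
            self.form_count += 1
        elif tag == "meta":
            name = (attrs.get("name") or "").lower()
            if name in ("description", "keywords"):
                self.meta[name] = attrs.get("content") or ""


def summarize_page(html):
    """Return a dict of page-level facts extracted from an HTML string."""
    parser = PageSummary()
    parser.feed(html)
    parser.close()
    return {
        "images": parser.images,
        "emails": sorted(set(EMAIL_RE.findall(html))),  # de-duplicated
        "forms": parser.form_count,
        "meta": parser.meta,
        "size_kb": round(len(html.encode("utf-8")) / 1024, 2),
    }
```

The function takes HTML that has already been downloaded, so fetching and parsing stay separate and the parsing step can be checked without network access.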

📦 Installation

pip install grabio

Or clone the repository:

git clone https://github.com/xr-xr4/grabio.git
cd grabio
pip install -r requirements.txt


---

🛠️ Usage (Python)

from grabio import Grabio

# Initialize
g = Grabio('https://example.com')

# Extract specific info
print("IP Address:", g.ip())
print("WHOIS Info:", g.whois_info())
print("SSL Info:", g.ssl_info())

# Extract full info
full_info = g.info()
print(full_info)

# Save info as JSON
g.show_info()
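
The README does not say where or how the JSON is written. If you want control over the output file, one option is to serialise the result yourself. The sketch below assumes `g.info()` returns a dict, which the README does not confirm; `to_json` and `save_info` are hypothetical helpers, not Grabio functions.

```python
import json
from pathlib import Path


def to_json(info):
    """Serialise a result dict to pretty-printed JSON text.

    default=str converts values json cannot encode natively (for example
    datetime objects, which WHOIS libraries often return) to strings.
    """
    return json.dumps(info, indent=2, ensure_ascii=False, default=str)


def save_info(info, path="grabio_report.json"):
    """Write the serialised dict to `path` as UTF-8 and return the Path."""
    target = Path(path)
    target.write_text(to_json(info), encoding="utf-8")
    return target
```

Under that assumption, `save_info(g.info(), "example.json")` would write the full report to a file of your choosing.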


---

🖥️ CLI Usage

python cli.py --url example.com --info       # Full information
python cli.py --url example.com --ip         # Only IP address
python cli.py --url example.com --whois      # Only WHOIS info
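
For readers curious how flags like these are typically wired up, here is a minimal `argparse` sketch that accepts the same three options. It is an illustration only and not the contents of Grabio's `cli.py`; `build_parser` and `normalize_url` are hypothetical names, and adding a scheme to a bare host is an assumption based on the Python example passing a full `https://` URL.

```python
import argparse


def build_parser():
    """Build a parser accepting --url plus the flags shown above."""
    parser = argparse.ArgumentParser(
        prog="cli.py", description="Website information extractor"
    )
    parser.add_argument("--url", required=True, help="target site, e.g. example.com")
    parser.add_argument("--info", action="store_true", help="full information")
    parser.add_argument("--ip", action="store_true", help="only the IP address")
    parser.add_argument("--whois", action="store_true", help="only WHOIS info")
    return parser


def normalize_url(url):
    """Prefix https:// when the user passes a bare host such as example.com."""
    if url.startswith(("http://", "https://")):
        return url
    return "https://" + url
```

Parsing `["--url", "example.com", "--ip"]` with this parser yields a namespace whose `ip` attribute is true and whose other flags are false.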


---

🗂️ Project Structure

grabio/
├── grabio/
│   ├── __init__.py
│   ├── extractor.py
│   ├── utils.py
│   └── user_agent.py
│
├── cli.py
├── setup.py
├── requirements.txt
├── README.md
└── .gitignore


---

✅ Requirements

  • Python 3.6+
  • requests
  • beautifulsoup4
  • user_agent
  • python-whois


Install all requirements:

pip install -r requirements.txt


---

📄 License

This project is licensed under the MIT License.


---

🙋 Author

Ahmed Saoud

GitHub: @xr-xr4

Telegram: @xr_xr4

Telegram Channel: @xr_xr4_dev



---

💬 Contributions

Contributions are welcome!
Feel free to fork the repo and submit pull requests 🚀.

---

Source Distribution

grabio-1.1.1.tar.gz (4.5 kB)

Uploaded Source

Built Distribution

grabio-1.1.1-py3-none-any.whl (5.0 kB)

Uploaded Python 3
