Skip to main content

Website information extractor library

Project description

Grabio ๐Ÿ”

Grabio is a fast and simple Python library to extract detailed information from websites.

It can retrieve IP addresses, SSL details, WHOIS info, server type, page metadata, emails, forms, and more โ€” all with multi-threading for speed and efficiency.


๐Ÿš€ Features

  • โœ… Extract IP address
  • โœ… WHOIS information
  • โœ… SSL certificate details
  • โœ… Server type detection
  • โœ… Host location (Country, Region, City, Organization)
  • โœ… Website response time
  • โœ… Security headers extraction
  • โœ… CDN detection
  • โœ… Redirect detection
  • โœ… Extract all image URLs
  • โœ… Extract emails from the page
  • โœ… Count forms on the page
  • โœ… Meta description & keywords
  • โœ… Page size in KB
  • โœ… Robots.txt detection
  • โœ… Multi-threading for fast data retrieval
  • โœ… CLI Support
  • โœ… User-Agent randomization

๐Ÿ“ฆ Installation

pip install grabio

Or clone the repository:

git clone https://github.com/xr-xr4/grabio.git
cd grabio
pip install -r requirements.txt


---

๐Ÿ› ๏ธ Usage (Python)

from grabio import Grabio

# Initialize
g = Grabio('https://example.com')

# Extract specific info
print("IP Address:", g.ip())
print("WHOIS Info:", g.whois_info())
print("SSL Info:", g.ssl_info())

# Extract full info
full_info = g.info()
print(full_info)

# Save info as JSON
g.show_info()


---

๐Ÿ–ฅ๏ธ CLI Usage

python cli.py --url example.com --info       # Full information
python cli.py --url example.com --ip         # Only IP address
python cli.py --url example.com --whois      # Only WHOIS info


---

๐Ÿ—‚๏ธ Project Structure

grabio/
โ”œโ”€โ”€ grabio/
โ”‚   โ”œโ”€โ”€ __init__.py
โ”‚   โ”œโ”€โ”€ extractor.py
โ”‚   โ”œโ”€โ”€ utils.py
โ”‚   โ””โ”€โ”€ user_agent.py
โ”‚
โ”œโ”€โ”€ cli.py
โ”œโ”€โ”€ setup.py
โ”œโ”€โ”€ requirements.txt
โ”œโ”€โ”€ README.md
โ””โ”€โ”€ .gitignore


---

โœ… Requirements

Python 3.6+

requests

beautifulsoup4

user_agent

python-whois


Install all requirements:

pip install -r requirements.txt


---

๐Ÿ“„ License

This project is licensed under the MIT License.


---

๐Ÿ™‹ Author

Ahmed Saoud

GitHub: @xr-xr4

Telegram: @xr_xr4

Telegram Channel: @xr_xr4_dev



---

๐Ÿ’ฌ Contributions

Contributions are welcome!
Feel free to fork the repo and submit pull requests ๐Ÿš€.

---

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grabio-1.0.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

grabio-1.0-py3-none-any.whl (4.9 kB view details)

Uploaded Python 3

File details

Details for the file grabio-1.0.tar.gz.

File metadata

  • Download URL: grabio-1.0.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for grabio-1.0.tar.gz
Algorithm Hash digest
SHA256 82a1f8a5baed2a315855fe251c5bb659aef2b67de66d57ab6e1314a15e3d972f
MD5 c0d3a630327b735ed643617fd4acdbe0
BLAKE2b-256 b794350e946828d22b3be10734b548da6b523141478b3d1cd328c8527a7fa226

See more details on using hashes here.

File details

Details for the file grabio-1.0-py3-none-any.whl.

File metadata

  • Download URL: grabio-1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for grabio-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 68847f479f2b8f4facccd5b9a954744450dd34ef00ae4dd808f7ae99a54c8264
MD5 f7dfb5a6bcaa5be9bb9171d3278870fd
BLAKE2b-256 7bf1958081a739e3120bec94a30e601709e192ab6f6617f42ac7333c6eb2b635

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page