
Website information extractor library

Grabio 🔍

Grabio is a fast and simple Python library to extract detailed information from websites.

It can retrieve IP addresses, SSL details, WHOIS info, server type, page metadata, emails, forms, and more, and it runs these lookups in multiple threads for speed.
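
As an illustration of the multi-threading idea, here is a minimal sketch using only the standard library. It is not Grabio's actual implementation: `run_lookups` and its parameters are hypothetical names, and the sketch assumes each lookup is an independent callable that takes no arguments.

```python
from concurrent.futures import ThreadPoolExecutor


def run_lookups(lookups, max_workers=8):
    """Run independent lookup callables in parallel threads.

    `lookups` maps a label to a zero-argument callable (hypothetical shape).
    A failing lookup stores its error text instead of aborting the batch.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit everything first so the lookups overlap in time.
        futures = {name: pool.submit(fn) for name, fn in lookups.items()}
        for name, future in futures.items():
            try:
                results[name] = future.result()
            except Exception as exc:
                results[name] = f"error: {exc}"
    return results
```

A caller could pass something like `{"ip": lambda: socket.gethostbyname("example.com")}` next to other network-bound lookups. Threads suit this workload because most of the time is spent waiting on the network.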


🚀 Features

  • ✅ Extract IP address
  • ✅ WHOIS information
  • ✅ SSL certificate details
  • ✅ Server type detection
  • ✅ Host location (Country, Region, City, Organization)
  • ✅ Website response time
  • ✅ Security headers extraction
  • ✅ CDN detection
  • ✅ Redirect detection
  • ✅ Extract all image URLs
  • ✅ Extract emails from the page
  • ✅ Count forms on the page
  • ✅ Meta description & keywords
  • ✅ Page size in KB
  • ✅ Robots.txt detection
  • ✅ Multi-threading for fast data retrieval
  • ✅ CLI Support
  • ✅ User-Agent randomization
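
To make the page-level features above more concrete (image URLs, emails, form count, meta tags, page size), here is a rough sketch of how such values can be pulled from an HTML string with the standard library alone. Grabio itself lists beautifulsoup4 as a requirement and may work quite differently; `PageScanner`, `scan_page`, and the email pattern are assumptions made for this example.

```python
import re
from html.parser import HTMLParser

# Deliberately simple pattern; real-world address matching is messier.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class PageScanner(HTMLParser):
    """Collect image URLs, the form count, and meta description/keywords."""

    def __init__(self):
        super().__init__()
        self.images = []
        self.form_count = 0
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])
        elif tag == "form":
            self.form_count += 1
        elif tag == "meta":
            name = (attrs.get("name") or "").lower()
            if name in ("description", "keywords"):
                self.meta[name] = attrs.get("content") or ""


def scan_page(html):
    """Return a dict of page-level facts extracted from an HTML string."""
    scanner = PageScanner()
    scanner.feed(html)
    scanner.close()
    return {
        "images": scanner.images,
        "emails": sorted(set(EMAIL_RE.findall(html))),
        "forms": scanner.form_count,
        "meta": scanner.meta,
        "size_kb": round(len(html.encode("utf-8")) / 1024, 2),
    }
```

The function takes HTML that has already been downloaded, so it can be tried on a saved page without any network access.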

📦 Installation

pip install grabio

Or clone the repository:

git clone https://github.com/xr-xr4/grabio.git
cd grabio
pip install -r requirements.txt


---

🛠️ Usage (Python)

from grabio import Grabio

# Initialize
g = Grabio('https://example.com')

# Extract specific info
print("IP Address:", g.ip())
print("WHOIS Info:", g.whois_info())
print("SSL Info:", g.ssl_info())

# Extract full info
full_info = g.info()
print(full_info)

# Save info as JSON
g.show_info()
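
The README does not show what `info()` returns or where `show_info()` writes its output. Assuming the result is a plain dictionary, a hypothetical helper like the one below (not part of Grabio) could write it to a file of your choosing. `default=str` is there because WHOIS data often contains date objects that `json` cannot serialize directly.

```python
import json
from pathlib import Path


def save_info(info, path):
    """Write a dict of results to `path` as pretty-printed UTF-8 JSON.

    Hypothetical helper: values json cannot encode (e.g. dates) become strings.
    """
    text = json.dumps(info, indent=2, ensure_ascii=False, default=str)
    target = Path(path)
    target.write_text(text, encoding="utf-8")
    return target
```

Under that assumption, `save_info(g.info(), "example.json")` would store the full report next to your script.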


---

🖥️ CLI Usage

python cli.py --url example.com --info       # Full information
python cli.py --url example.com --ip         # Only IP address
python cli.py --url example.com --whois      # Only WHOIS info
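
For readers curious how flags like these are usually wired up, here is a small `argparse` sketch. It mirrors only the three flags shown above and is not the contents of Grabio's `cli.py`; `build_parser` and `normalize_url` are hypothetical names. The scheme handling is also an assumption, included because the CLI examples pass a bare host while the Python example passes a full URL.

```python
import argparse


def build_parser():
    """Build a parser for the --url, --info, --ip and --whois flags."""
    parser = argparse.ArgumentParser(description="Website information extractor")
    parser.add_argument("--url", required=True, help="target site, e.g. example.com")
    parser.add_argument("--info", action="store_true", help="print full information")
    parser.add_argument("--ip", action="store_true", help="print only the IP address")
    parser.add_argument("--whois", action="store_true", help="print only WHOIS info")
    return parser


def normalize_url(url):
    """Add a scheme when the user passes a bare host such as example.com."""
    return url if "://" in url else "https://" + url
```

Calling `build_parser().parse_args()` inside a script would then give an object whose `ip`, `whois`, and `info` attributes say which report to print.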


---

🗂️ Project Structure

grabio/
├── grabio/
│   ├── __init__.py
│   ├── extractor.py
│   ├── utils.py
│   └── user_agent.py
│
├── cli.py
├── setup.py
├── requirements.txt
├── README.md
└── .gitignore


---

✅ Requirements

  • Python 3.6+
  • requests
  • beautifulsoup4
  • user_agent
  • python-whois


Install all requirements:

pip install -r requirements.txt


---

📄 License

This project is licensed under the MIT License.


---

🙋 Author

Ahmed Saoud

  • GitHub: @xr-xr4
  • Telegram: @xr_xr4
  • Telegram Channel: @xr_xr4_dev



---

💬 Contributions

Contributions are welcome!
Feel free to fork the repo and submit pull requests 🚀.

---

