Skip to main content

Website information extractor library

Project description

Grabio ๐Ÿ”

Grabio is a fast and simple Python library to extract detailed information from websites.

It can retrieve IP addresses, SSL details, WHOIS info, server type, page metadata, emails, forms, and more โ€” all with multi-threading for speed and efficiency.


๐Ÿš€ Features

  • โœ… Extract IP address
  • โœ… WHOIS information
  • โœ… SSL certificate details
  • โœ… Server type detection
  • โœ… Host location (Country, Region, City, Organization)
  • โœ… Website response time
  • โœ… Security headers extraction
  • โœ… CDN detection
  • โœ… Redirect detection
  • โœ… Extract all image URLs
  • โœ… Extract emails from the page
  • โœ… Count forms on the page
  • โœ… Meta description & keywords
  • โœ… Page size in KB
  • โœ… Robots.txt detection
  • โœ… Multi-threading for fast data retrieval
  • โœ… CLI Support
  • โœ… User-Agent randomization

๐Ÿ“ฆ Installation

pip install grabio

Or clone the repository:

git clone https://github.com/xr-xr4/grabio.git
cd grabio
pip install -r requirements.txt


---

๐Ÿ› ๏ธ Usage (Python)

from grabio import Grabio

# Initialize
g = Grabio('https://example.com')

# Extract specific info
print("IP Address:", g.ip())
print("WHOIS Info:", g.whois_info())
print("SSL Info:", g.ssl_info())

# Extract full info
full_info = g.info()
print(full_info)

# Save info as JSON
g.show_info()


---

๐Ÿ–ฅ๏ธ CLI Usage

python cli.py --url example.com --info       # Full information
python cli.py --url example.com --ip         # Only IP address
python cli.py --url example.com --whois      # Only WHOIS info


---

๐Ÿ—‚๏ธ Project Structure

grabio/
โ”œโ”€โ”€ grabio/
โ”‚   โ”œโ”€โ”€ __init__.py
โ”‚   โ”œโ”€โ”€ extractor.py
โ”‚   โ”œโ”€โ”€ utils.py
โ”‚   โ””โ”€โ”€ user_agent.py
โ”‚
โ”œโ”€โ”€ cli.py
โ”œโ”€โ”€ setup.py
โ”œโ”€โ”€ requirements.txt
โ”œโ”€โ”€ README.md
โ””โ”€โ”€ .gitignore


---

โœ… Requirements

Python 3.6+

requests

beautifulsoup4

user_agent

python-whois


Install all requirements:

pip install -r requirements.txt


---

๐Ÿ“„ License

This project is licensed under the MIT License.


---

๐Ÿ™‹ Author

Ahmed Saoud

GitHub: @xr-xr4

Telegram: @xr_xr4

Telegram Channel: @xr_xr4_dev



---

๐Ÿ’ฌ Contributions

Contributions are welcome!
Feel free to fork the repo and submit pull requests ๐Ÿš€.

---

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grabio-1.2.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

grabio-1.2-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file grabio-1.2.tar.gz.

File metadata

  • Download URL: grabio-1.2.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for grabio-1.2.tar.gz
Algorithm Hash digest
SHA256 e4ec1f091bb3bf1d86de2c6e199b3b4f03ea8d13bc06de4073b2332228432765
MD5 4e8bcfeb94b0939de0440ad4c5cc0b91
BLAKE2b-256 e914cffafdae3c8d8ac8893b92fc1145b3d7fa47878c44acf6fce8ee0e2e31b8

See more details on using hashes here.

File details

Details for the file grabio-1.2-py3-none-any.whl.

File metadata

  • Download URL: grabio-1.2-py3-none-any.whl
  • Upload date:
  • Size: 5.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for grabio-1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0e0e472b8074658dd511049f5672f88ab00c2699c91547849054c30f4591ab04
MD5 e7fc97f0c8d144f87068c9ae922d7620
BLAKE2b-256 7646f69ce087c97094e8deccfbffec58cc0f42a3aa29239c5ae4971490b35141

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page