Skip to main content

CLI tool to extract inlinks and anchor text from a URL

Project description

🔗 Inlink Extractor

Inlink Extractor is a simple and lightweight Python CLI tool that extracts all internal links (<a href="">) and their associated anchor texts from a given webpage URL. It's useful for SEO audits, link analysis, and content crawling tasks.


🚀 Features

  • Extract all hyperlinks (<a> tags) from a webpage
  • Normalize and resolve relative URLs
  • Capture anchor texts
  • Export results directly to CSV
  • Simple and scriptable CLI interface

📦 Installation

Make sure you have Python 3.6+ and pip installed. Then install the tool from source:

pip install .

⚠️ You must be in the root directory of the project where setup.py is located.


🛠️ Usage

inlink-extractor <URL> [--output filename.csv]

✅ Example

inlink-extractor https://www.example.com/finance/personal-loan/

This will:

  • Crawl the given page
  • Extract all anchor links and their anchor texts
  • Save them to inlinks.csv by default

🔄 Custom Output File

inlink-extractor https://example.com --output example_links.csv

📁 Output Format (CSV)

Inlink Anchor Text
https://www.example.com/personal-loan Personal Loan
https://www.example.com/about-us About Us

💡 Use Cases

  • Internal link audits for SEO
  • Detecting orphan pages or weak linking
  • Scraping anchor text context
  • Competitive link structure analysis

📜 License

MIT License. Feel free to use, modify, and contribute.


🙌 Contributing

If you'd like to improve or add features, feel free to fork the repo and submit a pull request!


🧑‍💻 Author

Built by Amal Alexander — SEO & Python enthusiast.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inlink_extractor-0.1.0.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inlink_extractor-0.1.0-py3-none-any.whl (3.6 kB view details)

Uploaded Python 3

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page