Skip to main content

Deadlink Check Utility using Python Modules

Project description

Description

<Work In progress>

This is a tool that can be used for Analyzing and identifying the Deadlink in a website. This can come handy during the regular website maintenance.

The contents uploaded here is only the Alpha content. I’ve uploaded it here to get some feedback and suggestion as to how I can make this more efficient.

Any suggestion / comment / feedback / request is welcome. Please drop any of these messages using the following link.

https://www.assembla.com/spaces/deadcheck/tickets

Installation

pip install lxml
pip install deadcheck

or

pip install lxml
git clone git@github.com:harshanarayana/deadcheck.git
python setup.py install

Usage

prompt>python run.py -url <baseURLToAnalyze> [-proxy <proxyURL>:<prompt> -username <userName for Protected Page>
-password <Password to access Protected Page> -auth_base <Super URL for Authentication> -log <logFile>
-exempt <ExceptionURL File> -depth <int, default = 1> -v <default = True> -out <OutputDir>]

Pending Implementation

  • Report Generation. ( A custom Package / Using liches ) : DONE
  • Handling the Processing of Links using Multiple Threads. ( To reduce processing time )
  • Regex support to Exception links.
  • Additional Log / Debug Option Support.
  • GUI

Change Log

v0.0.1:

  • Initial Draft
  • Analysis method and URLLinks class were combined together.
  • Report Information Extraction was manual.

v0.0.2:

  • Second Draft
  • Code Re-structured. Independent classes created for URLLinks, handling Custom Error and Other necessary informaiton.
  • URLLinks
    • Custom Class for Storing URL information.
    • info() method for accessing the URLLink information in the form of a string to display / Report.
    • Additional parameters included for Reporting.
      • File Size
      • Dowload Time
      • Check Time
      • Last Modified.
      • Status Information ( With Error and / or other information as applicable )
  • ErrorCodes
    • Custom class created for Storing and retrieving HTTPError codes with suitable messages.
  • ErrorHandler
    • Custom Error handler class for Handling Custom errors. Additional items will be added to this during the course of development.
  • deadcheck
    • Main Package now containins a class Deadcheck which handles all the operations.
    • User the Deadcheck class to perform any operations.

v0.0.3:

  • Third Draft
  • Report Generation Modules Included.
  • Class documentations updated. (2013-12-17)

v0.0.3.1:

  • Low-level URL extraction from ‘javascript’ type hrefs.
  • Fix for UnicodeEncodeError.
  • Minor Cleanup / change for the reports.

Project details


Release history Release notifications

History Node

0.0.3.3.1

History Node

0.0.3.2

This version
History Node

0.0.3.1

History Node

0.0.3

History Node

0.0.2

History Node

0.0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
deadcheck-0.0.3.1.win-amd64.exe (245.7 kB) Copy SHA256 hash SHA256 Windows Installer any Mar 27, 2014
deadcheck-0.0.3.1.zip (23.3 kB) Copy SHA256 hash SHA256 Source None Mar 27, 2014

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page