Skip to main content

Deadlink Check Utility using Python Modules

Project description

Description

<Work In progress>

This is a tool that can be used for Analyzing and identifying the Deadlink in a website. This can come handy during the regular website maintenance.

The contents uploaded here is only the Alpha content. I’ve uploaded it here to get some feedback and suggestion as to how I can make this more efficient.

Any suggestion / comment / feedback / request is welcome. Please drop any of these messages using the following link.

https://www.assembla.com/spaces/deadcheck/tickets

Installation

pip install lxml
pip install deadcheck

or

pip install lxml
git clone git@github.com:harshanarayana/deadcheck.git
python setup.py install

Usage

prompt>python run.py -url <baseURLToAnalyze> [-proxy <proxyURL>:<prompt> -username <userName for Protected Page>
-password <Password to access Protected Page> -auth_base <Super URL for Authentication> -log <logFile>
-exempt <ExceptionURL File> -depth <int, default = 1> -v <default = True> -out <OutputDir>]

Using DeadcheckAPI

from deadcheck.deadcheck import DeadcheckAPI
checker = DeadcheckAPI([proxyURL, userName, password, auth_base_url])
urlObj = checker.amIDead(urlToCheck)
print urlObj.info()
children = urlObj.getChildren()

Pending Implementation

  • Report Generation. ( A custom Package / Using liches ) : DONE

  • Handling the Processing of Links using Multiple Threads. ( To reduce processing time )

  • Regex support to Exception links. DONE

  • Additional Log / Debug Option Support. DONE

  • GUI (Work In Progress)

  • Custom API to facilitate the validation of Minor Links or Single Links. DONE

Change Log

v0.0.1:

  • Initial Draft

  • Analysis method and URLLinks class were combined together.

  • Report Information Extraction was manual.

v0.0.2:

  • Second Draft

  • Code Re-structured. Independent classes created for URLLinks, handling Custom Error and Other necessary informaiton.

  • URLLinks

    • Custom Class for Storing URL information.

    • info() method for accessing the URLLink information in the form of a string to display / Report.

    • Additional parameters included for Reporting.

      • File Size

      • Dowload Time

      • Check Time

      • Last Modified.

      • Status Information ( With Error and / or other information as applicable )

  • ErrorCodes

    • Custom class created for Storing and retrieving HTTPError codes with suitable messages.

  • ErrorHandler

    • Custom Error handler class for Handling Custom errors. Additional items will be added to this during the course of development.

  • deadcheck

    • Main Package now containins a class Deadcheck which handles all the operations.

    • User the Deadcheck class to perform any operations.

v0.0.3:

  • Third Draft

  • Report Generation Modules Included.

  • Class documentations updated. (2013-12-17)

v0.0.3.1:

  • Low-level URL extraction from ‘javascript’ type hrefs.

  • Fix for UnicodeEncodeError.

  • Minor Cleanup / change for the reports.

v0.0.3.2:

  • Fix for AttributeError in __cleanupJavaScript Function ( Bug #3 )

  • Minor Modification for RegExp used in Exceptions file.

v0.0.3.3:

  • Addition of new Class ‘DeadcheckAPI’ that provides the user with a small scare function that can be used to analyze a single URL in the CLI

  • The Return value of this function call will be URLLinks Object and hence you can iterate through the Child Links and call the function amIDead manually to analyze ‘n’ number of links.

  • Custom Test Suite has been included to support the running of tests during Installation process.

v0.0.3.3.1:

  • Minor changes made into run.py

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deadcheck-0.0.3.3.1.zip (26.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deadcheck-0.0.3.3.1.win-amd64.exe (248.2 kB view details)

Uploaded Source

File details

Details for the file deadcheck-0.0.3.3.1.zip.

File metadata

  • Download URL: deadcheck-0.0.3.3.1.zip
  • Upload date:
  • Size: 26.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for deadcheck-0.0.3.3.1.zip
Algorithm Hash digest
SHA256 e80a64d8daa956d9aaf730705e3ea874ea4911f899c83dbc6b9f0540ff89a004
MD5 7d00113c157226b71dc59349bf42ce32
BLAKE2b-256 7238722ac1668e5ba146552f573b0823113c45dfc12ef8871d613c43aab0991a

See more details on using hashes here.

File details

Details for the file deadcheck-0.0.3.3.1.win-amd64.exe.

File metadata

File hashes

Hashes for deadcheck-0.0.3.3.1.win-amd64.exe
Algorithm Hash digest
SHA256 079386c2d5f9a5442e3219fbd17351af92a755fbfbc9ef85156bc6813e6ca905
MD5 e40c937ba63d71298e7009e11abdacc5
BLAKE2b-256 cb50c303f312ce2690d108f6997ce8d6375e2e18cc2a5a6542fde528d632947b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page