Skip to main content

Linkie looks through files for broken links using Python 3.

Project description

Linkie logo

Linkie

Linkie looks through files for broken links using Python 3.5+

Build Status

Usage

Linkie will search all files within the directory it’s run and any subdirectories, and requires a simple YAML config file to run. You can then run Linkie from the command line.

linkie

You can also pass Linkie a YAML file of configuration values (for example linkie linkie.yaml). The YAML file can contain the following optional settings:

  1. exclude-directories - Any directories listed will be ignored, these are relative to the directory Linkie is run from.

  2. file-types - The file extensions to search for URLs.

  3. skip-urls - URLs to skip checking.

Example configuration file (these are the default values Linkie uses):

exclude-directories:
  - .git/
  - docs/build/

file-types:
  - html
  - md
  - rst
  - txt

Linkie can also be used within Python:

import linkie
checker = linkie.Linkie()  # Creates a linkie object.
result = checker.run()     # Returns 1 if broken links found, otherwise 0.

You can pass a dictionary of settings directly using Python:

import linkie
settings = {"file-types": ["md", "rst"]}
checker = linkie.Linkie(config=settings)

You can also use a config file within Python:

import linkie
checker = linkie.Linkie(config_file_path='linkie.yaml')

You can also access the following attributes from the linkie after it’s run:

linkie.urls  # Dictionary of processed URLs and their data.
linkie.status_counts  # Dictionary of status codes and their counts.
linkie.file_count     # Number of files processed.

License

Linkie is licensed under the MIT License. Read the license file for more details.

Changelog

1.4.1

  • Add logic to delay Linkie requesting from a domain if it responds with code 429.

  • Reorganise logging output so that the URL is printed last.

  • Update dependencies.

1.4.0

  • Update to only check links prefixed by one of [=”, (, <, ‘ ‘(a space)].

  • Linkie now finds all unique links at once, then uses multithreading to check them all.

  • Linkie now rechecks links that had a ConnectionError, as these are often valid.

  • Broken links in the SUMMARY are now also displayed with their status code.

1.3.1

  • Update logging configuration.

  • Update dependencies.

1.3.0

  • Set User-Agent to emulate browser viewing.

  • Use Python logging module.

1.2.0

  • Allow passing of variable of config settings in Python.

1.1.1

  • Update method for URLs with brackets.

1.1.0

  • Allow adding URLs to skip to configuration file.

  • Skip checking URLs that have already been checked.

  • Show connection error names instead of 999 status.

  • Uses class based object allowing user to retrieve values after running.

1.0.0

  • Initial linkie release.

FAQ

Why was this created as a Python package?

We required a script to check our repositories for broken links. This tool was initially written in Python, and a published Python package makes it easy for repositories to use this tool, in combination with pyup notifying if the package is updated.

Can you update linkie to support this specific URL?

Probably not. This script was initally created as an internal tool so we are not actively developing and supporting it compared to our other repositories. However we have published it freely under the MIT License to allow you to copy and modify linkie as you wish.

Will you get around to writing proper documentation?

Maybe. This script was initally created as an internal tool so doesn’t have the same level of polish as other projects we create. If we have more time down the road, we may spend more time developing linkie.

How do I install the development version as local package?

  1. $ git clone https://github.com/uccser/linkie.git

  2. $ cd linkie

  3. $ pip3 install .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

linkie-1.4.1.tar.gz (7.3 kB view details)

Uploaded Source

Built Distribution

linkie-1.4.1-py3-none-any.whl (8.3 kB view details)

Uploaded Python 3

File details

Details for the file linkie-1.4.1.tar.gz.

File metadata

  • Download URL: linkie-1.4.1.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.5.2

File hashes

Hashes for linkie-1.4.1.tar.gz
Algorithm Hash digest
SHA256 19c73ac52552a05c15805834a65e566976500fc2bb34c324c725f3ad0af0160b
MD5 facd11e2b5e6d1c60913ed223f65b31a
BLAKE2b-256 adf6250dccd4167ecd4c947db503c73d94adba8bd0f8c69bf73123a2952aa1ca

See more details on using hashes here.

File details

Details for the file linkie-1.4.1-py3-none-any.whl.

File metadata

  • Download URL: linkie-1.4.1-py3-none-any.whl
  • Upload date:
  • Size: 8.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.5.2

File hashes

Hashes for linkie-1.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d567c575e6b3170131d9440dc323a0a8fed0b56fd1e971f2d658eedc2b476c66
MD5 bbd74fcd3e094f1f7aa839d737459774
BLAKE2b-256 2ea9acd4f91b3fc7ef49d06f3669e8633e5d19c14e2efa3db832fa62078eba2b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page