Linkie looks through files for broken links using Python 3.
Project description
Linkie
Linkie looks through files for broken links using Python 3.5+
Usage
Linkie will search all files within the directory it’s run and any subdirectories, and requires a simple YAML config file to run. You can then run Linkie from the command line.
linkie
You can also pass Linkie a YAML file of configuration values (for example linkie linkie.yaml). The YAML file can contain the following optional settings:
exclude-directories - Any directories listed will be ignored, these are relative to the directory Linkie is run from.
file-types - The file extensions to search for URLs.
skip-urls - URLs to skip checking.
Example configuration file (these are the default values Linkie uses):
exclude-directories:
- .git/
- docs/build/
file-types:
- html
- md
- rst
- txt
Linkie can also be used within Python:
import linkie
checker = linkie.Linkie() # Creates a linkie object.
result = checker.run() # Returns 1 if broken links found, otherwise 0.
You can pass a dictionary of settings directly using Python:
import linkie
settings = {"file-types": ["md", "rst"]}
checker = linkie.Linkie(config=settings)
You can also use a config file within Python:
import linkie
checker = linkie.Linkie(config_file_path='linkie.yaml')
You can also access the following attributes from the linkie after it’s run:
linkie.urls # Dictionary of processed URLs and their data.
linkie.status_counts # Dictionary of status codes and their counts.
linkie.file_count # Number of files processed.
License
Linkie is licensed under the MIT License. Read the license file for more details.
Changelog
1.4.1
Add logic to delay Linkie requesting from a domain if it responds with code 429.
Reorganise logging output so that the URL is printed last.
Update dependencies.
1.4.0
Update to only check links prefixed by one of [=”, (, <, ‘ ‘(a space)].
Linkie now finds all unique links at once, then uses multithreading to check them all.
Linkie now rechecks links that had a ConnectionError, as these are often valid.
Broken links in the SUMMARY are now also displayed with their status code.
1.3.1
Update logging configuration.
Update dependencies.
1.3.0
Set User-Agent to emulate browser viewing.
Use Python logging module.
1.2.0
Allow passing of variable of config settings in Python.
1.1.1
Update method for URLs with brackets.
1.1.0
Allow adding URLs to skip to configuration file.
Skip checking URLs that have already been checked.
Show connection error names instead of 999 status.
Uses class based object allowing user to retrieve values after running.
1.0.0
Initial linkie release.
FAQ
Why was this created as a Python package?
We required a script to check our repositories for broken links. This tool was initially written in Python, and a published Python package makes it easy for repositories to use this tool, in combination with pyup notifying if the package is updated.
Can you update linkie to support this specific URL?
Probably not. This script was initally created as an internal tool so we are not actively developing and supporting it compared to our other repositories. However we have published it freely under the MIT License to allow you to copy and modify linkie as you wish.
Will you get around to writing proper documentation?
Maybe. This script was initally created as an internal tool so doesn’t have the same level of polish as other projects we create. If we have more time down the road, we may spend more time developing linkie.
How do I install the development version as local package?
$ git clone https://github.com/uccser/linkie.git
$ cd linkie
$ pip3 install .
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file linkie-1.4.1.tar.gz
.
File metadata
- Download URL: linkie-1.4.1.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.5.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 19c73ac52552a05c15805834a65e566976500fc2bb34c324c725f3ad0af0160b |
|
MD5 | facd11e2b5e6d1c60913ed223f65b31a |
|
BLAKE2b-256 | adf6250dccd4167ecd4c947db503c73d94adba8bd0f8c69bf73123a2952aa1ca |
File details
Details for the file linkie-1.4.1-py3-none-any.whl
.
File metadata
- Download URL: linkie-1.4.1-py3-none-any.whl
- Upload date:
- Size: 8.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.5.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d567c575e6b3170131d9440dc323a0a8fed0b56fd1e971f2d658eedc2b476c66 |
|
MD5 | bbd74fcd3e094f1f7aa839d737459774 |
|
BLAKE2b-256 | 2ea9acd4f91b3fc7ef49d06f3669e8633e5d19c14e2efa3db832fa62078eba2b |