SecTXT: security.txt parser and validator

This package contains a security.txt (RFC 9116) parser and validator.

Independent security researchers who discover a security risk in a web service, and who understand its severity, often lack a channel to disclose it properly. As a result, security issues may go unreported. security.txt is a standard that lets organizations describe how security researchers can disclose vulnerabilities to them securely.

Installation

The package is available on PyPI and can be installed using pip:

> python -m pip install sectxt

Usage

>>> from sectxt import SecurityTXT
>>> s = SecurityTXT("www.example.com")
>>> s.is_valid()
True

Validation

>>> from sectxt import SecurityTXT
>>> s = SecurityTXT("www.example.com")
>>> s.errors
[{'code': 'no_uri', 'message': 'Field policy value must be an URI', 'line': 2}, {'code': 'no_expire', 'message': 'The Expires field is missing', 'line': None}]
>>> s.recommendations
[{'code': 'long_expiry', 'message': 'Expiry date is more than one year in the future', 'line': 3}]

The "errors", "recommendations" and "notifications" attributes each return a list of entries. An entry is a dict with three keys:

key      value
code     A fixed error code string
message  A human-readable error message in English
line     The 1-based integer line number where the error occurred, or None when the error applies to the entire file
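
As a small illustration of how these entries can be consumed, the sketch below renders each entry as one report line. The helper name format_entry is hypothetical and not part of sectxt; the sample data is copied from the example output above, so the snippet runs without network access.

```python
def format_entry(entry):
    """Render one entry dict (code, message, line) as a single report line."""
    # 'line' is None when the finding applies to the entire file.
    location = "file" if entry["line"] is None else f"line {entry['line']}"
    return f"[{entry['code']}] {location}: {entry['message']}"


# Sample data copied from the example output above; with the real library
# this list would come from the 'errors' attribute of a SecurityTXT object.
errors = [
    {"code": "no_uri", "message": "Field policy value must be an URI", "line": 2},
    {"code": "no_expire", "message": "The Expires field is missing", "line": None},
]

report = [format_entry(e) for e in errors]
for row in report:
    print(row)
```

The same helper works unchanged for the "recommendations" and "notifications" lists, since all three share the entry structure.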

Possible errors

code message
"no_security_txt" "security.txt could not be located."
"location" "security.txt was located on the top-level path (legacy place), but must be placed under the '/.well-known/' path."
"invalid_uri_scheme" "Insecure URI scheme HTTP is not allowed. The security.txt file access must use the HTTPS scheme"
"invalid_cert" "security.txt must be served with a valid TLS certificate."
"no_content_type" "HTTP Content-Type header must be sent."
"invalid_media" "Media type in Content-Type header must be 'text/plain'."
"invalid_charset" "Charset parameter in Content-Type header must be 'utf-8' if present."
"utf8" "Content must be utf-8 encoded."
"no_expire" "'Expires' field must be present."
"multi_expire" "'Expires' field must not appear more than once."
"invalid_expiry" "Date and time in 'Expires' field must be formatted according to ISO 8601."
"expired" "Date and time in 'Expires' field must not be in the past."
"no_contact" "'Contact' field must appear at least once."
"no_canonical_match" "Web URI where security.txt is located must match with a 'Canonical' field. In case of redirecting either the first or last web URI of the redirect chain must match."
"multi_lang" "'Preferred-Languages' field must not appear more than once."
"invalid_lang" "Value in 'Preferred-Languages' field must match one or more language tags as defined in RFC5646, separated by commas."
"no_uri" "Field '{field}' value must be a URI."
"no_https" "Web URI must begin with 'https://'."
"prec_ws" "There must be no whitespace before the field separator (colon)."
"no_space" "Field separator (colon) must be followed by a space."
"empty_key" "Field name must not be empty."
"empty_value" "Field value must not be empty."
"invalid_line" "Line must contain a field name and value, unless the line is blank or contains a comment."
"no_line_separators" "Every line, including the last one, must end with either a carriage return and line feed characters or just a line feed character"
"signed_format_issue" "Signed security.txt must start with the header '-----BEGIN PGP SIGNED MESSAGE-----'. "
"data_after_sig" "Signed security.txt must not contain data after the signature."
"no_csaf_file" "All CSAF fields must point to a provider-metadata.json file."
"pgp_data_error" "Signed message did not contain a correct ASCII-armored PGP block."
"pgp_error" "Decoding or parsing of the pgp message failed."
"bom_in_file" "The Byte-Order Mark was found at the start of the file. Security.txt must be encoded using UTF-8 in Net-Unicode form, the BOM signature must not appear at the beginning."
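
To make the three Content-Type rules in the table concrete ("no_content_type", "invalid_media" and "invalid_charset"), here is a minimal sketch of such a check. It is an illustration written for this README and is not how sectxt implements it; the function name content_type_errors is hypothetical and the header parsing is deliberately simple.

```python
def content_type_errors(header):
    """Return the error codes that a Content-Type header value would trigger."""
    if not header:
        # The header is missing or empty.
        return ["no_content_type"]

    # Split "text/plain; charset=utf-8" into the media type and its parameters.
    media, *params = [part.strip() for part in header.split(";")]
    errors = []
    if media.lower() != "text/plain":
        errors.append("invalid_media")

    for param in params:
        name, _, value = param.partition("=")
        # The charset parameter is optional, but must be utf-8 when present.
        if name.strip().lower() == "charset":
            if value.strip().strip('"').lower() != "utf-8":
                errors.append("invalid_charset")
    return errors


print(content_type_errors("text/plain; charset=utf-8"))
print(content_type_errors("text/html"))
```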

Possible recommendations

code message
"long_expiry" "Date and time in 'Expires' field should be less than a year into the future."
"no_encryption" "'Encryption' field should be present when 'Contact' field contains an email address."
"not_signed"[1] "security.txt should be digitally signed."
"no_canonical" "'Canonical' field should be present in a signed file."
"multiple_csaf_fields" "It is allowed to have more than one CSAF field, however this should be removed if possible."
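
The field rules above suggest what a minimal file looks like: at least one 'Contact' field, exactly one 'Expires' field less than a year ahead, a space after each colon, and a line feed at the end of every line. The sketch below generates such a file. The function name build_security_txt and the contact URI are hypothetical and chosen only for illustration; an unsigned file like this would presumably still receive the "not_signed" recommendation.

```python
from datetime import datetime, timedelta, timezone


def build_security_txt(contact_uri, valid_days=180, now=None):
    """Build minimal security.txt content with a Contact and an Expires field."""
    # Keep the expiry date in the future but less than a year ahead.
    if not 0 < valid_days < 365:
        raise ValueError("valid_days must be between 1 and 364")

    # 'now' must be timezone-aware if given; it defaults to the current UTC time.
    now = now or datetime.now(timezone.utc)
    expires = (now + timedelta(days=valid_days)).astimezone(timezone.utc)

    lines = [
        f"Contact: {contact_uri}",
        f"Expires: {expires.strftime('%Y-%m-%dT%H:%M:%SZ')}",
    ]
    # Every line, including the last one, ends with a line feed.
    return "".join(line + "\n" for line in lines)


# Hypothetical contact URI, used only for illustration.
print(build_security_txt("https://example.com/report"), end="")
```

A web URI is used for the contact here because an email contact without an 'Encryption' field would trigger the "no_encryption" recommendation.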

Possible notifications

code message
"unknown_field"[2] "security.txt contains an unknown field. Field {unknown_field} is either a custom field which may not be widely supported, or there is a typo in a standardised field name."

Security.txt scraping information

The scraper first looks for the security.txt of the given domain in the correct location, /.well-known/security.txt. It also checks the legacy top-level location and the insecure HTTP scheme; a file found there results in validation errors. To reduce the chance of the request being rejected by the server, a User-Agent header is added to the request. Its value is Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20100101 Firefox/12.0, which mimics Firefox 12 on Windows 7. If a security.txt file is found, it is then parsed, and any errors, recommendations or notifications are returned.
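
The lookup described above can be sketched with the standard library alone. This is not the scraper's actual code: the function name candidate_requests is hypothetical, the order in which the locations are tried is an assumption, and the requests are only constructed here, not sent.

```python
from urllib.request import Request

# User-Agent string quoted in the paragraph above.
USER_AGENT = (
    "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20100101 Firefox/12.0"
)


def candidate_requests(domain):
    """Build one request per location where a security.txt might be found."""
    requests = []
    # Assumed order: secure scheme first, correct path before the legacy path.
    for scheme in ("https", "http"):
        for path in ("/.well-known/security.txt", "/security.txt"):
            url = f"{scheme}://{domain}{path}"
            requests.append(Request(url, headers={"User-Agent": USER_AGENT}))
    return requests


for req in candidate_requests("www.example.com"):
    print(req.full_url)
```

Only the first URL is a fully valid location; a file found at any of the other three would, per the error table, produce "location" or "invalid_uri_scheme" errors.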

Test security.txt files locally

A local file path can be passed in place of the URL. To enable this, set the is_local parameter to True. In that case only the contents of the given file are validated.

>>> from sectxt import SecurityTXT
>>> s = SecurityTXT("/home/example/security.txt", is_local=True)

[1] The security.txt parser checks whether a digital signature is present, but it does not verify the validity of the signature.

[2] Regarding code "unknown_field": According to RFC 9116 section 2.4, any fields that are not explicitly supported must be ignored. By default, this parser nevertheless adds a notification for unknown fields. This behaviour can be turned off using the parameter recommend_unknown_fields:

>>> from sectxt import SecurityTXT
>>> s = SecurityTXT("www.example.com", recommend_unknown_fields=False)
