license-expression is a comprehensive utility library to parse, compare, simplify and normalize license expressions (such as SPDX license expressions) using boolean logic.
Project description
license-expression is a comprehensive utility library to parse, compare, simplify and normalize license expressions (such as SPDX license expressions) using boolean logic.
License: Apache-2.0
Python: 3.7+
Install: pip install license-expression also available in most Linux distro.
Software project licenses are often a combination of several free and open source software licenses. License expressions – as specified by SPDX – provide a concise and human readable way to express these licenses without having to read long license texts, while still being machine-readable.
License expressions are used by key FOSS projects such as Linux; several packages ecosystem use them to document package licensing metadata such as npm and Rubygems; they are important when exchanging software data (such as with SPDX and SBOM in general) as a way to express licensing precisely.
license-expression is a comprehensive utility library to parse, compare, simplify and normalize these license expressions (such as SPDX license expressions) using boolean logic like in: GPL-2.0-or-later WITH Classpath-exception-2.0 AND MIT.
It includes the license keys from SPDX https://spdx.org/licenses/ (version 3.22) and ScanCode license DB (version 32.0.8, last published on 2023-11-16). See https://scancode-licensedb.aboutcode.org/ to get started quickly.
license-expression is both powerful and simple to use and is a used as the license expression engine in several projects and products such as:
AboutCode-toolkit https://github.com/nexB/aboutcode-toolkit
AlekSIS (School Information System) https://edugit.org/AlekSIS/official/AlekSIS-Core
Barista https://github.com/Optum/barista
Conda forge tools https://github.com/conda-forge/conda-smithy
DejaCode https://dejacode.com
DeltaCode https://github.com/nexB/deltacode
FenixscanX https://github.com/SmartsYoung/FenixscanX
FetchCode https://github.com/nexB/fetchcode
Flict https://github.com/vinland-technology/flict and https://github.com/vinland-technology
license.sh https://github.com/webscopeio/license.sh
liferay_inbound_checker https://github.com/carmenbianca/liferay_inbound_checker
REUSE https://reuse.software/ and https://github.com/fsfe/reuse-tool
ScanCode-io https://github.com/nexB/scancode.io
ScanCode-toolkit https://github.com/nexB/scancode-toolkit
See also for details: - https://spdx.github.io/spdx-spec/appendix-IV-SPDX-license-expressions/
license-expression is also packaged for most Linux distributions. See below.
Alternative:
There is no known alternative library for Python, but there are several similar libraries in other languages (but not as powerful of course!):
JavaScript https://github.com/jslicense/spdx-expression-parse.js
Haskell https://github.com/phadej/spdx
Java https://github.com/spdx/tools and https://github.com/aschet/spdx-license-expression-tools
Build and tests status
Linux & macOS (Travis) |
Windows (AppVeyor) |
Linux, Windows & macOS (Azure) |
---|---|---|
Source code and download
Also available in several Linux distros:
Arch Linux https://archlinux.org/packages/extra/any/python-license-expression/
Debian https://packages.debian.org/unstable/source/license-expression
DragonFly BSD https://github.com/DragonFlyBSD/DPorts/tree/master/textproc/py-license-expression
Fedora https://src.fedoraproject.org/rpms/python-license-expression/
FreeBSD https://www.freshports.org/textproc/py-license-expression
openSUSE https://build.opensuse.org/package/show/openSUSE:Factory/python-license-expression
Support
Submit bugs and questions at: https://github.com/nexB/license-expression/issues
Join the chat at: https://gitter.im/aboutcode-org/discuss
Description
This module defines a mini language to parse, validate, simplify, normalize and compare license expressions using a boolean logic engine.
This supports SPDX license expressions and also accepts other license naming conventions and license identifiers aliases to resolve and normalize any license expressions.
Using boolean logic, license expressions can be tested for equality, containment, equivalence and can be normalized or simplified.
It also bundles the SPDX License list (3.20 as of now) and the ScanCode license DB (based on latest ScanCode) to easily parse and validate expressions using the license symbols.
Usage examples
The main entry point is the Licensing object that you can use to parse, validate, compare, simplify and normalize license expressions.
Create an SPDX Licensing and parse expressions:
>>> from license_expression import get_spdx_licensing >>> licensing = get_spdx_licensing() >>> expression = ' GPL-2.0 or LGPL-2.1 and mit ' >>> parsed = licensing.parse(expression) >>> print(parsed.pretty()) OR( LicenseSymbol('GPL-2.0-only'), AND( LicenseSymbol('LGPL-2.1-only'), LicenseSymbol('MIT') ) ) >>> str(parsed) 'GPL-2.0-only OR (LGPL-2.1-only AND MIT)' >>> licensing.parse('unknwon with foo', validate=True, strict=True) license_expression.ExpressionParseError: A plain license symbol cannot be used as an exception in a "WITH symbol" statement. for token: "foo" at position: 13 >>> licensing.parse('unknwon with foo', validate=True) license_expression.ExpressionError: Unknown license key(s): unknwon, foo >>> licensing.validate('foo and MIT and GPL-2.0+') ExpressionInfo( original_expression='foo and MIT and GPL-2.0+', normalized_expression=None, errors=['Unknown license key(s): foo'], invalid_symbols=['foo'] )
Create a simple Licensing and parse expressions:
>>> from license_expression import Licensing, LicenseSymbol >>> licensing = Licensing() >>> expression = ' GPL-2.0 or LGPL-2.1 and mit ' >>> parsed = licensing.parse(expression) >>> expression = ' GPL-2.0 or LGPL-2.1 and mit ' >>> expected = 'GPL-2.0-only OR (LGPL-2.1-only AND mit)' >>> assert str(parsed) == expected >>> assert parsed.render('{symbol.key}') == expected
Create a Licensing with your own license symbols:
>>> expected = [ ... LicenseSymbol('GPL-2.0'), ... LicenseSymbol('LGPL-2.1'), ... LicenseSymbol('mit') ... ] >>> assert licensing.license_symbols(expression) == expected >>> assert licensing.license_symbols(parsed) == expected >>> symbols = ['GPL-2.0+', 'Classpath', 'BSD'] >>> licensing = Licensing(symbols) >>> expression = 'GPL-2.0+ with Classpath or (bsd)' >>> parsed = licensing.parse(expression) >>> expected = 'GPL-2.0+ WITH Classpath OR BSD' >>> assert parsed.render('{symbol.key}') == expected >>> expected = [ ... LicenseSymbol('GPL-2.0+'), ... LicenseSymbol('Classpath'), ... LicenseSymbol('BSD') ... ] >>> assert licensing.license_symbols(parsed) == expected >>> assert licensing.license_symbols(expression) == expected
And expression can be deduplicated, to remove duplicate license subexpressions without changing the order and without consider license choices as simplifiable:
>>> expression2 = ' GPL-2.0 or (mit and LGPL 2.1) or bsd Or GPL-2.0 or (mit and LGPL 2.1)' >>> parsed2 = licensing.parse(expression2) >>> str(parsed2) 'GPL-2.0 OR (mit AND LGPL 2.1) OR BSD OR GPL-2.0 OR (mit AND LGPL 2.1)' >>> assert str(parsed2.simplify()) == 'BSD OR GPL-2.0 OR (LGPL 2.1 AND mit)'
Expression can be simplified, treating them as boolean expressions:
>>> expression2 = ' GPL-2.0 or (mit and LGPL 2.1) or bsd Or GPL-2.0 or (mit and LGPL 2.1)' >>> parsed2 = licensing.parse(expression2) >>> str(parsed2) 'GPL-2.0 OR (mit AND LGPL 2.1) OR BSD OR GPL-2.0 OR (mit AND LGPL 2.1)' >>> assert str(parsed2.simplify()) == 'BSD OR GPL-2.0 OR (LGPL 2.1 AND mit)'
Two expressions can be compared for equivalence and containment:
>>> expr1 = licensing.parse(' GPL-2.0 or (LGPL 2.1 and mit) ') >>> expr2 = licensing.parse(' (mit and LGPL 2.1) or GPL-2.0 ') >>> licensing.is_equivalent(expr1, expr2) True >>> licensing.is_equivalent(' GPL-2.0 or (LGPL 2.1 and mit) ', ... ' (mit and LGPL 2.1) or GPL-2.0 ') True >>> expr1.simplify() == expr2.simplify() True >>> expr3 = licensing.parse(' GPL-2.0 or mit or LGPL 2.1') >>> licensing.is_equivalent(expr2, expr3) False >>> expr4 = licensing.parse('mit and LGPL 2.1') >>> expr4.simplify() in expr2.simplify() True >>> licensing.contains(expr2, expr4) True
Development
Checkout a clone from https://github.com/nexB/license-expression.git
Then run ./configure --dev and then source tmp/bin/activate on Linux and POSIX. This will install all dependencies in a local virtualenv, including development deps.
On Windows run configure.bat --dev and then Scripts\bin\activate instead.
To run the tests, run pytest -vvs
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file license-expression-30.2.0.tar.gz
.
File metadata
- Download URL: license-expression-30.2.0.tar.gz
- Upload date:
- Size: 174.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 599928edd995c43fc335e0af342076144dc71cb858afa1ed9c1c30c4e81794f5 |
|
MD5 | 10e449034652229f925bc778ea4518e9 |
|
BLAKE2b-256 | 8b5cdb493282aeb3f05e89b4db45898faddaa339740eaccb752942042410703f |
File details
Details for the file license_expression-30.2.0-py3-none-any.whl
.
File metadata
- Download URL: license_expression-30.2.0-py3-none-any.whl
- Upload date:
- Size: 107.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1a7dc2bb2d09cdc983d072e4f9adc787e107e09def84cbb3919baaaf4f8e6fa1 |
|
MD5 | a23dd2f53d43b27bb4f2520f79124d70 |
|
BLAKE2b-256 | f00386025ce25a5c888e9bb3ecd0bcf38a51c087a35652537047f678df7a2ba8 |