Links recognition library with FULL unicode support.

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

tsutsu3

These details have not been verified by PyPI

Project description

linkify-it-py

This is Python port of linkify-it.

Links recognition library with FULL unicode support. Focused on high quality link patterns detection in plain text.

Demo

Javascript Demo

Why it's awesome:

Full unicode support, with astral characters!
International domains support.
Allows rules extension & custom normalizers.

Python Version Support

Tested on Python 3.10–3.14. The primary version for code coverage follows the latest security phase release (currently 3.12).

Install

pip install linkify-it-py

conda install -c conda-forge linkify-it-py

Usage examples

Example 1. Simple use

from linkify_it import LinkifyIt


linkify = LinkifyIt()

print(linkify.test("Site github.com!"))
# => True

print(linkify.match("Site github.com!"))
# => [linkify_it.main.Match({
#         'schema': '',
#         'index': 5,
#         'last_index': 15,
#         'raw': 'github.com',
#         'text': 'github.com',
#         'url': 'http://github.com'
#     }]

Example 2. With options

from linkify_it import LinkifyIt
from linkify_it.tlds import TLDS


# Reload full tlds list & add unofficial `.onion` domain.
linkify = (
    LinkifyIt()
    .tlds(TLDS)               # Reload with full tlds list
    .tlds("onion", True)      # Add unofficial `.onion` domain
    .add("git:", "http:")     # Add `git:` protocol as "alias"
    .add("ftp:", None)        # Disable `ftp:` protocol
    .set({"fuzzy_ip": True})  # Enable IPs in fuzzy links (without schema)
)
print(linkify.test("Site tamanegi.onion!"))
# => True

print(linkify.match("Site tamanegi.onion!"))
# => [linkify_it.main.Match({
#         'schema': '',
#         'index': 5,
#         'last_index': 19,
#         'raw': 'tamanegi.onion',
#         'text': 'tamanegi.onion',
#         'url': 'http://tamanegi.onion'
#     }]

Example 3. Add twitter mentions handler

from linkify_it import LinkifyIt


linkify = LinkifyIt()

def validate(obj, text, pos):
    tail = text[pos:]

    if not obj.re.get("twitter"):
        obj.re["twitter"] = re.compile(
            "^([a-zA-Z0-9_]){1,15}(?!_)(?=$|" + obj.re["src_ZPCc"] + ")"
        )
    if obj.re["twitter"].search(tail):
        if pos > 2 and tail[pos - 2] == "@":
            return False
        return len(obj.re["twitter"].search(tail).group())
    return 0

def normalize(obj, match):
    match.url = "https://twitter.com/" + re.sub(r"^@", "", match.url)

linkify.add("@", {"validate": validate, "normalize": normalize})

API

API documentation

LinkifyIt(schemas, options)

Creates new linkifier instance with optional additional schemas.

By default understands:

http(s)://... , ftp://..., mailto:... & //... links
"fuzzy" links and emails (google.com, foo@bar.com).

schemas is an dict, where each key/value describes protocol/rule:

key - link prefix (usually, protocol name with : at the end, skype: for example). linkify-it-py makes sure that prefix is not preceded with alphanumeric char.
value - rule to check tail after link prefix
- str
  - just alias to existing rule
- dict
  - validate - either a re.Pattern (start with ^, and don't include the link prefix itself), or a validator function which, given arguments self, text and pos, returns the length of a match in text starting at index pos. pos is the index right after the link prefix. self can be used to access the linkify object to cache data.
  - normalize - optional function to normalize text & url of matched result (for example, for twitter mentions).

options:

fuzzy_link - recognize URL-s without http(s):// head. Default True.
fuzzy_ip - allow IPs in fuzzy links above. Can conflict with some texts like version numbers. Default False.
fuzzy_email - recognize emails without mailto: prefix. Default True.
--- - set True to terminate link with --- (if it's considered as long dash).

.test(text)

Searches linkifiable pattern and returns True on success or False on fail.

.pretest(text)

Quick check if link MAY BE can exist. Can be used to optimize more expensive .test() calls. Return False if link can not be found, True - if .test() call needed to know exactly.

.test_schema_at(text, name, position)

Similar to .test() but checks only specific protocol tail exactly at given position. Returns length of found pattern (0 on fail).

.match(text)

Returns list of found link matches or null if nothing found.

Each match has:

schema - link schema, can be empty for fuzzy links, or // for protocol-neutral links.
index - offset of matched text
last_index - index of next char after mathch end
raw - matched text
text - normalized text
url - link, generated from matched text

.matchAtStart(text)

Checks if a match exists at the start of the string. Returns Match (see docs for match(text)) or null if no URL is at the start. Doesn't work with fuzzy links.

.tlds(list_tlds, keep_old=False)

Load (or merge) new tlds list. Those are needed for fuzzy links (without schema) to avoid false positives. By default:

2-letter root zones are ok.
biz|com|edu|gov|net|org|pro|web|xxx|aero|asia|coop|info|museum|name|shop|рф are ok.
encoded (xn--...) root zones are ok.

If that's not enough, you can reload defaults with more detailed zones list.

.add(key, value)

Add a new schema to the schemas object. As described in the constructor definition, key is a link prefix (skype:, for example), and value is a str to alias to another schema, or an dict with validate and optionally normalize definitions. To disable an existing rule, use .add(key, None).

.set(options)

Override default options. Missed properties will not be changed.

License

MIT

Project details

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

tsutsu3

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.1.0

Mar 1, 2026

2.0.3

Feb 4, 2024

2.0.2

May 2, 2023

2.0.1

May 1, 2023

2.0.0

May 7, 2022

1.0.3

Dec 18, 2021

1.0.2

Oct 9, 2021

1.0.1

Dec 18, 2020

1.0.0

Nov 14, 2020

0.0.1

Nov 11, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

linkify_it_py-2.1.0.tar.gz (29.2 kB view details)

Uploaded Mar 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

linkify_it_py-2.1.0-py3-none-any.whl (19.9 kB view details)

Uploaded Mar 1, 2026 Python 3

File details

Details for the file linkify_it_py-2.1.0.tar.gz.

File metadata

Download URL: linkify_it_py-2.1.0.tar.gz
Upload date: Mar 1, 2026
Size: 29.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for linkify_it_py-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`43360231720999c10e9328dc3691160e27a718e280673d444c38d7d3aaa3b98b`
MD5	`887a913bb65438a83acca4a543616993`
BLAKE2b-256	`2ec906ea13676ef354f0af6169587ae292d3e2406e212876a413bf9eece4eb23`

See more details on using hashes here.

Provenance

The following attestation bundles were made for linkify_it_py-2.1.0.tar.gz:

Publisher: github-ci.yml on tsutsu3/linkify-it-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: linkify_it_py-2.1.0.tar.gz
- Subject digest: 43360231720999c10e9328dc3691160e27a718e280673d444c38d7d3aaa3b98b
- Sigstore transparency entry: 1006367962
- Sigstore integration time: Mar 1, 2026
Source repository:
- Permalink: tsutsu3/linkify-it-py@b9ee3494c2eb2d1fa4982dcbf00777b6bbc8e380
- Branch / Tag: refs/tags/v2.1.0
- Owner: https://github.com/tsutsu3
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: github-ci.yml@b9ee3494c2eb2d1fa4982dcbf00777b6bbc8e380
- Trigger Event: push

File details

Details for the file linkify_it_py-2.1.0-py3-none-any.whl.

File metadata

Download URL: linkify_it_py-2.1.0-py3-none-any.whl
Upload date: Mar 1, 2026
Size: 19.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for linkify_it_py-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0d252c1594ecba2ecedc444053db5d3a9b7ec1b0dd929c8f1d74dce89f86c05e`
MD5	`46f12b96f293ac32663931f6a8d235cc`
BLAKE2b-256	`b4de88b3be5c31b22333b3ca2f6ff1de4e863d8fe45aaea7485f591970ec1d3e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for linkify_it_py-2.1.0-py3-none-any.whl:

Publisher: github-ci.yml on tsutsu3/linkify-it-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: linkify_it_py-2.1.0-py3-none-any.whl
- Subject digest: 0d252c1594ecba2ecedc444053db5d3a9b7ec1b0dd929c8f1d74dce89f86c05e
- Sigstore transparency entry: 1006367972
- Sigstore integration time: Mar 1, 2026
Source repository:
- Permalink: tsutsu3/linkify-it-py@b9ee3494c2eb2d1fa4982dcbf00777b6bbc8e380
- Branch / Tag: refs/tags/v2.1.0
- Owner: https://github.com/tsutsu3
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: github-ci.yml@b9ee3494c2eb2d1fa4982dcbf00777b6bbc8e380
- Trigger Event: push

linkify-it-py 2.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

linkify-it-py

Python Version Support

Install

Usage examples

Example 1. Simple use

Example 2. With options

Example 3. Add twitter mentions handler

API

LinkifyIt(schemas, options)

.test(text)

.pretest(text)

.test_schema_at(text, name, position)

.match(text)

.matchAtStart(text)

.tlds(list_tlds, keep_old=False)

.add(key, value)

.set(options)

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance