Skip to main content

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

Project description

semantic-text-splitter

Documentation Status Licence

Large language models (LLMs) can be used for many tasks, but often have a limited context size that can be smaller than documents you might want to use. To use documents of larger length, you often have to split your text into chunks to fit within this context size.

This crate provides methods for splitting longer pieces of text into smaller chunks, aiming to maximize a desired chunk size, but still splitting at semantically sensible boundaries whenever possible.

Get Started

By Number of Characters

from semantic_text_splitter import TextSplitter

# Maximum number of characters in a chunk
max_characters = 1000
# Optionally can also have the splitter not trim whitespace for you
splitter = TextSplitter(max_characters)
# splitter = TextSplitter(max_characters, trim=False)

chunks = splitter.chunks("your document text")

Using a Range for Chunk Capacity

You also have the option of specifying your chunk capacity as a range.

Once a chunk has reached a length that falls within the range it will be returned.

It is always possible that a chunk may be returned that is less than the start value, as adding the next piece of text may have made it larger than the end capacity.

from semantic_text_splitter import TextSplitter


# Maximum number of characters in a chunk. Will fill up the
# chunk until it is somewhere in this range.
splitter = TextSplitter((200,1000))

chunks = splitter.chunks("your document text")

Using a Hugging Face Tokenizer

from semantic_text_splitter import TextSplitter
from tokenizers import Tokenizer

# Maximum number of tokens in a chunk
max_tokens = 1000
tokenizer = Tokenizer.from_pretrained("bert-base-uncased")
# If your tokenizer has truncation enabled, disable it before passing it to
# the splitter. Otherwise chunk sizes can be capped by the tokenizer's
# truncation limit.
tokenizer.no_truncation()
splitter = TextSplitter.from_huggingface_tokenizer(tokenizer, max_tokens)

chunks = splitter.chunks("your document text")

Using a Tiktoken Tokenizer

from semantic_text_splitter import TextSplitter

# Maximum number of tokens in a chunk
max_tokens = 1000
splitter = TextSplitter.from_tiktoken_model("gpt-3.5-turbo", max_tokens)

chunks = splitter.chunks("your document text")

Using a Custom Callback

from semantic_text_splitter import TextSplitter

splitter = TextSplitter.from_callback(lambda text: len(text), 1000)

chunks = splitter.chunks("your document text")

Markdown

All of the above examples also can also work with Markdown text. You can use the MarkdownSplitter in the same ways as the TextSplitter.

from semantic_text_splitter import MarkdownSplitter

# Maximum number of characters in a chunk
max_characters = 1000
# Optionally can also have the splitter not trim whitespace for you
splitter = MarkdownSplitter(max_characters)
# splitter = MarkdownSplitter(max_characters, trim=False)

chunks = splitter.chunks("# Header\n\nyour document text")

Method

To preserve as much semantic meaning within a chunk as possible, each chunk is composed of the largest semantic units that can fit in the next given chunk. For each splitter type, there is a defined set of semantic levels. Here is an example of the steps used:

  1. Split the text by a increasing semantic levels.
  2. Check the first item for each level and select the highest level whose first item still fits within the chunk size.
  3. Merge as many of these neighboring sections of this level or above into a chunk to maximize chunk length. Boundaries of higher semantic levels are always included when merging, so that the chunk doesn't inadvertantly cross semantic boundaries.

The boundaries used to split the text if using the chunks method, in ascending order:

TextSplitter Semantic Levels

  1. Characters
  2. Unicode Grapheme Cluster Boundaries
  3. Unicode Word Boundaries
  4. Unicode Sentence Boundaries
  5. Ascending sequence length of newlines. (Newline is \r\n, \n, or \r) Each unique length of consecutive newline sequences is treated as its own semantic level. So a sequence of 2 newlines is a higher level than a sequence of 1 newline, and so on.

Splitting doesn't occur below the character level, otherwise you could get partial bytes of a char, which may not be a valid unicode str.

MarkdownSplitter Semantic Levels

Markdown is parsed according to the CommonMark spec, along with some optional features such as GitHub Flavored Markdown.

  1. Characters
  2. Unicode Grapheme Cluster Boundaries
  3. Unicode Word Boundaries
  4. Unicode Sentence Boundaries
  5. Soft line breaks (single newline) which isn't necessarily a new element in Markdown.
  6. Inline elements such as: text nodes, emphasis, strong, strikethrough, link, image, table cells, inline code, footnote references, task list markers, and inline html.
  7. Block elements suce as: paragraphs, code blocks, footnote definitions, metadata. Also, a block quote or row/item within a table or list that can contain other "block" type elements, and a list or table that contains items.
  8. Thematic breaks or horizontal rules.
  9. Headings by level

Splitting doesn't occur below the character level, otherwise you could get partial bytes of a char, which may not be a valid unicode str.

Note on sentences

There are lots of methods of determining sentence breaks, all to varying degrees of accuracy, and many requiring ML models to do so. Rather than trying to find the perfect sentence breaks, we rely on unicode method of sentence boundaries, which in most cases is good enough for finding a decent semantic breaking point if a paragraph is too large, and avoids the performance penalties of many other methods.

Inspiration

This crate was inspired by LangChain's TextSplitter. But, looking into the implementation, there was potential for better performance as well as better semantic chunking.

A big thank you to the Unicode team for their icu_segmenter crate that manages a lot of the complexity of matching the Unicode rules for words and sentences.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

semantic_text_splitter-0.32.0.tar.gz (292.5 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.15CPython 3.15+manylinux: glibc 2.28+ x86-64

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_s390x.whl (8.7 MB view details)

Uploaded CPython 3.15CPython 3.15+manylinux: glibc 2.28+ s390x

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_ppc64le.whl (8.8 MB view details)

Uploaded CPython 3.15CPython 3.15+manylinux: glibc 2.28+ ppc64le

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_armv7l.whl (8.4 MB view details)

Uploaded CPython 3.15CPython 3.15+manylinux: glibc 2.28+ ARMv7l

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_aarch64.whl (8.5 MB view details)

Uploaded CPython 3.15CPython 3.15+manylinux: glibc 2.28+ ARM64

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_11_0_arm64.whl (8.3 MB view details)

Uploaded CPython 3.15CPython 3.15+macOS 11.0+ ARM64

semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_10_12_x86_64.whl (8.3 MB view details)

Uploaded CPython 3.15CPython 3.15+macOS 10.12+ x86-64

semantic_text_splitter-0.32.0-cp314-cp314t-win_amd64.whl (8.0 MB view details)

Uploaded CPython 3.14tWindows x86-64

semantic_text_splitter-0.32.0-cp314-cp314t-win32.whl (7.8 MB view details)

Uploaded CPython 3.14tWindows x86

semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ x86-64

semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_s390x.whl (8.7 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ s390x

semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_ppc64le.whl (8.8 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ ppc64le

semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_armv7l.whl (8.4 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ ARMv7l

semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_aarch64.whl (8.5 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ ARM64

semantic_text_splitter-0.32.0-cp314-cp314t-macosx_11_0_arm64.whl (8.3 MB view details)

Uploaded CPython 3.14tmacOS 11.0+ ARM64

semantic_text_splitter-0.32.0-cp314-cp314t-macosx_10_12_x86_64.whl (8.3 MB view details)

Uploaded CPython 3.14tmacOS 10.12+ x86-64

semantic_text_splitter-0.32.0-cp314-cp314-win_amd64.whl (8.0 MB view details)

Uploaded CPython 3.14Windows x86-64

semantic_text_splitter-0.32.0-cp314-cp314-win32.whl (7.8 MB view details)

Uploaded CPython 3.14Windows x86

semantic_text_splitter-0.32.0-cp310-abi3-win_amd64.whl (8.1 MB view details)

Uploaded CPython 3.10+Windows x86-64

semantic_text_splitter-0.32.0-cp310-abi3-win32.whl (7.8 MB view details)

Uploaded CPython 3.10+Windows x86

semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ x86-64

semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_s390x.whl (8.7 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ s390x

semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_ppc64le.whl (8.9 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ ppc64le

semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_armv7l.whl (8.4 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ ARMv7l

semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_aarch64.whl (8.5 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ ARM64

semantic_text_splitter-0.32.0-cp310-abi3-macosx_11_0_arm64.whl (8.3 MB view details)

Uploaded CPython 3.10+macOS 11.0+ ARM64

semantic_text_splitter-0.32.0-cp310-abi3-macosx_10_12_x86_64.whl (8.3 MB view details)

Uploaded CPython 3.10+macOS 10.12+ x86-64

File details

Details for the file semantic_text_splitter-0.32.0.tar.gz.

File metadata

  • Download URL: semantic_text_splitter-0.32.0.tar.gz
  • Upload date:
  • Size: 292.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0.tar.gz
Algorithm Hash digest
SHA256 f5ac695ec37c9df6e0aec991e6a4bb50ea013c293df4bb2873bd804be9adccd5
MD5 549f56b0897d2bf25e8a43c0aae04d20
BLAKE2b-256 e39b0ab37abd69e7aba1906517b42b66b27b9adbada21258c6b14d5c5218e243

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_x86_64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_x86_64.whl
  • Upload date:
  • Size: 8.5 MB
  • Tags: CPython 3.15, CPython 3.15+, manylinux: glibc 2.28+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 79ea6a590cd5b3b925f44011a9c86433b56990865e30895a1ee106c4f964b5fb
MD5 8d421778f1a88a733f0b9cff12841870
BLAKE2b-256 d51e3120e5047bd12e8bf62ad39978dad67caec92082901b6fb26cfef80a0c3b

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_s390x.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_s390x.whl
  • Upload date:
  • Size: 8.7 MB
  • Tags: CPython 3.15, CPython 3.15+, manylinux: glibc 2.28+ s390x
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_s390x.whl
Algorithm Hash digest
SHA256 70ee827f3e39f7f403b0b6bbb3586d134888e7ef62e0b791df0a349ca572781d
MD5 07895c1d47b3260b34879cb0254e5aeb
BLAKE2b-256 4b6f9972d79631c3c6a0eb39f3dfd2b200b7acd7016e37758184cc60769579c6

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_ppc64le.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_ppc64le.whl
  • Upload date:
  • Size: 8.8 MB
  • Tags: CPython 3.15, CPython 3.15+, manylinux: glibc 2.28+ ppc64le
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_ppc64le.whl
Algorithm Hash digest
SHA256 b370b457844a41afb58e181800ad5755b47b7f8a0c334b1a146255b869a962ec
MD5 190e174a6d893815404f53216ca39490
BLAKE2b-256 d33059a6111e897e159bcb06294f5e82091ba8b719c665bd900d544e8b96a124

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_armv7l.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_armv7l.whl
  • Upload date:
  • Size: 8.4 MB
  • Tags: CPython 3.15, CPython 3.15+, manylinux: glibc 2.28+ ARMv7l
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_armv7l.whl
Algorithm Hash digest
SHA256 d57c14b266e73d25311290fd96db2cb040b48a1a3eaf3871719bce854d1c9fa3
MD5 197666d0d48de1b7cdac9793ae8a8954
BLAKE2b-256 bb46bdfa95a5c97ed62375037c8ab8f991cda7b57728620b0caa3d8de94e3afc

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_aarch64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_aarch64.whl
  • Upload date:
  • Size: 8.5 MB
  • Tags: CPython 3.15, CPython 3.15+, manylinux: glibc 2.28+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 d954266f18c384231c8335a1f035cb069746b6f65019b1f32d040917f89598c2
MD5 a8c2a7e5ccb7fb40a3f1b48a33741d1c
BLAKE2b-256 2f773a0167376541ced609b35fb93c01bb903e65578bda2fa1c6390fd537e449

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_11_0_arm64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 8.3 MB
  • Tags: CPython 3.15, CPython 3.15+, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0118b6a4ade6b66d030c8b196e35772d3f48585f70fe7c5912dbbb70458d9147
MD5 b32544c9b0aaa5ae4733a77dd0676448
BLAKE2b-256 779df85e80c0c9c84e39c7f1724e85e18da5cdcb311b28e4aa8380abeebc7eb3

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_10_12_x86_64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_10_12_x86_64.whl
  • Upload date:
  • Size: 8.3 MB
  • Tags: CPython 3.15, CPython 3.15+, macOS 10.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp315-abi3.abi3t-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 d1cf3dbbd2b98eff8099efbdbb723efe06a9572c76ded06a46bef7612e7e9428
MD5 5435c4791ca19c2aa71b795f60c6e349
BLAKE2b-256 9c3714bafdc91e6f2fefaf28681f9440f6869aa2a6f9dde17f77d40d9882a5a8

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-win_amd64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-win_amd64.whl
  • Upload date:
  • Size: 8.0 MB
  • Tags: CPython 3.14t, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-win_amd64.whl
Algorithm Hash digest
SHA256 a88266d919a022de8c5bdc1a7d8fdf26bee141c44f0854c32eb926e9c59cca71
MD5 6d509edd11f784b629574474ec4392b4
BLAKE2b-256 e60094d0a6db7e1969e627e7d56708c29514a0bcbe04697b21d1d86a76b2ad28

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-win32.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-win32.whl
  • Upload date:
  • Size: 7.8 MB
  • Tags: CPython 3.14t, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-win32.whl
Algorithm Hash digest
SHA256 b70b934aa8320fb3a4914155fe635923cc08409c3aecf1e7763a55159133d782
MD5 2010d85a89b31a036519c6fd7a5dacf8
BLAKE2b-256 297e3dac7f15adec86ae0164de93d5aee906b99a79fcd4f0d57bdbc4c77d4ce2

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_x86_64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_x86_64.whl
  • Upload date:
  • Size: 8.5 MB
  • Tags: CPython 3.14t, manylinux: glibc 2.28+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 2f04432cce363bf3b9a79211667d1e1c894a523e96050753fa63e90a862f28c5
MD5 84e0fac95f2edc3d107a09dde9853ac7
BLAKE2b-256 5eebac671d2b6d151d7be19dd599cb2f85ce9ff1ad6a44aa48fffcaa2e7b5375

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_s390x.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_s390x.whl
  • Upload date:
  • Size: 8.7 MB
  • Tags: CPython 3.14t, manylinux: glibc 2.28+ s390x
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_s390x.whl
Algorithm Hash digest
SHA256 1384820563f0e01544010a6524a45e01c7e4ae2231defbf652232dfdc675a7b0
MD5 8dc1c55c313673c8b21cc18393b5a9f2
BLAKE2b-256 2d800d856cc5c5c34dcb9283b5c2e283f07473b6501e4f8c1be9aaf51bc0122e

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_ppc64le.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_ppc64le.whl
  • Upload date:
  • Size: 8.8 MB
  • Tags: CPython 3.14t, manylinux: glibc 2.28+ ppc64le
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_ppc64le.whl
Algorithm Hash digest
SHA256 e45f45939bce0dd1821714fc6a6e0d1951dc5c83e45fab1de80855e2912ff4a2
MD5 40b915bfde9ee79d7ec6624db1c5f75f
BLAKE2b-256 5177f79aa02be47746d5f3e11bff72617e740dbc7e913758a166f73d5763cd3b

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_armv7l.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_armv7l.whl
  • Upload date:
  • Size: 8.4 MB
  • Tags: CPython 3.14t, manylinux: glibc 2.28+ ARMv7l
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_armv7l.whl
Algorithm Hash digest
SHA256 21c1d80fd69918075c937f8b86321aa203febf8ae5f4cc0c575b5d5ccf97d8d3
MD5 8f53139549aeb539978d14b3880a2102
BLAKE2b-256 00ec85ffef36fc85fd8f294f0e8186e38dbabb3e7cbb3b1de708f5894233fbd4

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_aarch64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_aarch64.whl
  • Upload date:
  • Size: 8.5 MB
  • Tags: CPython 3.14t, manylinux: glibc 2.28+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 5184635b4b160e9af1d5a3681d162e7e3b6f69eaca208e3c646ff226ef08dc82
MD5 97094021627c0e9553dc755bfda585bf
BLAKE2b-256 7f1b5087485c303bacdde975ea4a0104887c18a5daef18c028b9ae64218805cc

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-macosx_11_0_arm64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 8.3 MB
  • Tags: CPython 3.14t, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 f1fe25a48e2d88770c5b8a4e39aae0c8005ac3c0f9f57073175f626b3677f7d0
MD5 98556899f074c963a4f8620d286027a1
BLAKE2b-256 ac171c86c00cff059ba5f94882d202dac19f77259ee8685b38a417fb6c0b180f

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314t-macosx_10_12_x86_64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314t-macosx_10_12_x86_64.whl
  • Upload date:
  • Size: 8.3 MB
  • Tags: CPython 3.14t, macOS 10.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314t-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 22a5a99cae0d372076301ed11be8a76eb4f3967818408bd4b45108fab9095368
MD5 f82e649d6c52a37cdc60019fe56bb94e
BLAKE2b-256 b20cffbff14aff6100e2f115cbd785a005c79bbb4a2cf2a3963c878bb29b1405

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314-win_amd64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314-win_amd64.whl
  • Upload date:
  • Size: 8.0 MB
  • Tags: CPython 3.14, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314-win_amd64.whl
Algorithm Hash digest
SHA256 7a9752556c18dd21a752d7aaba2f49d1f0edd19606c655a7e9e3f33751bc60c9
MD5 70a55c5933f5c35f6491cb04e538cbe5
BLAKE2b-256 38899d3e92ee7cefbbbb5d7e0c131eaab0924215d6e3f729686436d1dfa453c6

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp314-cp314-win32.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp314-cp314-win32.whl
  • Upload date:
  • Size: 7.8 MB
  • Tags: CPython 3.14, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp314-cp314-win32.whl
Algorithm Hash digest
SHA256 af8ff08944fd31109a3bfb07767b9e0ca1ee640396366d07dae39ff7a511cbed
MD5 3586f217cb544746ad6ca88ed57ee9ac
BLAKE2b-256 a3c4ca836371286e5afad6b21de568473196961ca3788dcd3e7ea8ed4d3aad03

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-win_amd64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-win_amd64.whl
  • Upload date:
  • Size: 8.1 MB
  • Tags: CPython 3.10+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 32510bb69c908fec1b32c3c0db60137235946d92eed2ced3bdacfe0893800b31
MD5 a223ca2e5285243cc569d4a33b0a0f96
BLAKE2b-256 ff5dae0a7d15ab6912988b160c9638354743dd822a2a9f4878bff599a089f2b6

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-win32.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-win32.whl
  • Upload date:
  • Size: 7.8 MB
  • Tags: CPython 3.10+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-win32.whl
Algorithm Hash digest
SHA256 5d3087587e12a95673062d2e675a54573020da7db4dc8739ac843fd10ab4f990
MD5 a061ee7e85e7137cb08f2c7dedb2c766
BLAKE2b-256 12a60e017ec1aaa1f06957be2a93156db58ed968ce8f6f2a91ca47603b3a91c9

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_x86_64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_x86_64.whl
  • Upload date:
  • Size: 8.5 MB
  • Tags: CPython 3.10+, manylinux: glibc 2.28+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f1a4db3fc0124e1b58e42a3d4558e6aa3dcf37905cd8d96ecbdf45fde2ab1930
MD5 174b55d3f7a14e0fa7964b3f33a21543
BLAKE2b-256 bb83f958b491a41ab0e63968c21b5289190ea4c4a7424c458e6ce40109042849

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_s390x.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_s390x.whl
  • Upload date:
  • Size: 8.7 MB
  • Tags: CPython 3.10+, manylinux: glibc 2.28+ s390x
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_s390x.whl
Algorithm Hash digest
SHA256 e17ac14939faed0c1902f7f6df447a3efcc8fe80c86480a12a49cacfa07ccd74
MD5 21881247cdabaf785dab56247e964acc
BLAKE2b-256 a8907d951855a3128f7699329cba0e13826fbc28e1f39d03eb8866abe426c342

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_ppc64le.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_ppc64le.whl
  • Upload date:
  • Size: 8.9 MB
  • Tags: CPython 3.10+, manylinux: glibc 2.28+ ppc64le
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_ppc64le.whl
Algorithm Hash digest
SHA256 edc62ab9823a581b72bfda4b2f91060eb4918f1b23535e6d5f69b2119ff195d5
MD5 a2cc660fd6f71b180bc375973990848c
BLAKE2b-256 eba61e20b094f0dbff3da8a37a983bfb7eec6400d77f1c192d6f3bce034fae02

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_armv7l.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_armv7l.whl
  • Upload date:
  • Size: 8.4 MB
  • Tags: CPython 3.10+, manylinux: glibc 2.28+ ARMv7l
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_armv7l.whl
Algorithm Hash digest
SHA256 7bd53d3f640d6fb24b955765f0b19e24ecedb5f8aaa976768f7dbaad889da2ca
MD5 a4c1f662c250161d0df9a901c103e6f2
BLAKE2b-256 b68347876b8c8aea154289448241824a244124e33d374cb8878230fb39a108a9

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_aarch64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_aarch64.whl
  • Upload date:
  • Size: 8.5 MB
  • Tags: CPython 3.10+, manylinux: glibc 2.28+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 1d985c01b9ffe7d63d4d649bad6ebb2ac13cc95ef3f22d2eb2e258ca381f42ee
MD5 014e1aa9f1435407885087dcc46dfbed
BLAKE2b-256 cae65df0f9d7bfc2b45a7d4a2367e1a92735c560bc2c3b886bc9a1b03fa5dd3f

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 8.3 MB
  • Tags: CPython 3.10+, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 a09ea5c4d960cd86e7d3d2e256e45ec06b0ef9ae5af20927fd20a06c2f2f1766
MD5 7d720e33d01221ae12992786c81e47c0
BLAKE2b-256 bf5cdeae3c77dd195716b1d831ecce4809ee9610dcd1d45f34d58ad7540bb70f

See more details on using hashes here.

File details

Details for the file semantic_text_splitter-0.32.0-cp310-abi3-macosx_10_12_x86_64.whl.

File metadata

  • Download URL: semantic_text_splitter-0.32.0-cp310-abi3-macosx_10_12_x86_64.whl
  • Upload date:
  • Size: 8.3 MB
  • Tags: CPython 3.10+, macOS 10.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for semantic_text_splitter-0.32.0-cp310-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 dd5bd0209402ef7a56759ff9764311aa59886903c0176b67e9f8aa9fae456560
MD5 089296aeaf3d9c6942e0b573b5073c43
BLAKE2b-256 b4dce341c6684b04e4d9f8844188f0de99bca6d67fe16fced62e3f390715ba71

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page