Unsupervised text tokenizer and detokenizer.

Project description

SentencePiece Python Wrapper

Python wrapper for SentencePiece. This API supports encoding, decoding, and training of SentencePiece models.

Build and Install SentencePiece

On Linux (x64/i686), macOS, and Windows (win32/x64/arm64), you can install the SentencePiece Python module with pip.

% pip install sentencepiece

Before building SentencePiece from source on Linux, ensure that the following dependencies are installed.

% sudo apt update
% sudo apt install -y cmake pkg-config libsentencepiece-dev

To build and install the Python wrapper from source, run the following commands, which build a wheel package and then install it.

% git clone https://github.com/google/sentencepiece.git
% cd sentencepiece
% mkdir build
% cd build
% cmake .. -DSPM_ENABLE_SHARED=OFF -DCMAKE_INSTALL_PREFIX=./root -DSPM_DISABLE_EMBEDDED_DATA=ON
% make install
% cd ../python
% python setup.py bdist_wheel
% pip install dist/sentencepiece*.whl

If you don’t have write permission to the global site-packages directory, or don’t want to install into it, run this instead:

% python setup.py install --user

Windows users can build and install the Python wrapper from source using Visual Studio. First, install pwsh.exe (PowerShell 7). You can install it directly with: winget install --id Microsoft.Powershell --source winget. Then open the Developer PowerShell for VS 2022 and run the following commands.

git clone https://github.com/google/sentencepiece.git
cd sentencepiece
mkdir build
cd build
cmake .. -DSPM_ENABLE_SHARED=OFF -DCMAKE_INSTALL_PREFIX=".\root" -DSPM_DISABLE_EMBEDDED_DATA=ON
cmake --build . --config Release --target install
cd ../python
pip install wheel
python setup.py bdist_wheel
Get-ChildItem .\dist\sentencepiece*.whl | ForEach-Object { pip install $_.FullName }
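
After any of the installation routes above, a quick sanity check is to confirm that Python can see the package. The sketch below uses only the standard library; the helper name installed_version is an illustrative choice and not part of SentencePiece.

```python
from importlib import metadata


def installed_version(package="sentencepiece"):
    """Return the installed version string of `package`, or None if it is absent.

    installed_version is a hypothetical helper name used for illustration.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
if version is None:
    print("sentencepiece is not installed in this environment")
else:
    print(f"sentencepiece {version} is installed")
```

If the check reports that the package is missing, the wheel was most likely installed into a different Python environment than the one running the check.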

Usage

See this Google Colab page to run SentencePiece interactively.

Segmentation

% python
>>> import sentencepiece as spm
>>> sp = spm.SentencePieceProcessor(model_file='test/test_model.model')

>>> sp.encode('This is a test')
[284, 47, 11, 4, 15, 400]

>>> sp.encode(['This is a test', 'Hello world'], out_type=int)
[[284, 47, 11, 4, 15, 400], [151, 88, 21, 887]]

>>> sp.encode_as_ids(['This is a test', 'Hello world'])
[[284, 47, 11, 4, 15, 400], [151, 88, 21, 887]]

>>> sp.encode('This is a test', out_type=str)
['▁This', '▁is', '▁a', '▁', 't', 'est']

>>> sp.encode(['This is a test', 'Hello world'], out_type=str)
[['▁This', '▁is', '▁a', '▁', 't', 'est'], ['▁He', 'll', 'o', '▁world']]

>>> sp.encode_as_pieces(['This is a test', 'Hello world'])
[['▁This', '▁is', '▁a', '▁', 't', 'est'], ['▁He', 'll', 'o', '▁world']]

>>> proto = sp.encode('This is a test', out_type='immutable_proto')
>>> for n in proto.pieces:
...     print('piece="{}" surface="{}" id={} begin={} end={}'.format(n.piece, n.surface, n.id, n.begin, n.end))
...
piece="▁This" surface="This" id=284 begin=0 end=4
piece="▁is" surface=" is" id=47 begin=4 end=7
piece="▁a" surface=" a" id=11 begin=7 end=9
piece="▁" surface=" " id=4 begin=9 end=10
piece="t" surface="t" id=15 begin=10 end=11
piece="est" surface="est" id=400 begin=11 end=14

>>> [[x.id for x in proto.pieces], [x.piece for x in proto.pieces], [x.begin for x in proto.pieces], [x.end for x in proto.pieces]]
[[284, 47, 11, 4, 15, 400], ['▁This', '▁is', '▁a', '▁', 't', 'est'], [0, 4, 7, 9, 10, 11], [4, 7, 9, 10, 11, 14]]

>>> proto2 = sp.encode_as_immutable_proto('This is a test')
>>> proto2 == proto
True
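
The begin and end fields in the output above can be read as half-open spans into the input text. The following sketch reuses the values printed above, with no SentencePiece call, to show that slicing the input with each span recovers the surface string. The input here is ASCII, so character and byte offsets coincide; for non-ASCII text, check which unit the offsets use before relying on this.

```python
text = "This is a test"

# (piece, begin, end) triples copied from the example output above.
pieces = [
    ("▁This", 0, 4),
    ("▁is", 4, 7),
    ("▁a", 7, 9),
    ("▁", 9, 10),
    ("t", 10, 11),
    ("est", 11, 14),
]

# Each [begin, end) span selects the surface string of its piece.
surfaces = [text[begin:end] for _, begin, end in pieces]
print(surfaces)

# The spans are contiguous, so joining the surfaces rebuilds the input.
assert "".join(surfaces) == text
```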

>>> for _ in range(10):
...     sp.encode('This is a test', out_type=str, enable_sampling=True, alpha=0.1, nbest_size=-1)
...
['▁', 'This', '▁', 'is', '▁a', '▁', 't', 'e', 'st']
['▁T', 'h', 'i', 's', '▁is', '▁a', '▁', 'te', 's', 't']
['▁T', 'h', 'is', '▁', 'is', '▁', 'a', '▁', 't', 'est']
['▁', 'This', '▁is', '▁', 'a', '▁', 't', 'e', 'st']
['▁', 'This', '▁', 'is', '▁', 'a', '▁', 't', 'e', 's', 't']
['▁This', '▁is', '▁a', '▁', 'te', 's', 't']
['▁This', '▁is', '▁', 'a', '▁', 't', 'e', 'st']
['▁', 'T', 'h', 'is', '▁', 'is', '▁', 'a', '▁', 'te', 'st']
['▁', 'This', '▁', 'i', 's', '▁a', '▁', 't', 'e', 'st']
['▁This', '▁', 'is', '▁a', '▁', 't', 'est']

>>> sp.nbest_encode('This is a test', nbest_size=5, out_type=str)
[['▁This', '▁is', '▁a', '▁', 't', 'est'],
['▁This', '▁is', '▁a', '▁', 'te', 'st'],
['▁This', '▁is', '▁a', '▁', 'te', 's', 't'],
['▁This', '▁is', '▁a', '▁', 't', 'e', 'st'],
['▁This', '▁is', '▁a', '▁', 't', 'es', 't']]

>>> sp.sample_encode_and_score('This is a test', num_samples=5, alpha=0.1, out_type=str, wor=True)
[(['▁This', '▁', 'i', 's', '▁a', '▁', 'te', 's', 't'], -3.043105125427246),
(['▁This', '▁', 'i', 's', '▁a', '▁', 'te', 'st'], -2.8475849628448486),
(['▁', 'This', '▁is', '▁', 'a', '▁', 'te', 'st'], -3.043248176574707),
(['▁', 'This', '▁is', '▁a', '▁', 't', 'e', 'st'], -2.87727689743042),
(['▁', 'This', '▁', 'i', 's', '▁', 'a', '▁', 't', 'est'], -3.6284031867980957)]

>>> sp.decode([284, 47, 11, 4, 15, 400])
'This is a test'

>>> sp.decode([[284, 47, 11, 4, 15, 400], [151, 88, 21, 887]])
['This is a test', 'Hello world']

>>> proto = sp.decode([284, 47, 11, 4, 15, 400], out_type='immutable_proto')
>>> proto.text
'This is a test'

>>> sp.decode(['▁', 'This', '▁', 'is', '▁a', '▁', 't', 'e', 'st'])
'This is a test'

>>> sp.decode([['▁This', '▁is', '▁a', '▁', 't', 'est'], ['▁He', 'll', 'o', '▁world']])
['This is a test', 'Hello world']
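
Every segmentation of a sentence decodes to the same text because the pieces carry whitespace explicitly as the '▁' (U+2581) marker. The sketch below imitates that behavior in plain Python for ordinary pieces. It is only an approximation: the function name detokenize is illustrative, and control symbols, byte fallback, and normalization are ignored, so sp.decode() remains the reference.

```python
def detokenize(pieces):
    """Approximate detokenization for ordinary pieces (illustrative helper only)."""
    # Join the pieces and map the whitespace marker U+2581 back to a space.
    text = "".join(pieces).replace("\u2581", " ")
    # The marker on the first piece yields a leading space; drop it.
    return text[1:] if text.startswith(" ") else text


# Segmentations copied from the examples above: one deterministic, one sampled.
best = ["▁This", "▁is", "▁a", "▁", "t", "est"]
sampled = ["▁", "This", "▁", "is", "▁a", "▁", "t", "e", "st"]

print(detokenize(best))
print(detokenize(sampled))
```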

>>> sp.get_piece_size()
1000

>>> sp.id_to_piece(2)
'</s>'

>>> sp.id_to_piece([2, 3, 4])
['</s>', '\r', '▁']

>>> sp.piece_to_id('<s>')
1

>>> sp.piece_to_id(['</s>', '\r', '▁'])
[2, 3, 4]

>>> len(sp)
1000

>>> sp['</s>']
2

Model Training

Training is performed by passing the parameters of spm_train as keyword arguments to the SentencePieceTrainer.train() function.

>>> import sentencepiece as spm
>>> spm.SentencePieceTrainer.train(input='test/botchan.txt', model_prefix='m', vocab_size=1000, user_defined_symbols=['foo', 'bar'])
sentencepiece_trainer.cc(73) LOG(INFO) Starts training with :
trainer_spec {
  input: test/botchan.txt
  .. snip
unigram_model_trainer.cc(500) LOG(INFO) EM sub_iter=1 size=1188 obj=10.2839 num_tokens=32182 num_tokens/piece=27.0892
unigram_model_trainer.cc(500) LOG(INFO) EM sub_iter=0 size=1100 obj=10.4269 num_tokens=33001 num_tokens/piece=30.0009
unigram_model_trainer.cc(500) LOG(INFO) EM sub_iter=1 size=1100 obj=10.4069 num_tokens=33002 num_tokens/piece=30.0018
trainer_interface.cc(595) LOG(INFO) Saving model: m.model
trainer_interface.cc(619) LOG(INFO) Saving vocabs: m.vocab
>>>
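
To make the correspondence with spm_train concrete, the sketch below renders keyword arguments like those in the call above as command-line style flags. The helper to_spm_flags is hypothetical and written only for illustration; it assumes the --name=value flag form with comma-separated lists and is not claimed to match how the wrapper converts arguments internally.

```python
def to_spm_flags(**params):
    """Render keyword arguments as an spm_train-style flag string.

    to_spm_flags is a hypothetical helper, not part of SentencePiece.
    """
    flags = []
    for name, value in params.items():
        if isinstance(value, bool):
            # Assumption: boolean flags are spelled true/false.
            value = "true" if value else "false"
        elif isinstance(value, (list, tuple)):
            # Assumption: list-valued flags are comma-separated.
            value = ",".join(str(item) for item in value)
        flags.append(f"--{name}={value}")
    return " ".join(flags)


print(to_spm_flags(
    input="test/botchan.txt",
    model_prefix="m",
    vocab_size=1000,
    user_defined_symbols=["foo", "bar"],
))
```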

Training without local filesystem

The SentencePiece trainer accepts any iterable object as a source of training sentences. You can also pass a file-like object (any instance with a write() method) to emit the output model to an arbitrary destination. These features are useful when running SentencePiece in environments with limited access to the local file system (e.g., Google Colab).

import urllib.request
import io
import sentencepiece as spm

# Reads the training corpus from a URL as an iterator and stores the trained model in a BytesIO buffer.
model = io.BytesIO()
with urllib.request.urlopen(
    'https://raw.githubusercontent.com/google/sentencepiece/master/data/botchan.txt'
) as response:
  spm.SentencePieceTrainer.train(
      sentence_iterator=response, model_writer=model, vocab_size=1000)

# Optionally serialize the model to a file.
# with open('out.model', 'wb') as f:
#   f.write(model.getvalue())

# Load the model directly from the serialized bytes.
sp = spm.SentencePieceProcessor(model_proto=model.getvalue())
print(sp.encode('this is test'))
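
The same pattern works for sentences that are already in memory. The sketch below is a minimal variant under stated assumptions: iter_sentences and train_from_sentences are hypothetical helper names, the input is wrapped in iter() as a conservative choice so that a true iterator is passed, and the vocab_size that training accepts depends on the size of the corpus. The sentencepiece import is deferred so that the line-cleaning helper can be used on its own.

```python
import io


def iter_sentences(text):
    """Yield non-empty, stripped lines from an in-memory string (illustrative helper)."""
    for line in io.StringIO(text):
        line = line.strip()
        if line:
            yield line


def train_from_sentences(sentences, vocab_size):
    """Train on an iterable of sentences and return the serialized model bytes.

    Hypothetical wrapper around the sentence_iterator/model_writer arguments
    shown above; it is not part of SentencePiece itself.
    """
    import sentencepiece as spm  # deferred import; only needed for training

    model = io.BytesIO()
    spm.SentencePieceTrainer.train(
        sentence_iterator=iter(sentences), model_writer=model, vocab_size=vocab_size)
    return model.getvalue()


print(list(iter_sentences("first sentence\n\n  second sentence  \n")))
```

The returned bytes can then be passed as model_proto to SentencePieceProcessor, as in the example above.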

Free Threading support

Experimental support for free-threaded (no-GIL) Python was introduced in v0.2.1. For more details, please refer to this page. The approach is similar to the way NumPy handles free threading.

The C++ library's const and static methods, e.g., encode(), decode(), and train(), are designed to work without the GIL. However, non-const methods, e.g., load(), are subject to potential data races, so guard calls to them with appropriate locks.

This limitation may be removed in the future, but it is not a simple fix, as it would require additional shared locks in the C++ library.
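
One way to respect this constraint, sketched below, is to never call a non-const method on a processor that other threads may be using: load each model into a fresh, private object and publish it by swapping a reference under a lock. ProcessorHolder is a hypothetical helper written for this illustration, not part of SentencePiece, and it is one possible pattern rather than the recommended one.

```python
import threading


class ProcessorHolder:
    """Publishes fully loaded processors to worker threads.

    ProcessorHolder is a hypothetical helper, not part of SentencePiece.
    """

    def __init__(self, factory):
        # factory: callable taking a model path and returning a loaded processor.
        self._factory = factory
        self._lock = threading.Lock()
        self._current = None

    def reload(self, model_path):
        # Loading happens on a private object that no other thread can see yet.
        fresh = self._factory(model_path)
        with self._lock:
            self._current = fresh

    def get(self):
        # Workers grab the current processor and only call const methods on it.
        with self._lock:
            return self._current
```

With SentencePiece installed, the factory could be lambda path: spm.SentencePieceProcessor(model_file=path), and worker threads would call holder.get().encode(...). A thread that fetched the previous processor simply keeps using it until its next get().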

Release history

Version	Release date
0.2.1 (this version)	Aug 12, 2025
0.2.0	Feb 19, 2024
0.1.99	May 2, 2023
0.1.98	Apr 12, 2023
0.1.97	Aug 7, 2022
0.1.96	Jun 18, 2021
0.1.95	Jan 10, 2021
0.1.94	Oct 24, 2020
0.1.92 (yanked; reason: "Crash bug is reported (confirming)")	Jun 8, 2020
0.1.91	May 21, 2020
0.1.90	May 13, 2020
0.1.86	Apr 24, 2020
0.1.85	Dec 15, 2019
0.1.83	Aug 16, 2019
0.1.82	Apr 13, 2019
0.1.81	Mar 22, 2019
0.1.8	Jan 11, 2019
0.1.7	Dec 26, 2018
0.1.6	Nov 12, 2018
0.1.5	Oct 29, 2018
0.1.4	Aug 26, 2018
0.1.3	Jul 30, 2018
0.1.2	Jul 13, 2018
0.1.1	Jun 26, 2018
0.1.0	Jun 10, 2018
0.0.9	May 11, 2018
0.0.7	Apr 29, 2018
0.0.6	Apr 18, 2018
0.0.5	Apr 9, 2018
0.0.4	Feb 28, 2018
0.0.3	Dec 17, 2017
0.0.2	Nov 8, 2017
0.0.0	Aug 28, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sentencepiece-0.2.1.tar.gz (3.2 MB), uploaded Aug 12, 2025, Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.
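
As a rough guide to the file name format, a wheel name without a build tag consists of five hyphen-separated fields: distribution, version, Python tag, ABI tag, and platform tag, where the platform tag may bundle several platforms separated by dots. The sketch below splits a name accordingly; parse_wheel_name is an illustrative helper and does not handle names that carry an optional build tag.

```python
def parse_wheel_name(filename):
    """Split a wheel file name (without a build tag) into its fields.

    parse_wheel_name is a hypothetical helper for illustration only.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) != 5:
        raise ValueError(f"unexpected number of fields in: {filename}")
    name, version, python_tag, abi_tag, platform_tag = parts
    return {
        "name": name,
        "version": version,
        "python": python_tag,
        "abi": abi_tag,
        # A compressed platform tag lists several platforms separated by dots.
        "platforms": platform_tag.split("."),
    }


info = parse_wheel_name(
    "sentencepiece-0.2.1-cp314-cp314t-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl")
print(info)
```

In this example the ABI tag cp314t, with its trailing t, marks a free-threaded CPython build.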

All built distributions below were uploaded Aug 12, 2025.

File	Size	Python	Platform
sentencepiece-0.2.1-cp314-cp314t-win_arm64.whl	1.1 MB	CPython 3.14t	Windows ARM64
sentencepiece-0.2.1-cp314-cp314t-win_amd64.whl	1.2 MB	CPython 3.14t	Windows x86-64
sentencepiece-0.2.1-cp314-cp314t-win32.whl	1.1 MB	CPython 3.14t	Windows x86
sentencepiece-0.2.1-cp314-cp314t-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.14t	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp314-cp314t-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.14t	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp314-cp314t-macosx_11_0_arm64.whl	1.3 MB	CPython 3.14t	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp314-cp314t-macosx_10_13_x86_64.whl	1.3 MB	CPython 3.14t	macOS 10.13+ x86-64
sentencepiece-0.2.1-cp314-cp314t-macosx_10_13_universal2.whl	1.9 MB	CPython 3.14t	macOS 10.13+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp314-cp314-win_arm64.whl	1.1 MB	CPython 3.14	Windows ARM64
sentencepiece-0.2.1-cp314-cp314-win_amd64.whl	1.2 MB	CPython 3.14	Windows x86-64
sentencepiece-0.2.1-cp314-cp314-win32.whl	1.1 MB	CPython 3.14	Windows x86
sentencepiece-0.2.1-cp314-cp314-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.14	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp314-cp314-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.14	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp314-cp314-macosx_11_0_arm64.whl	1.3 MB	CPython 3.14	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp314-cp314-macosx_10_13_x86_64.whl	1.3 MB	CPython 3.14	macOS 10.13+ x86-64
sentencepiece-0.2.1-cp314-cp314-macosx_10_13_universal2.whl	1.9 MB	CPython 3.14	macOS 10.13+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp313-cp313t-win_arm64.whl	1.0 MB	CPython 3.13t	Windows ARM64
sentencepiece-0.2.1-cp313-cp313t-win_amd64.whl	1.1 MB	CPython 3.13t	Windows x86-64
sentencepiece-0.2.1-cp313-cp313t-win32.whl	1.0 MB	CPython 3.13t	Windows x86
sentencepiece-0.2.1-cp313-cp313t-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.13t	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp313-cp313t-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.13t	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp313-cp313t-macosx_11_0_arm64.whl	1.3 MB	CPython 3.13t	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp313-cp313t-macosx_10_13_x86_64.whl	1.3 MB	CPython 3.13t	macOS 10.13+ x86-64
sentencepiece-0.2.1-cp313-cp313t-macosx_10_13_universal2.whl	1.9 MB	CPython 3.13t	macOS 10.13+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp313-cp313-win_arm64.whl	1.0 MB	CPython 3.13	Windows ARM64
sentencepiece-0.2.1-cp313-cp313-win_amd64.whl	1.1 MB	CPython 3.13	Windows x86-64
sentencepiece-0.2.1-cp313-cp313-win32.whl	999.5 kB	CPython 3.13	Windows x86
sentencepiece-0.2.1-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.13	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp313-cp313-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.13	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp313-cp313-macosx_11_0_arm64.whl	1.3 MB	CPython 3.13	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp313-cp313-macosx_10_13_x86_64.whl	1.3 MB	CPython 3.13	macOS 10.13+ x86-64
sentencepiece-0.2.1-cp313-cp313-macosx_10_13_universal2.whl	1.9 MB	CPython 3.13	macOS 10.13+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp312-cp312-win_arm64.whl	1.0 MB	CPython 3.12	Windows ARM64
sentencepiece-0.2.1-cp312-cp312-win_amd64.whl	1.1 MB	CPython 3.12	Windows x86-64
sentencepiece-0.2.1-cp312-cp312-win32.whl	999.5 kB	CPython 3.12	Windows x86
sentencepiece-0.2.1-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.12	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp312-cp312-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.12	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp312-cp312-macosx_11_0_arm64.whl	1.3 MB	CPython 3.12	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp312-cp312-macosx_10_13_x86_64.whl	1.3 MB	CPython 3.12	macOS 10.13+ x86-64
sentencepiece-0.2.1-cp312-cp312-macosx_10_13_universal2.whl	1.9 MB	CPython 3.12	macOS 10.13+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp311-cp311-win_arm64.whl	1.0 MB	CPython 3.11	Windows ARM64
sentencepiece-0.2.1-cp311-cp311-win_amd64.whl	1.1 MB	CPython 3.11	Windows x86-64
sentencepiece-0.2.1-cp311-cp311-win32.whl	999.6 kB	CPython 3.11	Windows x86
sentencepiece-0.2.1-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.11	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp311-cp311-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.11	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp311-cp311-macosx_11_0_arm64.whl	1.3 MB	CPython 3.11	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp311-cp311-macosx_10_9_x86_64.whl	1.3 MB	CPython 3.11	macOS 10.9+ x86-64
sentencepiece-0.2.1-cp311-cp311-macosx_10_9_universal2.whl	1.9 MB	CPython 3.11	macOS 10.9+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp310-cp310-win_arm64.whl	1.0 MB	CPython 3.10	Windows ARM64
sentencepiece-0.2.1-cp310-cp310-win_amd64.whl	1.1 MB	CPython 3.10	Windows x86-64
sentencepiece-0.2.1-cp310-cp310-win32.whl	999.5 kB	CPython 3.10	Windows x86
sentencepiece-0.2.1-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.10	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp310-cp310-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.10	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp310-cp310-macosx_11_0_arm64.whl	1.3 MB	CPython 3.10	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp310-cp310-macosx_10_9_x86_64.whl	1.3 MB	CPython 3.10	macOS 10.9+ x86-64
sentencepiece-0.2.1-cp310-cp310-macosx_10_9_universal2.whl	1.9 MB	CPython 3.10	macOS 10.9+ universal2 (ARM64, x86-64)
sentencepiece-0.2.1-cp39-cp39-win_arm64.whl	1.0 MB	CPython 3.9	Windows ARM64
sentencepiece-0.2.1-cp39-cp39-win_amd64.whl	1.1 MB	CPython 3.9	Windows x86-64
sentencepiece-0.2.1-cp39-cp39-win32.whl	999.6 kB	CPython 3.9	Windows x86
sentencepiece-0.2.1-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	1.4 MB	CPython 3.9	manylinux: glibc 2.27+ x86-64; manylinux: glibc 2.28+ x86-64
sentencepiece-0.2.1-cp39-cp39-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl	1.3 MB	CPython 3.9	manylinux: glibc 2.27+ ARM64; manylinux: glibc 2.28+ ARM64
sentencepiece-0.2.1-cp39-cp39-macosx_11_0_arm64.whl	1.3 MB	CPython 3.9	macOS 11.0+ ARM64
sentencepiece-0.2.1-cp39-cp39-macosx_10_9_x86_64.whl	1.3 MB	CPython 3.9	macOS 10.9+ x86-64
sentencepiece-0.2.1-cp39-cp39-macosx_10_9_universal2.whl	1.9 MB	CPython 3.9	macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file sentencepiece-0.2.1.tar.gz.

File metadata

Download URL: sentencepiece-0.2.1.tar.gz
Upload date: Aug 12, 2025
Size: 3.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for sentencepiece-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`8138cec27c2f2282f4a34d9a016e3374cd40e5c6e9cb335063db66a0a3b71fad`
MD5	`7bdf96c179dd79b9aaca4c3cf6fe5047`
BLAKE2b-256	`15152e7a025fc62d764b151ae6d0f2a92f8081755ebe8d4a64099accc6f77ba6`
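
A downloaded archive can be checked against the SHA256 digest listed above before it is installed. The following is a minimal sketch using only the standard library; the helper names sha256_of and matches_digest are illustrative, and the local file path in the usage comment is an assumption about where the download was saved.

```python
import hashlib

# SHA256 digest for sentencepiece-0.2.1.tar.gz, copied from the table above.
EXPECTED_SHA256 = "8138cec27c2f2282f4a34d9a016e3374cd40e5c6e9cb335063db66a0a3b71fad"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected_hex):
    """Return True if the file's SHA256 digest equals `expected_hex` (case-insensitive)."""
    return sha256_of(path) == expected_hex.lower()


# Usage, assuming the archive was saved to the current directory:
# matches_digest("sentencepiece-0.2.1.tar.gz", EXPECTED_SHA256)
```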

Algorithm	Hash digest
SHA256	`105e36e75cbac1292642045458e8da677b2342dcd33df503e640f0b457cb6751`
MD5	`39b56d4dd3fdb48a68ceda51589cec0d`
BLAKE2b-256	`f31654f611fcfc2d1c46cbe3ec4169780b2cfa7cf63708ef2b71611136db7513`

Algorithm	Hash digest
SHA256	`5e4366c97b68218fd30ea72d70c525e6e78a6c0a88650f57ac4c43c63b234a9d`
MD5	`40b930f2dadd054f7422b0c1164e4b40`
BLAKE2b-256	`91d52a69e1ce15881beb9ddfc7e3f998322f5cedcd5e4d244cb74dade9441663`

Algorithm	Hash digest
SHA256	`d3233770f78e637dc8b1fda2cd7c3b99ec77e7505041934188a4e7fe751de3b0`
MD5	`154373ed18c7febd5bf370c55b20207c`
BLAKE2b-256	`667c08ff0012507297a4dd74a5420fdc0eb9e3e80f4e88cab1538d7f28db303d`

Algorithm	Hash digest
SHA256	`733e59ff1794d26db706cd41fc2d7ca5f6c64a820709cb801dc0ea31780d64ab`
MD5	`bd389c512108a1544d62849289a6670b`
BLAKE2b-256	`7eaa553dbe4178b5f23eb28e59393dddd64186178b56b81d9b8d5c3ff1c28395`

Algorithm	Hash digest
SHA256	`010f025a544ef770bb395091d57cb94deb9652d8972e0d09f71d85d5a0816c8c`
MD5	`c24053de542b1b7c2da811428c94cbfc`
BLAKE2b-256	`ef23195b2e7ec85ebb6a547969f60b723c7aca5a75800ece6cc3f41da872d14e`

Algorithm	Hash digest
SHA256	`477c81505db072b3ab627e7eab972ea1025331bd3a92bacbf798df2b75ea86ec`
MD5	`1fb9f91b7bed291cea74b63b7d55d11f`
BLAKE2b-256	`03b0811dae8fb9f2784e138785d481469788f2e0d0c109c5737372454415f55f`

Algorithm	Hash digest
SHA256	`e37e4b4c4a11662b5db521def4e44d4d30ae69a1743241412a93ae40fdcab4bb`
MD5	`c7a62a996c71f37418c2d60933b1353a`
BLAKE2b-256	`77eb7a5682bb25824db8545f8e5662e7f3e32d72a508fdce086029d89695106b`

Algorithm	Hash digest
SHA256	`a19adcec27c524cb7069a1c741060add95f942d1cbf7ad0d104dffa0a7d28a2b`
MD5	`b480fcec49bca7c97968d4a2806eba33`
BLAKE2b-256	`a1115b414b9fae6255b5fb1e22e2ed3dc3a72d3a694e5703910e640ac78346bb`

Algorithm	Hash digest
SHA256	`2005242a16d2dc3ac5fe18aa7667549134d37854823df4c4db244752453b78a8`
MD5	`c00c99d7101fd2b6c8605dd27d10bc72`
BLAKE2b-256	`3289047921cf70f36c7b6b6390876b2399b3633ab73b8d0cb857e5a964238941`

Algorithm	Hash digest
SHA256	`881b2e44b14fc19feade3cbed314be37de639fc415375cefaa5bc81a4be137fd`
MD5	`98bbaaf0bda7af93babc8f1ae6def789`
BLAKE2b-256	`b8cbfe400d8836952cc535c81a0ce47dc6875160e5fedb71d2d9ff0e9894c2a6`

Algorithm	Hash digest
SHA256	`c415c9de1447e0a74ae3fdb2e52f967cb544113a3a5ce3a194df185cbc1f962f`
MD5	`e01e9b4d9d5ffa211b060f2dad51fafb`
BLAKE2b-256	`dcaa956ef729aafb6c8f9c443104c9636489093bb5c61d6b90fc27aa1a865574`

Algorithm	Hash digest
SHA256	`01e6912125cb45d3792f530a4d38f8e21bf884d6b4d4ade1b2de5cf7a8d2a52b`
MD5	`5b6fe750ba893df73b092526b063f05c`
BLAKE2b-256	`fb0335fbe5f3d9a7435eebd0b473e09584bd3cc354ce118b960445b060d33781`

Algorithm	Hash digest
SHA256	`1855f57db07b51fb51ed6c9c452f570624d2b169b36f0f79ef71a6e6c618cd8b`
MD5	`f9b1f853e4a9c1c1ac4bd20265e0de0c`
BLAKE2b-256	`19add5c7075f701bd97971d7c2ac2904f227566f51ef0838dfbdfdccb58cd212`

Algorithm	Hash digest
SHA256	`c83b85ab2d6576607f31df77ff86f28182be4a8de6d175d2c33ca609925f5da1`
MD5	`2f16a3ebb2b54fb5a9ed72bb594f5c7e`
BLAKE2b-256	`ea99bbe054ebb5a5039457c590e0a4156ed073fb0fe9ce4f7523404dd5b37463`

Algorithm	Hash digest
SHA256	`c7f54a31cde6fa5cb030370566f68152a742f433f8d2be458463d06c208aef33`
MD5	`185cf569da3d068e7fe09626f8d034b6`
BLAKE2b-256	`820ba1432bc87f97c2ace36386ca23e8bd3b91fb40581b5e6148d24b24186419`

Algorithm	Hash digest
SHA256	`5d0350b686c320068702116276cfb26c066dc7e65cfef173980b11bb4d606719`
MD5	`19e14aced6a6a7532689282c08cc34c5`
BLAKE2b-256	`249c89eb8b2052f720a612478baf11c8227dcf1dc28cd4ea4c0c19506b5af2a2`

Algorithm	Hash digest
SHA256	`b3616ad246f360e52c85781e47682d31abfb6554c779e42b65333d4b5f44ecc0`
MD5	`2109e629ef40c11564c6935ef58fe24e`
BLAKE2b-256	`88145aee0bf0864df9bd82bd59e7711362908e4935e3f9cdc1f57246b5d5c9b9`

Algorithm	Hash digest
SHA256	`33f068c9382dc2e7c228eedfd8163b52baa86bb92f50d0488bf2b7da7032e484`
MD5	`71e18d7f9ee02d779775adce174d9e6d`
BLAKE2b-256	`c103d332828c4ff764e16c1b56c2c8f9a33488bbe796b53fb6b9c4205ddbf167`

Algorithm	Hash digest
SHA256	`89a3ea015517c42c0341d0d962f3e6aaf2cf10d71b1932d475c44ba48d00aa2b`
MD5	`3ea63e37d4b0c2ae76d288134d08a91f`
BLAKE2b-256	`995eae66c361023a470afcbc1fbb8da722c72ea678a2fcd9a18f1a12598c7501`

Algorithm	Hash digest
SHA256	`0a81799d0a68d618e89063fb423c3001a034c893069135ffe51fee439ae474d6`
MD5	`65f38bd42ce93919511e7287610f778b`
BLAKE2b-256	`4ae8661e5bd82a8aa641fd6c1020bd0e890ef73230a2b7215ddf9c8cd8e941c2`

Algorithm	Hash digest
SHA256	`b81a24733726e3678d2db63619acc5a8dccd074f7aa7a54ecd5ca33ca6d2d596`
MD5	`7e2f365c408232e893117e8e94c8d2eb`
BLAKE2b-256	`bc85c72fd1f3c7a6010544d6ae07f8ddb38b5e2a7e33bd4318f87266c0bbafbf`

Algorithm	Hash digest
SHA256	`ad8493bea8432dae8d6830365352350f3b4144415a1d09c4c8cb8d30cf3b6c3c`
MD5	`50f0bdfc4ef47401c9b1d945130a64b1`
BLAKE2b-256	`997e1fb26e8a21613f6200e1ab88824d5d203714162cf2883248b517deb500b7`

Algorithm	Hash digest
SHA256	`0c0f672da370cc490e4c59d89e12289778310a0e71d176c541e4834759e1ae07`
MD5	`3988ab95de4d89c80378a0325172f3cd`
BLAKE2b-256	`abd91ea0e740591ff4c6fc2b6eb1d7510d02f3fb885093f19b2f3abd1363b402`

Algorithm	Hash digest
SHA256	`8dd4b477a7b069648d19363aad0cab9bad2f4e83b2d179be668efa672500dc94`
MD5	`07f9f1fb632947cd09fa773e20bc134b`
BLAKE2b-256	`4ab608fe2ce819e02ccb0296f4843e3f195764ce9829cbda61b7513f29b95718`

Algorithm	Hash digest
SHA256	`ac650534e2251083c5f75dde4ff28896ce7c8904133dc8fef42780f4d5588fcd`
MD5	`7461348e0e044074ab04fb7bc17dec4a`
BLAKE2b-256	`c93a76488a00ea7d6931689cda28726a1447d66bf1a4837943489314593d5596`

Algorithm	Hash digest
SHA256	`10ed3dab2044c47f7a2e7b4969b0c430420cdd45735d78c8f853191fa0e3148b`
MD5	`d0f2217c649b8f2c178d5555310509a2`
BLAKE2b-256	`dde9932b9eae6fd7019548321eee1ab8d5e3b3d1294df9d9a0c9ac517c7b636d`

Algorithm	Hash digest
SHA256	`92b3816aa2339355fda2c8c4e021a5de92180b00aaccaf5e2808972e77a4b22f`
MD5	`5dff1b6182d6842d338d6a5a2ddf11fd`
BLAKE2b-256	`acddf7774d42a881ced8e1739f393ab1e82ece39fc9abd4779e28050c2e975b5`

Algorithm	Hash digest
SHA256	`c7f0fd2f2693309e6628aeeb2e2faf6edd221134dfccac3308ca0de01f8dab47`
MD5	`70ac4dedd8af4cbebbaa0b7ae74e891f`
BLAKE2b-256	`96df0cfe748ace5485be740fed9476dee7877f109da32ed0d280312c94ec259f`

Algorithm	Hash digest
SHA256	`d7b670879c370d350557edabadbad1f6561a9e6968126e6debca4029e5547820`
MD5	`47d92503e98dd4ba423ef0de9362dfdc`
BLAKE2b-256	`2cd2f552be5928105588f4f4d66ee37dd4c61460d8097e62d0e2e0eec41bc61d`

Algorithm	Hash digest
SHA256	`097f3394e99456e9e4efba1737c3749d7e23563dd1588ce71a3d007f25475fff`
MD5	`a512f65d36f8b3c48859ea7f33f17f25`
BLAKE2b-256	`8dde5a007fb53b1ab0aafc69d11a5a3dd72a289d5a3e78dcf2c3a3d9b14ffe93`