SentencePiece Python Wrapper
Python wrapper for SentencePiece. This API provides encoding, decoding, and training of SentencePiece models.
Build and Install SentencePiece
For Linux (x64/i686), macOS, and Windows (win32/x64) environments, you can simply use the pip command to install the SentencePiece Python module.
% pip install sentencepiece
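To verify the installation, import the module and print its version (recent releases expose a __version__ attribute; a successful import alone also confirms the installation):

% python -c 'import sentencepiece; print(sentencepiece.__version__)'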
To build and install the Python wrapper from source, install the SentencePiece C++ library first, then run the following commands:
% python setup.py build
% sudo python setup.py install
If you don’t have write permission to the global site-packages directory or don’t want to install into it, please try:
% python setup.py install --user
Usage
See this Google Colab page to run SentencePiece interactively. (Note: this sample is written in the old interface.)
Segmentation
% python
>>> import sentencepiece as spm
>>> sp = spm.SentencePieceProcessor(model_file='test/test_model.model')
>>> sp.encode('This is a test')
[284, 47, 11, 4, 15, 400]
>>> sp.encode(['This is a test', 'Hello world'], out_type=int)
[[284, 47, 11, 4, 15, 400], [151, 88, 21, 887]]
>>> sp.encode('This is a test', out_type=str)
['▁This', '▁is', '▁a', '▁', 't', 'est']
>>> sp.encode(['This is a test', 'Hello world'], out_type=str)
[['▁This', '▁is', '▁a', '▁', 't', 'est'], ['▁He', 'll', 'o', '▁world']]
>>> for _ in range(10):
...     sp.encode('This is a test', out_type=str, enable_sampling=True, alpha=0.1, nbest_size=-1)
...
['▁', 'This', '▁', 'is', '▁a', '▁', 't', 'e', 'st']
['▁T', 'h', 'i', 's', '▁is', '▁a', '▁', 'te', 's', 't']
['▁T', 'h', 'is', '▁', 'is', '▁', 'a', '▁', 't', 'est']
['▁', 'This', '▁is', '▁', 'a', '▁', 't', 'e', 'st']
['▁', 'This', '▁', 'is', '▁', 'a', '▁', 't', 'e', 's', 't']
['▁This', '▁is', '▁a', '▁', 'te', 's', 't']
['▁This', '▁is', '▁', 'a', '▁', 't', 'e', 'st']
['▁', 'T', 'h', 'is', '▁', 'is', '▁', 'a', '▁', 'te', 'st']
['▁', 'This', '▁', 'i', 's', '▁a', '▁', 't', 'e', 'st']
['▁This', '▁', 'is', '▁a', '▁', 't', 'est']
>>> sp.decode([284, 47, 11, 4, 15, 400])
'This is a test'
>>> sp.decode([[284, 47, 11, 4, 15, 400], [151, 88, 21, 887]])
['This is a test', 'Hello world']
>>> sp.decode(['▁', 'This', '▁', 'is', '▁a', '▁', 't', 'e', 'st'])
'This is a test'
>>> sp.decode([['▁This', '▁is', '▁a', '▁', 't', 'est'], ['▁He', 'll', 'o', '▁world']])
['This is a test', 'Hello world']
>>> sp.get_piece_size()
1000
>>> sp.id_to_piece(2)
'</s>'
>>> sp.id_to_piece([2, 3, 4])
['</s>', '\r', '▁']
>>> sp.piece_to_id('<s>')
1
>>> sp.piece_to_id(['</s>', '\r', '▁'])
[2, 3, 4]
>>> len(sp)
1000
>>> sp['</s>']
2
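In the sampling example above, enable_sampling=True turns on subword regularization: nbest_size=-1 samples from the full segmentation lattice instead of a fixed n-best list, and alpha is the smoothing parameter (smaller values yield more diverse segmentations). Below is a minimal sketch of the typical use, resampling the segmentation of each sentence every epoch; the loop structure is illustrative, not part of the library:

import sentencepiece as spm

sp = spm.SentencePieceProcessor(model_file='test/test_model.model')

# Draw a fresh segmentation each epoch so a downstream model sees
# several subword decompositions of the same sentence.
sentences = ['This is a test', 'Hello world']
for epoch in range(3):
    for s in sentences:
        ids = sp.encode(s, out_type=int, enable_sampling=True,
                        alpha=0.1, nbest_size=-1)
        # Any sampled segmentation decodes back to the original text.
        assert sp.decode(ids) == s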
Model Training
Training is performed by passing the parameters of spm_train to the SentencePieceTrainer.train() function.
>>> import sentencepiece as spm
>>> spm.SentencePieceTrainer.train(input='test/botchan.txt', model_prefix='m', vocab_size=1000, user_defined_symbols=['foo', 'bar'])
sentencepiece_trainer.cc(73) LOG(INFO) Starts training with :
trainer_spec {
input: test/botchan.txt
.. snip
unigram_model_trainer.cc(500) LOG(INFO) EM sub_iter=1 size=1188 obj=10.2839 num_tokens=32182 num_tokens/piece=27.0892
unigram_model_trainer.cc(500) LOG(INFO) EM sub_iter=0 size=1100 obj=10.4269 num_tokens=33001 num_tokens/piece=30.0009
unigram_model_trainer.cc(500) LOG(INFO) EM sub_iter=1 size=1100 obj=10.4069 num_tokens=33002 num_tokens/piece=30.0018
trainer_interface.cc(595) LOG(INFO) Saving model: m.model
trainer_interface.cc(619) LOG(INFO) Saving vocabs: m.vocab
>>>
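The trained model can be loaded with the same SentencePieceProcessor interface shown above. A short sketch (the exact IDs assigned to the user-defined symbols depend on the trained model):

import sentencepiece as spm

# Load the model written by the training run above.
sp = spm.SentencePieceProcessor(model_file='m.model')
assert sp.get_piece_size() == 1000        # matches vocab_size above
# user_defined_symbols are guaranteed to be in the vocabulary.
assert sp.piece_to_id('foo') != sp.unk_id()
print(sp.encode('This is a test', out_type=str))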
Segmentation (old interface)
% python
>>> import sentencepiece as spm
>>> sp = spm.SentencePieceProcessor()
>>> sp.Load("test/test_model.model")
True
>>> sp.EncodeAsPieces("This is a test")
['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'est']
>>> sp.EncodeAsIds("This is a test")
[284, 47, 11, 4, 15, 400]
>>> sp.DecodePieces(['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'est'])
'This is a test'
>>> sp.NBestEncodeAsPieces("This is a test", 5)
[['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'est'], ['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 'te', 'st'], ['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 'te', 's', 't'], ['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'e', 'st'], ['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'es', 't']]
>>> for x in range(10):
...     sp.SampleEncodeAsPieces("This is a test", -1, 0.1)
...
['\xe2\x96\x81', 'T', 'h', 'i', 's', '\xe2\x96\x81', 'is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'e', 's', 't']
['\xe2\x96\x81T', 'h', 'is', '\xe2\x96\x81is', '\xe2\x96\x81', 'a', '\xe2\x96\x81', 't', 'est']
['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81', 'a', '\xe2\x96\x81', 't', 'e', 'st']
['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'e', 'st']
['\xe2\x96\x81This', '\xe2\x96\x81is', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'e', 's', 't']
['\xe2\x96\x81T', 'h', 'is', '\xe2\x96\x81', 'i', 's', '\xe2\x96\x81a', '\xe2\x96\x81', 'te', 's', 't']
['\xe2\x96\x81This', '\xe2\x96\x81', 'is', '\xe2\x96\x81a', '\xe2\x96\x81', 'te', 's', 't']
['\xe2\x96\x81This', '\xe2\x96\x81', 'i', 's', '\xe2\x96\x81a', '\xe2\x96\x81', 't', 'e', 'st']
['\xe2\x96\x81This', '\xe2\x96\x81', 'is', '\xe2\x96\x81', 'a', '\xe2\x96\x81', 't', 'e', 'st']
['\xe2\x96\x81This', '\xe2\x96\x81', 'i', 's', '\xe2\x96\x81', 'a', '\xe2\x96\x81', 'te', 's', 't']
>>> sp.DecodeIds([284, 47, 11, 4, 15, 400])
'This is a test'
>>> sp.GetPieceSize()
1000
>>> sp.IdToPiece(2)
'</s>'
>>> sp.PieceToId('</s>')
2
>>> len(sp)
1000
>>> sp['</s>']
2
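These CamelCase methods are the legacy spellings of the snake_case methods shown earlier; with the same test model loaded, the two interfaces return identical results:

>>> sp.EncodeAsIds('This is a test') == sp.encode('This is a test', out_type=int)
True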
Model Training (old interface)
Training is performed by passing the parameters of spm_train to the SentencePieceTrainer.Train() function.
>>> import sentencepiece as spm
>>> spm.SentencePieceTrainer.Train('--input=test/botchan.txt --model_prefix=m --vocab_size=1000')
unigram_model_trainer.cc(494) LOG(INFO) Starts training with :
input: "test/botchan.txt"
model_prefix: "m"
model_type: UNIGRAM
..snip..
unigram_model_trainer.cc(529) LOG(INFO) EM sub_iter=0 size=1239 obj=10.4055 num_tokens=36256 num_tokens/piece=29.2623
unigram_model_trainer.cc(529) LOG(INFO) EM sub_iter=1 size=1239 obj=10.3187 num_tokens=36256 num_tokens/piece=29.2623
unigram_model_trainer.cc(529) LOG(INFO) EM sub_iter=0 size=1100 obj=10.5285 num_tokens=37633 num_tokens/piece=34.2118
unigram_model_trainer.cc(529) LOG(INFO) EM sub_iter=1 size=1100 obj=10.4973 num_tokens=37630 num_tokens/piece=34.2091
trainer_interface.cc(284) LOG(INFO) Saving model: m.model
trainer_interface.cc(293) LOG(INFO) Saving vocabs: m.vocab
>>>
Python2/3 String/Unicode compatibility
The SentencePiece Python wrapper accepts both Unicode strings and legacy byte strings.
The output string type is determined by the input string type.
The output type of the IdToPiece/DecodeIds methods is str, but note that it is a legacy byte string in Python 2 and a Unicode string in Python 3.
Python 2:
>>> sp.EncodeAsPieces('吾輩は猫である')
['\xe2\x96\x81', '\xe5\x90\xbe', '\xe8\xbc\xa9', '\xe3\x81\xaf', '\xe7\x8c\xab', '\xe3\x81\xa7\xe3\x81\x82\xe3\x82\x8b']
>>> sp.EncodeAsPieces(u'吾輩は猫である')
[u'\u2581', u'\u543e', u'\u8f29', u'\u306f', u'\u732b', u'\u3067\u3042\u308b']
>>> sp.EncodeAsPieces(u'吾輩は猫である'.encode('utf-8'))
['\xe2\x96\x81', '\xe5\x90\xbe', '\xe8\xbc\xa9', '\xe3\x81\xaf', '\xe7\x8c\xab', '\xe3\x81\xa7\xe3\x81\x82\xe3\x82\x8b']
>>> sp.IdToPiece(10)
'\xe3\x81\xab'
>>> type(sp.IdToPiece(10))
<type 'str'>
Python 3:
>>> sp.EncodeAsPieces('吾輩は猫である')
['▁', '吾', '輩', 'は', '猫', 'である']
>>> sp.EncodeAsPieces('吾輩は猫である'.encode('utf-8'))
[b'\xe2\x96\x81', b'\xe5\x90\xbe', b'\xe8\xbc\xa9', b'\xe3\x81\xaf', b'\xe7\x8c\xab', b'\xe3\x81\xa7\xe3\x81\x82\xe3\x82\x8b']
>>>
>>> sp.IdToPiece(10)
'に'
>>> type(sp.IdToPiece(10))
<class 'str'>