ACAutomaton Python wrapper with Unicode support
# About
The ACAutomaton Python package provides a high-performance multi-string lookup data structure.
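To make "multi-string lookup" concrete, here is a minimal pure-Python sketch of an Aho-Corasick style automaton. It assumes that "AC" in the package name refers to Aho-Corasick, which this README does not state. The class `TinyAhoCorasick` and its `match_all` method are hypothetical names used for illustration only; this is not the package's implementation and makes no claim about its performance or exact output.

```python
from collections import deque


class TinyAhoCorasick:
    """Illustrative automaton (hypothetical; not the ACAutomaton extension)."""

    def __init__(self):
        self.goto = [{}]      # per-state transitions: symbol -> next state
        self.word = [None]    # word that ends exactly at this state, if any
        self.fail = [0]       # failure links, recomputed by build()
        self.out = [[]]       # words reported at each state, set by build()

    def insert(self, word):
        state = 0
        for symbol in word:
            if symbol not in self.goto[state]:
                self.goto.append({})
                self.word.append(None)
                self.goto[state][symbol] = len(self.goto) - 1
            state = self.goto[state][symbol]
        self.word[state] = word

    def build(self):
        # Recompute everything from scratch so build() is safe to call again
        # after further insert() calls.
        self.fail = [0] * len(self.goto)
        self.out = [[w] if w is not None else [] for w in self.word]
        queue = deque(self.goto[0].values())  # depth-1 states fail to the root
        while queue:
            state = queue.popleft()
            for symbol, child in self.goto[state].items():
                f = self.fail[state]
                while f and symbol not in self.goto[f]:
                    f = self.fail[f]
                self.fail[child] = self.goto[f].get(symbol, 0)
                # Inherit the words that end at the failure target.
                self.out[child] = self.out[child] + self.out[self.fail[child]]
                queue.append(child)

    def match_all(self, text):
        hits, state = [], 0
        for i, symbol in enumerate(text):
            while state and symbol not in self.goto[state]:
                state = self.fail[state]
            state = self.goto[state].get(symbol, 0)
            for word in self.out[state]:
                hits.append((i - len(word) + 1, word))  # (start offset, word)
        return hits


ac = TinyAhoCorasick()
for w in ("11", "22", "33"):
    ac.insert(w)
ac.build()
print(ac.match_all("0011222333"))
# → [(2, '11'), (4, '22'), (5, '22'), (7, '33'), (8, '33')]
```

The text is scanned once, and failure links let the scan fall back to the longest matching suffix instead of restarting, which is why this family of structures suits searching for many words at the same time.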
# Notice
1. To insert a Unicode string, encode it to a byte string first.
2. After inserting a new word into the ACAutomaton, remember to call the build method. You can call build multiple times.
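The two notes above can be sketched as a small loader: encode each word to bytes, insert it, and call build once after the last insert. The helpers `to_utf8_bytes` and `load_words` and the class `RecordingAutomaton` are hypothetical names introduced here so the sketch runs without the extension installed; only `insert` and `build` come from this README. UTF-8 is assumed as the encoding, and whether the extension accepts `bytes` under Python 3 is not stated here.

```python
def to_utf8_bytes(word):
    """Return `word` as UTF-8 bytes; values that are already bytes pass through."""
    if isinstance(word, bytes):
        return word
    return word.encode("utf-8")


def load_words(automaton, words):
    """Insert every word as bytes, then call build() once at the end."""
    for word in words:
        automaton.insert(to_utf8_bytes(word))
    automaton.build()
    return automaton


class RecordingAutomaton:
    """Hypothetical stand-in that only records calls to insert() and build()."""

    def __init__(self):
        self.words = []
        self.builds = 0

    def insert(self, word):
        self.words.append(word)

    def build(self):
        self.builds += 1


demo = load_words(RecordingAutomaton(), [u"你好", b"abc"])
print(demo.words, demo.builds)
```

With the real package, an `ACAutomaton()` instance would presumably take the place of `RecordingAutomaton()`.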
# Install
pip install ACAutomaton
# Usage
    >>> from ACAutomaton import ACAutomaton
    >>> a = ACAutomaton()
    >>> a.insert('11')
    >>> a.insert('22')
    >>> a.insert('33')
    >>> a.build()
    >>> a.matchOne('0011222333')
    (2, '11')
    >>> a.matchOne('00')
    (-1, None)
    >>> a.matchAll('0011222333')
    [(2, '11'), (4, '22'), (5, '22'), (7, '33'), (7, '33'), (8, '33'), (8, '33')]
Example with Unicode strings (the positions returned are byte offsets into the encoded text):
    >>> from ACAutomaton import ACAutomaton
    >>> a = ACAutomaton()
    >>> a.insert('你好')
    >>> a.insert('你坏')
    >>> a.insert('你')
    >>> a.build()
    >>> a.matchOne('你好你坏你')
    (0, '\xe4\xbd\xa0')
    >>> a.matchAll('你好你坏你不存在')
    [(0, '\xe4\xbd\xa0'), (0, '\xe4\xbd\xa0\xe5\xa5\xbd'), (6, '\xe4\xbd\xa0'), (6, '\xe4\xbd\xa0\xe5\x9d\x8f'), (12, '\xe4\xbd\xa0')]
    >>> a.matchAll('不存在')
    []
    >>> a.insert('不存在')
    >>> a.build()
    >>> a.matchAll('不存在')
    [(0, '\xe4\xb8\x8d\xe5\xad\x98\xe5\x9c\xa8')]
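The positions 0, 6 and 12 in the output above are consistent with byte offsets into UTF-8 text, where each of these characters takes three bytes. If character positions are needed, a byte offset can be converted as in the following Python 3 sketch. The helper `byte_to_char_offset` is a hypothetical name and not part of the package, and UTF-8 is assumed as the encoding.

```python
def byte_to_char_offset(text, byte_offset, encoding="utf-8"):
    """Convert a byte offset in the encoded text to a character index.

    Raises UnicodeDecodeError if byte_offset falls inside a multi-byte character.
    """
    data = text.encode(encoding)
    # Decode only the prefix before the offset and count its characters.
    return len(data[:byte_offset].decode(encoding))


text = "你好你坏你"
print([byte_to_char_offset(text, b) for b in (0, 6, 12)])
# → [0, 2, 4]
```

The matched words themselves can be turned back into text with `word.decode('utf-8')`.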
Source Distribution
ACAutomaton-1.0.3.tar.gz (3.8 kB)
File details
Details for the file ACAutomaton-1.0.3.tar.gz.
File metadata
- Download URL: ACAutomaton-1.0.3.tar.gz
- Upload date:
- Size: 3.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Python-urllib/2.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | b09281ee10ef3b9f3b0c4f55074ac32129024b60c5909e1ca970a72dad60847f
MD5 | f459c23464b0dca64ac790f98437c59c
BLAKE2b-256 | 9a2b06697f1a3f3d830d08773302b0251d41bcb21d5cea608345b493df727c00
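A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; the helper name `sha256_of_file` and the local file path are assumptions.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Hypothetical usage, assuming the archive sits in the current directory:
# expected = "b09281ee10ef3b9f3b0c4f55074ac32129024b60c5909e1ca970a72dad60847f"
# print(sha256_of_file("ACAutomaton-1.0.3.tar.gz") == expected)
```

`pip` can perform the same check during installation when a requirements file pins the digest with `--hash`.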