Skip to main content

A high-performance Serbian stemming library supporting both Cyrillic and Latin scripts (Ekavica).

Project description

Serb-Stem 🇷🇸⚡

Serb-Stem Header

Rust Python WebAssembly License

„Ponuda koju tvoj NLP procesor ne može da odbije.“

Prestanite da gubite vreme na spora, neprecizna rešenja koja "pucaju" na ćirilici. Serb-Stem nije samo biblioteka — to je nepravedna prednost za tvoj pretraživač.

✨ Šta dobijaš (The Grand Slam Offer)

  • 🚀 Brzina Svetlosti (<1µs Latency): Dok drugi učitavaju rečnike, ti si već ostemovao celu bazu. Rust motor radi na metalu — bez smeća, bez čekanja.
  • 🎯 Hirurška Preciznost (98.35% Acc): Naš algoritam ne nagađa. On poznaje srpsku gramatiku bolje od tvoje profesorke iz srednje.
  • 💪 Universal Script Engine: Ćirilica? Latinica? Ijekavica? Serb-Stem sve žvaće i izbacuje savršen ekavski koren spreman za indeksiranje.
  • 🏗️ Zero-Effort Integration: pip install i gotov si. Nema kompajliranja, nema zavisnosti, nema glavobolje.

📉 Jednačina Vrednosti (Value Equation)

  • Dream Outcome: Savršena pretraga i analiza srpskog teksta u realnom vremenu.
  • Likelihood of Success: 98.35% verifikovan korpus + Rust memorijska sigurnost.
  • Time Delay: NULA. Od instalacije do prvog stem() poziva treba ti 30 sekundi. Latencija obrade je bukvalno nevidljiva.
  • Effort & Sacrifice: NULA. Handling oba pisma i ekavizaciju radimo mi. Ti samo šalješ stringove.

🛠️ Instalacija i Korišćenje

🐍 Python

pip install serb-stem
import serb_stem

# Latino ulaz
print(serb_stem.stem_py("knjigama"))  # Output: "knjig"

# Ćirilični ulaz
print(serb_stem.stem_py("књигама"))  # Output: "књиг"

# Ekavizacija (mlijeko -> mlek)
print(serb_stem.stem_py("mlijeka"))   # Output: "mlek"

🦀 Rust

use serb_stem::stem;

let result = stem("učenici");
assert_eq!(result, "učenik");

🌐 Interaktivni Demo

U okviru projekta nalazi se i /portal (React + Vite + WASM) koji omogućava testiranje stemmera direktno u vašem browseru uz vizuelni prikaz rezultata i vremena obrade.

📜 Licenca

Ovaj projekat je licenciran pod AGPL-3.0 licencom.


Developed with ❤️ by Ja1Denis & Antigravity AI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

serb_stem-0.1.2.tar.gz (785.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

serb_stem-0.1.2-cp311-cp311-win_amd64.whl (105.4 kB view details)

Uploaded CPython 3.11Windows x86-64

File details

Details for the file serb_stem-0.1.2.tar.gz.

File metadata

  • Download URL: serb_stem-0.1.2.tar.gz
  • Upload date:
  • Size: 785.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.11.5

File hashes

Hashes for serb_stem-0.1.2.tar.gz
Algorithm Hash digest
SHA256 c9cd7ef58a679843bcb8eb7b47309458d23c02df228f648c54e26f5e76dbf1bf
MD5 080d7a777898c3be65f1f6192dc33ac6
BLAKE2b-256 b351a8b3a2ecc39be95d8c6f16ea8bdbbb0ec45b794ef5fba0398ee496294ec4

See more details on using hashes here.

File details

Details for the file serb_stem-0.1.2-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for serb_stem-0.1.2-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 53e12cbd60c28010bd7f0f8357b331c0fb6d638507a7dc5bd8d9acd81454d347
MD5 16f9e31e3cd159d34e2c63c907808de0
BLAKE2b-256 26803282ad96b0524d0ecf91406af897eac21d809ba47e8056d4e6f40c7eec66

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page