Python API for the Turkish Language Foundation
tdk-py is a Python package that provides simple access to the Turkish dictionaries published by the TDK (Türk Dil Kurumu), the Turkish Language Society. It queries the TDK internally and parses the responses into easy-to-use Python class objects.
Installation
tdk-py is supported on Python 3.6+. The recommended way to install it is via pip, which comes with Python.
pip install tdk-py
If your machine doesn’t have Python and pip installed, you can download them from the Python Software Foundation’s website.
Sample usage
tdk.gts is used to access TDK’s GTS, the up-to-date Turkish dictionary (Güncel Türkçe Sözlük).
>>> import tdk.gts
>>> tdk.gts.search("merkeziyetçilik")
[<Entry 41635 (merkeziyetçilik)>]
tdk.gts.search returns a list because more than one word can have exactly the same spelling.
>>> for number, entry in enumerate(tdk.gts.search("bar")):
...     for meaning in entry.meanings:
...         print(number+1, entry.entry, meaning.meaning)
...
1 bar Anadolu'nun doğu ve kuzey bölgesinde, en çok Artvin ve Erzurum yörelerinde el ele tutuşularak oynanan, ağır ritimli bir halk oyunu
2 bar Danslı, içkili eğlence yeri
2 bar Ayaküstü içki içilen eğlence yeri
2 bar Amerikan bar
3 bar Hava basıncı birimi
4 bar Ateşten, mide bozukluğundan, ağızda, dil ve dişlerde meydana gelen acılık, pas
5 bar Halter sporunda ağırlığı oluşturan kiloları birbirine bağlayan metal çubuk
>>> # 5 different words! One of them (#2) has multiple meanings!
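If you want to keep the meanings grouped per homograph rather than printing them, a small helper can collect them into a dictionary. The sketch below is not part of tdk-py: `group_meanings` is a hypothetical name, and it relies only on the `meanings` and `meaning` attributes used in the session above. The sample data consists of stand-in objects so the example runs without querying the TDK.

```python
from types import SimpleNamespace


def group_meanings(entries):
    """Map each entry's 1-based position to the list of its meaning texts.

    Hypothetical helper: it assumes only that every entry has a `meanings`
    list whose items expose a `meaning` string, as in the session above.
    """
    grouped = {}
    for number, entry in enumerate(entries, start=1):
        grouped[number] = [m.meaning for m in entry.meanings]
    return grouped


# Stand-in objects that mimic the attributes used above (not real TDK data objects).
sample = [
    SimpleNamespace(
        entry="bar",
        meanings=[SimpleNamespace(meaning="Hava basıncı birimi")],
    ),
    SimpleNamespace(
        entry="bar",
        meanings=[
            SimpleNamespace(meaning="Danslı, içkili eğlence yeri"),
            SimpleNamespace(meaning="Amerikan bar"),
        ],
    ),
]

grouped = group_meanings(sample)
for number, meanings in grouped.items():
    print(number, len(meanings))
```

With a live connection, the same helper should accept the list returned by tdk.gts.search("bar") directly, provided the entry objects expose the attributes shown above.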
You can query suggestions for misspelt words or for other similar words.
>>> from difflib import get_close_matches
>>> get_close_matches("feldispat", tdk.gts.index())
['feldspat', 'ispat', 'fesat']
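The number and strictness of suggestions can be tuned with the `n` and `cutoff` parameters of the standard library's `get_close_matches`. The following is a minimal sketch: `suggest` is a hypothetical wrapper, not a tdk-py function, and `sample_index` is a tiny hand-written word list standing in for the full index so the example runs offline.

```python
from difflib import get_close_matches


def suggest(word, index, n=3, cutoff=0.6):
    """Return up to `n` words from `index` whose similarity to `word`
    is at least `cutoff` (a ratio between 0 and 1), best match first."""
    return get_close_matches(word, index, n=n, cutoff=cutoff)


# Assumption: a small stand-in for the word list returned by tdk.gts.index().
sample_index = ["bar", "fesat", "feldspat", "ispat", "merkeziyetçilik"]

# Default settings: loosely similar words are allowed.
print(suggest("feldispat", sample_index))

# A stricter cutoff keeps only near-identical spellings.
print(suggest("feldispat", sample_index, cutoff=0.9))
```

A higher cutoff is a reasonable choice for spelling correction, while the default is more useful when browsing for related-looking words.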
You can perform complex analyses very easily. Let’s see the distribution of entries by their maximum number of consecutive consonants.
>>> from tdk.tools import max_streak
>>> from tdk.alphabet import CONSONANTS
>>> annotated_dict = {}
>>> for entry in tdk.gts.index():
...     streaks = max_streak(entry)
...     if streaks not in annotated_dict:
...         annotated_dict[streaks] = [entry]
...     else:
...         annotated_dict[streaks].append(entry)
...
>>> for i in set(annotated_dict):
...     print(i, len(annotated_dict[i]))
...
0 19
1 15199
2 73511
3 3605
4 68
5 5
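To make the idea behind this analysis concrete, here is a self-contained sketch of the same kind of computation. It does not reproduce the implementation of tdk.tools.max_streak: `longest_consonant_run`, `run_distribution`, and `TURKISH_CONSONANTS` are hypothetical names, the consonant set is written out by hand, and the word list is a small illustrative sample rather than the TDK index.

```python
from collections import Counter

# Assumption: the 21 consonants of the Turkish alphabet, listed by hand
# (this is not the CONSONANTS constant from tdk.alphabet).
TURKISH_CONSONANTS = frozenset("bcçdfgğhjklmnprsştvyz")


def longest_consonant_run(word, consonants=TURKISH_CONSONANTS):
    """Return the length of the longest run of consecutive consonants."""
    longest = current = 0
    for ch in word.lower():
        if ch in consonants:
            current += 1
            longest = max(longest, current)
        else:
            # Any vowel, space, or other character ends the current run.
            current = 0
    return longest


def run_distribution(words):
    """Count how many words have each maximum run length."""
    return Counter(longest_consonant_run(w) for w in words)


# A small hand-picked sample instead of the full index.
words = ["bar", "feldspat", "ispat", "merkeziyetçilik", "o"]
for run_length, count in sorted(run_distribution(words).items()):
    print(run_length, count)
```

Using a Counter avoids the explicit "create the list if the key is missing" branch when only the counts are needed; keep a dictionary of lists, as in the session above, if you also want to inspect the words themselves.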
License
tdk-py’s source code is provided under the MIT License.
Copyright © 2021-2023 Emre Özcan
File details
Details for the file tdk-py-1.3.0.tar.gz.
File metadata
- Download URL: tdk-py-1.3.0.tar.gz
- Upload date:
- Size: 12.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | 40b3c116870d05f722d65bfe456c8646c0b29e30fe56fda84e3b068a1a1a8110
MD5 | 04eab012cc1097424c1a51f544878b93
BLAKE2b-256 | f3839929f63c027d28f9aed2d38b7c9fbe5233cb74a2d88cd33725e6efc2ec51
File details
Details for the file tdk_py-1.3.0-py3-none-any.whl.
File metadata
- Download URL: tdk_py-1.3.0-py3-none-any.whl
- Upload date:
- Size: 13.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | fc4b2ce627a3de50abe46dda86a39b5507dd7395b659f94d16c2ed48d2a8632a
MD5 | b0c6118a7f513f819b9c56340b661d1c
BLAKE2b-256 | 11423f56d64fe1427a80a444b3d0c1054c5b27b00cedacc61cba21a00746be6b