A library for processing Code Mixed Text. Still in development!
Project description
CMTT is a wrapper library for processing code-mixed text. More documentation is on the way!
Installation
pip install cmtt
Get started
How to use this library:
from cmtt.data import *
# Loading json files
result_json = load_url('https://world.openfoodfacts.org/api/v0/product/5060292302201.json')
# Loading csv files
result_csv = load_url('https://gist.githubusercontent.com/rnirmal/e01acfdaf54a6f9b24e91ba4cae63518/raw/b589a5c5a851711e20c5eb28f9d54742d1fe2dc/datasets.csv')
# List the key properties available for the datasets provided by the cmtt library
keys = list_dataset_keys()
# List all datasets provided by cmtt
# Specifying the 'key' property returns the dataset names along with their respective 'key' values
# Specifying 'all' as the 'key' returns all the information available for every dataset
data = list_cmtt_datasets()
# Download multiple datasets provided by cmtt, returning a list of paths where the datasets get downloaded
# The datasets are downloaded into a new 'cmtt' directory inside the root directory of the operating system
lst = download_cmtt_datasets(["linc_ner_hineng", "L3Cube_HingLID_all", "linc_lid_spaeng"])
# Download a dataset from a url, returning the path where the dataset gets downloaded
# The dataset is downloaded into a new 'datasets' directory inside the current working directory
path = download_dataset_url('https://world.openfoodfacts.org/api/v0/product/5060292302201.json')
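The calls above fetch a remote file and either parse it or save it locally. As a rough illustration of that idea, here is a minimal sketch using only the standard library. It is not cmtt's implementation: `parse_payload` and `local_path_for` are hypothetical helpers, and choosing the parser from the URL's file extension is an assumption made for this example.

```python
import csv
import io
import json
from pathlib import Path, PurePosixPath
from urllib.parse import urlparse


def parse_payload(text, url):
    """Parse already-downloaded text as JSON or CSV, chosen by the URL's extension.

    Hypothetical helper for illustration; cmtt's load_url may work differently.
    """
    suffix = PurePosixPath(urlparse(url).path).suffix.lower()
    if suffix == ".json":
        return json.loads(text)
    if suffix == ".csv":
        # Each CSV row becomes a dict keyed by the header line.
        return list(csv.DictReader(io.StringIO(text)))
    raise ValueError(f"unsupported file type: {suffix!r}")


def local_path_for(url, base_dir="datasets"):
    """Return where a file fetched from `url` would be stored under `base_dir`.

    Hypothetical helper; the 'datasets' default mirrors the comment above,
    but the naming scheme is an assumption.
    """
    filename = PurePosixPath(urlparse(url).path).name
    return Path(base_dir) / filename
```

Keeping parsing separate from fetching means the same function can be exercised with in-memory strings, without touching the network.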
Download files
Source Distribution
cmtt-0.1.0.tar.gz (8.9 kB)
Built Distribution
cmtt-0.1.0-py3-none-any.whl (8.9 kB)
File details
Details for the file cmtt-0.1.0.tar.gz.
File metadata
- Download URL: cmtt-0.1.0.tar.gz
- Upload date:
- Size: 8.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 6d799d7fa903b643a4f8a5c965b467a2208443804cf150be00020769ea4048c7
MD5 | b6175f3d52f212d36a9ec1165eef334f
BLAKE2b-256 | eb6b8955e0dfc434b2b7fb3204c7f72aca6e36d81a6d8e64786b25be8d6bee03
File details
Details for the file cmtt-0.1.0-py3-none-any.whl.
File metadata
- Download URL: cmtt-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | a5ef29185d19f826215bb45d160718d5bdd1705cb96d73ebffcff43f532c2b73
MD5 | 59d8993e851bb76a39a860c3f4fba57c
BLAKE2b-256 | c9527fb76e206c9ca0622f37950b149ed711e7ce7421d986cea65e04afa2efcc
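
The published digests can be used to check that a downloaded file is intact. The following is a minimal sketch using Python's standard `hashlib` module; `sha256_of` and `matches_published_digest` are hypothetical helper names, and the file path in the usage note assumes the archive sits in the current directory.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's digest with a published one, ignoring case and padding."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_published_digest("cmtt-0.1.0.tar.gz", "6d799d7fa903b643a4f8a5c965b467a2208443804cf150be00020769ea4048c7")` should return `True` for an unmodified copy of the source distribution.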