Skip to main content

multilingual emoji prediction

Project description

Bertmoticon

The Bertmoticon package is fine-tuned from the BERT model, to the emoji prediction task. It can predict emojis in 102 languages. In this package we include two functions that enable the use of it: bertmoticon.infer and bertmoticon.infer_mappings. The number of emojis available for this model are 80; and are listed in bertmoticon.emojis.

Installation

Installing the Bertmoticon package from PyPI using:

pip3 install bertmoticon

Importing in python

Importing the package can be done as:

import bertmoticon

If the model is not already downloaded; upon first run it will download and extract the model automatically as such:

Downloading bermoticon model
[=                                                          ]
...
[==================                                         ]
...
[===========================================================]
Extracting the model

The model is not included with the pypi installation. It requires 1.34 GB. Loads it either into CUDA or CPU based on CUDA availability.

Usage

bertmoticon.emojis

The model can predict up to 80 emojis. Acceessing the emojis can be done by calling the global variable emojis called as bertmoticon.emojis.

>>> print(bertmoticon.emojis)
['๐Ÿ˜‚', '๐Ÿ˜ญ', '๐Ÿ˜', '๐Ÿ˜Š', '๐Ÿ™', '๐Ÿ˜…', '๐Ÿ˜', '๐Ÿ™„', '๐Ÿ˜˜', '๐Ÿ˜”', '๐Ÿ˜ฉ', '๐Ÿ˜‰', '๐Ÿ˜Ž', '๐Ÿ˜ข', '๐Ÿ˜†', '๐Ÿ˜‹', '๐Ÿ˜Œ', '๐Ÿ˜ณ', '๐Ÿ˜', '๐Ÿ™‚', '๐Ÿ˜ƒ', '๐Ÿ™ƒ', '๐Ÿ˜’', '๐Ÿ˜œ', '๐Ÿ˜€', '๐Ÿ˜ฑ', '๐Ÿ™ˆ', '๐Ÿ˜„', '๐Ÿ˜ก', '๐Ÿ˜ฌ', '๐Ÿ™Œ', '๐Ÿ˜ด', '๐Ÿ˜ซ', '๐Ÿ˜ช', '๐Ÿ˜ค', '๐Ÿ˜‡', '๐Ÿ˜ˆ', '๐Ÿ˜ž', '๐Ÿ˜ท', '๐Ÿ˜ฃ', '๐Ÿ˜ฅ', '๐Ÿ˜', '๐Ÿ˜‘', '๐Ÿ˜“', '๐Ÿ˜•', '๐Ÿ˜น', '๐Ÿ˜', '๐Ÿ˜ป', '๐Ÿ˜–', '๐Ÿ˜›', '๐Ÿ˜ ', '๐Ÿ™Š', '๐Ÿ˜ฐ', '๐Ÿ˜š', '๐Ÿ˜ฒ', '๐Ÿ˜ถ', '๐Ÿ˜ฎ', '๐Ÿ™', '๐Ÿ˜ต', '๐Ÿ˜—', '๐Ÿ˜Ÿ', '๐Ÿ˜จ', '๐Ÿ™‡', '๐Ÿ™‹', '๐Ÿ˜™', '๐Ÿ˜ฏ', '๐Ÿ™†', '๐Ÿ™‰', '๐Ÿ˜ง', '๐Ÿ˜ฟ', '๐Ÿ˜ธ', '๐Ÿ™€', '๐Ÿ˜ฆ', '๐Ÿ˜ฝ', '๐Ÿ˜บ', '๐Ÿ˜ผ', '๐Ÿ™…', '๐Ÿ˜พ', '๐Ÿ™', '๐Ÿ™Ž']

bertmoticon.infer

Takes in a list of strings and an int number of guesses. It returns a list of dictionaries, where each dictionary contains an emoji and a corresponding percentage.

>>> ls_of_strings =  ["Vote #TRUMP2020ToSaveAmerica from corrupt Joe Biden and the radical left.","Je veux aller dormir. #fatiguรฉ"]
>>> print(bertmoticon.infer(ls_of_strings,3))
    [{'๐Ÿ˜‚': '0.1938', '๐Ÿ˜ก': '0.1866', '๐Ÿ™„': '0.0847'}, {'๐Ÿ˜ด': '0.1547', '๐Ÿ˜ญ': '0.1507', '๐Ÿ˜ฉ': '0.0892'}]

bertmoticon.infer_mappings

Takes in a list of strings, a dictionary dict of the emoji mappings, and an int number of guesses. It returns the number of occurences of each key value. We define the dictionary and the list as follows:

>>> mappings = {"Anger":['๐Ÿ˜ก'], "Other":['๐Ÿ˜‚','๐Ÿ˜ญ']}
>>> ls_of_strings =  ["Vote #TRUMP2020ToSaveAmerica from corrupt Joe Biden and the radical left.","Je veux aller dormir. #fatiguรฉ"]

The key values are the category names and the values are lists of the emojis contained in that category. Then parsed into the bertmoticon.infer_mappings returns:

>>>print(bertmoticon.infer_mappings(ls_of_strings,mappings,3))
{'Anger': 1, 'Other': 2}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bertmoticon-1.0.1.tar.gz (8.9 kB view details)

Uploaded Source

Built Distribution

bertmoticon-1.0.1-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file bertmoticon-1.0.1.tar.gz.

File metadata

  • Download URL: bertmoticon-1.0.1.tar.gz
  • Upload date:
  • Size: 8.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9

File hashes

Hashes for bertmoticon-1.0.1.tar.gz
Algorithm Hash digest
SHA256 7321b70cd30df93d655cb374d446de249d73708501d17a19368cb7e2d4c36e07
MD5 e721626ea05b22278f88b7ed0dc1aa33
BLAKE2b-256 5ae4c1957dd85b03860b5973c04d6dea3938cdc3e36283a08735d40f3521da56

See more details on using hashes here.

File details

Details for the file bertmoticon-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: bertmoticon-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9

File hashes

Hashes for bertmoticon-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f1d08ece547acca2257424df135ef46e67dc24d1f98478e4395aac57435128dd
MD5 a798aae6759a0e5c08338060a183e408
BLAKE2b-256 b1eb35d78e78daee39e414b53bea8c0ee3f46b44801ced5b992031f908b2c12d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page