A biologically inspired method to create sparse, binary word vectors

These details have not been verified by PyPI

Project links

Homepage

Project description

FlyVec

Flybrain-inspired Sparse Binary Word Embeddings

Code based on the ICLR 2021 paper Can a Fruit Fly Learn Word Embeddings?. A work in progress.

Install

pip install flyvec

How to use

import numpy as np
from flyvec import FlyVec

model = FlyVec.load()
embed_info = model.get_sparse_embedding("market")

Loading Tokenizer...
No phraser specified. Proceeding without phrases
Loading synapses...

FlyVec uses a simple, word-based tokenizer with to isolate concepts. The provided model uses a tokenizer with about 40,000 words, all lower-cased, with special tokens for numbers (<NUM>) and unknown words (<UNK>). See Tokenizer for details.

# Batch generate word embeddings
sentence = "Supreme Court dismissed the criminal charges."
tokens = model.tokenize(sentence)
embedding_info = [model.get_sparse_embedding(t) for t in tokens]
embeddings = np.array([e['embedding'] for e in embedding_info])
print("TOKENS: ", [e['token'] for e in embedding_info])
print("EMBEDDINGS: ", embeddings)

TOKENS:  ['supreme', 'court', 'dismissed', 'the', 'criminal', 'charges']
EMBEDDINGS:  [[0 1 0 ... 0 0 0]
 [0 0 0 ... 0 0 0]
 [0 0 0 ... 0 1 0]
 [0 0 0 ... 0 0 0]
 [0 0 0 ... 0 1 0]
 [0 0 0 ... 0 1 0]]

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.3.2

May 10, 2021

0.3.1

May 10, 2021

0.3.0

Feb 26, 2021

0.2.6

Feb 26, 2021

0.2.5

Feb 26, 2021

0.2.4

Feb 26, 2021

0.2.3

Feb 26, 2021

0.2.2

Feb 26, 2021

0.2.1

Feb 7, 2021

0.1.4

Feb 3, 2021

0.1.3

Jan 29, 2021

0.1.1

Jan 29, 2021

0.0.14

Jan 27, 2021

0.0.13

Jan 27, 2021

0.0.10

Jan 26, 2021

This version

0.0.9

Jan 26, 2021

0.0.8

Jan 26, 2021

0.0.6

Jan 26, 2021

0.0.5

Jan 26, 2021

0.0.4

Jan 26, 2021

0.0.3

Jan 26, 2021

0.0.2

Jan 26, 2021

0.0.1

Jan 25, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flyvec-0.0.9.tar.gz (14.2 kB view hashes)

Uploaded Jan 26, 2021 Source

Built Distribution

flyvec-0.0.9-py3-none-any.whl (13.6 kB view hashes)

Uploaded Jan 26, 2021 Python 3

Hashes for flyvec-0.0.9.tar.gz

Hashes for flyvec-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`f31660e99920a3fe73966ab68dc272c5a93d3ae35f712350e55a9a76cb0af7f7`
MD5	`06d15b5eaa323366a3b93a038d3294f2`
BLAKE2b-256	`553d44af2df1395cb2c51fb22d8989b8f698e5ec24965f5a14d6a27648aa55b5`

Hashes for flyvec-0.0.9-py3-none-any.whl

Hashes for flyvec-0.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b3f9ab420927ce6798d72d2ca990fdaa4031b35250f2bb9270bfca1b41399779`
MD5	`bf46b50080d1e90f0ace11fee2410fad`
BLAKE2b-256	`9d263adacd00accb40717acaaaf97d7680e1c23cba695fcfbd3a3d8ccd32e1b4`