Skip to main content

The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER). We have shared a pretrained 9B parameter model.

Project description

GeoV

Overview

The GeoV model was designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER) by Georges Hark and Varuna Jayasiri.

RoPER, in addition to using relative positions in the attention score calculation by RoPE embeddings, adds relative positional information explicitly to value embeddings. Specifically, it incorporates the relative positions of the tokens paid attention to. RoPER has given better performance in some algorithmic tasks, and seems comparable to RoPE in language modeling.

The GeoV tokenizer uses SentencePiece unigram language model and tokenizes symbols, digits and new line characters separately, in order to achieve better performance on mathematical content and code.

This model was contributed by gharik and vpj.

We have shared 9B parameter pre-trained model at GeoV/GeoV-9b, We plan to release checkpoints around every 20b tokens trained from here until around 300b tokens. We will also train smaller and larger versions. Our aim is to have broadly available smaller and larger models.

This implementation is built on top of transformers library.

Installations

pip install geov

Generation

from geov import GeoVForCausalLM, GeoVTokenizer

model = GeoVForCausalLM.from_pretrained("GeoV/GeoV-9b")
tokenizer = GeoVTokenizer.from_pretrained("GeoV/GeoV-9b")

prompt = "In mathematics, topology is the study of"

input_ids = tokenizer(prompt, return_tensors="pt").input_ids

gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    temperature=0.9,
    max_length=100,
)
gen_text = tokenizer.batch_decode(gen_tokens)[0]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geov-0.0.2.tar.gz (12.8 kB view details)

Uploaded Source

Built Distribution

geov-0.0.2-py3-none-any.whl (14.4 kB view details)

Uploaded Python 3

File details

Details for the file geov-0.0.2.tar.gz.

File metadata

  • Download URL: geov-0.0.2.tar.gz
  • Upload date:
  • Size: 12.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.27.1 setuptools/58.0.4 requests-toolbelt/0.9.1 tqdm/4.63.0 CPython/3.7.11

File hashes

Hashes for geov-0.0.2.tar.gz
Algorithm Hash digest
SHA256 5a737bd20531ed1ad72c9742839af754371d0108fd997b02b0200c8ae6393e3f
MD5 78351886a76142e3b02bbf854070cefd
BLAKE2b-256 a724fc830ae9da607c5857b071056ee78972e72575e0a04b9c1ec9c40a35bd05

See more details on using hashes here.

File details

Details for the file geov-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: geov-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 14.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.27.1 setuptools/58.0.4 requests-toolbelt/0.9.1 tqdm/4.63.0 CPython/3.7.11

File hashes

Hashes for geov-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2cbde524f8dbe208ea904bcf6910400d7476514e6f407cef911ddd107eb065db
MD5 3fdbe9c9534e74023963a7956d0de77b
BLAKE2b-256 d66e2a0e17ee8e8c03f38652c6f02c00a080a31a8133d4477e17b9ecad436ff3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page