Skip to main content

Genesis architecture (PyTorch) and utilities for inference/benchmarking.

Project description


license: apache-2.0 language:

  • en pipeline_tag: text-generation tags:
  • pytorch
  • safetensors
  • text-generation
  • small-llm
  • custom-architecture

Genesis-152M-Instruct

Genesis-152M-Instruct is a small instruction-tuned model based on a custom Genesis PyTorch architecture. The weights are provided as a single .safetensors file and are not directly compatible with transformers (there is no config.json / tokenizer files in this repo).

What’s in this repo

  • genesis_152m_instruct.safetensors (model weights)
  • README.md (this model card)
  • LICENSE (Apache-2.0 for the weights)

Model summary

  • Params: ~151.8M total (~122.8M non-embedding; benchmark wrapper report)
  • Context length: 2048
  • Tokenizer: GPT‑NeoX / Pythia base vocab + ChatML tokens (<|im_start|>, <|im_end|>)
  • Architecture: hybrid attention (linear + full attention)

How to run (recommended)

  1. Install the code (PyPI):
python -m pip install -U genesis-llm
  1. Download the weights from the Hub:
python -m pip install -U "huggingface-hub<1.0,>=0.34.0"
hf download guiferrarib/genesis-152m-instruct genesis_152m_instruct.safetensors --local-dir .
  1. Start chat (loads the .safetensors you downloaded):
genesis --model ./genesis_152m_instruct.safetensors

Benchmarks (English, full; MPS)

Run (example):

python benchmark/run_benchmark.py --full --chatml -c models/genesis_152m_instruct.safetensors

Notes:

  • Device: MPS
  • Loglikelihood: 20848/20848 (~5.91 it/s)
Task Metric Value Stderr
all acc_norm 0.3710 0.0122
all acc 0.4909 0.0141
genesis:_average:0 acc_norm 0.4021 0.0144
genesis:_average:25 acc_norm 0.3434 0.0114
genesis:arc_challenge:25 acc_norm 0.2466 0.0126
genesis:arc_easy:25 acc_norm 0.4402 0.0102
genesis:boolq:0 acc_norm 0.5630 0.0087
genesis:commonsenseqa:0 acc_norm 0.2916 0.0130
genesis:hellaswag:10 acc_norm 0.3019 0.0046
genesis:openbookqa:0 acc_norm 0.2860 0.0202
genesis:sciq:0 acc_norm 0.4680 0.0158
genesis:winogrande:5 acc 0.4909 0.0141

Intended use

  • Small local assistant for short tasks (rewriting, short Q&A, quick explanations).
  • Prompt-format experiments (ChatML) and sampling strategy prototyping.

Limitations

  • Hallucinations and factual errors can happen.
  • Weak multi-step reasoning and unreliable math.
  • Instruction-following can be brittle for strict constraints (e.g. “answer with only a number”).

License

  • Weights: Apache License 2.0 (see LICENSE in this repo).
  • Code: MIT in the upstream code repository (see LICENSE there).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genesis_llm-2.0.0.tar.gz (63.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

genesis_llm-2.0.0-py3-none-any.whl (64.8 kB view details)

Uploaded Python 3

File details

Details for the file genesis_llm-2.0.0.tar.gz.

File metadata

  • Download URL: genesis_llm-2.0.0.tar.gz
  • Upload date:
  • Size: 63.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for genesis_llm-2.0.0.tar.gz
Algorithm Hash digest
SHA256 76d5aae88448271656330132e26b7d41d37684b439ad0ae8797e40476272d094
MD5 e47eab55b81c151015352833a8e07623
BLAKE2b-256 6ca4bdd63353a70656159a36f0b5a89fa51c4ab8b4babfafc102c947eef1f886

See more details on using hashes here.

File details

Details for the file genesis_llm-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: genesis_llm-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 64.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for genesis_llm-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d56f0dc1a1ba1290d9ceccfe3e5434d3f57ad8949949ac311fda717b29b28d5e
MD5 24d61c538f92cd4fc4e53ec7f40c805a
BLAKE2b-256 8e3ed6a1ea6ef71738c1dd12fbe3020b19cb5a555deca4cd4419a56ea771769f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page