GLMap: Profiling genomic language models as individuals in a population (placeholder release; full v1.0.0 forthcoming with the paper)

Project description

GLMap

Placeholder release (v0.0.1). The full v1.0.0 will be released together with the accompanying paper.

GLMap (Genomic Language Model Map) is a training-free, architecture-agnostic framework for representing and comparing genomic language models (GLMs) by their likelihood responses over a fixed panel of DNA sequences. Applied to 123 publicly available GLMs scored on a panel of 10,000 DNA probes, GLMap places autoregressive and masked-language models in a common space, yields model-to-model distances that are stable under the choice of probes, and predicts downstream task performance with Spearman ρ ≈ 0.7.
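
To make the idea of "models as points defined by their likelihood responses" concrete, here is a minimal sketch in plain Python. It is not the GLMap API, which is not yet released. The model names, the toy scores, the helper names (profile_distance, distance_matrix) and the choice of Euclidean distance are all assumptions made for illustration; the description above does not say which distance GLMap uses.

```python
import math

# Hypothetical per-probe scores (for example, a log-likelihood per probe).
# Real values would come from scoring every model on the same fixed panel.
profiles = {
    "model_a": [-1.30, -1.10, -1.45, -1.20],
    "model_b": [-1.28, -1.12, -1.40, -1.25],
    "model_c": [-1.90, -0.80, -1.00, -1.70],
}


def profile_distance(u, v):
    """Euclidean distance between two score vectors (an assumed metric)."""
    if len(u) != len(v):
        raise ValueError("profiles must be scored on the same probe panel")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_matrix(profiles):
    """Return sorted model names and their pairwise distance matrix."""
    names = sorted(profiles)
    matrix = [
        [profile_distance(profiles[a], profiles[b]) for b in names]
        for a in names
    ]
    return names, matrix


names, dist = distance_matrix(profiles)
for name, row in zip(names, dist):
    print(name, [round(d, 3) for d in row])
```

Because every model is reduced to a vector over the same probes, this comparison needs only a score per probe from each model, so it does not depend on architecture or training objective.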

Status

This 0.0.1 release reserves the glmap name on PyPI. The full library — including model loaders, scoring code, panel construction, and the clip + double-center matrix pipeline — is in active preparation:

  • Paper submitted
  • Code repository public release
  • PyPI v1.0.0

Track progress at https://github.com/ai4nucleome/GLMap.
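
The "clip + double-center" step named above can be sketched as follows. This is an illustration under stated assumptions, not the released pipeline: it assumes a models × probes score matrix, clips each entry to a hypothetical range [lo, hi], and then applies standard double-centering (subtract row and column means, add back the grand mean). The bounds, the matrix orientation and the function names are placeholders.

```python
def clip(matrix, lo, hi):
    """Limit every entry to the closed interval [lo, hi]."""
    return [[min(max(x, lo), hi) for x in row] for row in matrix]


def double_center(matrix):
    """Subtract row and column means, then add back the grand mean."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    row_means = [sum(row) / n_cols for row in matrix]
    col_means = [sum(row[j] for row in matrix) / n_rows for j in range(n_cols)]
    grand_mean = sum(row_means) / n_rows
    return [
        [matrix[i][j] - row_means[i] - col_means[j] + grand_mean
         for j in range(n_cols)]
        for i in range(n_rows)
    ]


# Toy matrix: rows are models, columns are probes (hypothetical values).
scores = [
    [-1.3, -1.1, -9.0],  # one extreme score that clipping will tame
    [-1.2, -1.0, -1.4],
    [-1.9, -0.8, -1.0],
]

clipped = clip(scores, lo=-3.0, hi=0.0)  # bounds are assumed, not GLMap's
centered = double_center(clipped)
```

After double-centering, every row and every column of the matrix sums to zero (up to floating-point error), so what remains is how each model deviates on each probe once overall model level and overall probe difficulty are removed.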

Citation

If you wish to cite GLMap before the v1.0.0 release, please cite the preprint:

Hou, Y., Long, W., Su, H., Feng, J., Zhang, Y.
Profiling genomic language models as individuals in a population.
(In submission, 2026.)

License

Apache-2.0.

Download files

Source Distribution

ai4nucleome_glmap-0.0.1.tar.gz (6.3 kB)

Uploaded Source

Built Distribution

ai4nucleome_glmap-0.0.1-py3-none-any.whl (6.8 kB)

Uploaded Python 3

File details

Details for the file ai4nucleome_glmap-0.0.1.tar.gz.

File metadata

  • Download URL: ai4nucleome_glmap-0.0.1.tar.gz
  • Upload date:
  • Size: 6.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for ai4nucleome_glmap-0.0.1.tar.gz
Algorithm Hash digest
SHA256 a1146b62edb041944f3b9ec4436bb729208bbb649d5c05eeecab3939e7f9b04c
MD5 a8faeb7bc1ff5446408a9db8916acd27
BLAKE2b-256 4eba8f3314532a03c20e5eadcd3a238b27769b8734ea4c992ed651c8ab917a4b

File details

Details for the file ai4nucleome_glmap-0.0.1-py3-none-any.whl.

File hashes

Hashes for ai4nucleome_glmap-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5bc82233c506e038dcbf1b6c6588b3c1023c122b4a81d6a6b7f0d0c7cadeec98
MD5 b195add58c27db0bfbd9b990bb7abbd1
BLAKE2b-256 4508e6af991d2217797b66b1ff0052ed57aa15dc2b71c91bb851b3e83c4551aa
