Client for the public Ginkgo AI API
Project description
Ginkgo's AI model API client
Work in progress: this repo was just made public and we are still working on integration
A python client for Ginkgo's AI model API, to run inference on public and Ginkgo-proprietary models. Learn more in the Model API announcement.
Prerequisites
Register at https://models.ginkgobioworks.ai/ to get credits and an API KEY (of the form xxxxxxx-xxxx-xxxx-xxxx-xxxxxxxx).
Store the API KEY in the GINKGOAI_API_KEY environment variable.
Installation
Install the python client with pip:
pip install ginkgo-ai-client
Usage:
Note: This is an alpha version of the client and its interface may vary in the future.
Example : masked inference with Ginkgo's AA0 model
The client requires an API key (and defaults to os.environ.get("GINKGOAI_API_KEY") if none is explicitly provided)
from ginkgo_ai_client import GinkgoAIClient, MaskedInferenceQuery
client = GinkgoAIClient()
model = "ginkgo-aa0-650M"
query = MaskedInferenceQuery(sequence="MPK<mask><mask>RRL", model=model)
prediction = client.send_request(query)
# prediction.sequence == "MPKRRRRL"
It is also possible to send multiple queries at once, and even recommended in most cases as these will be processed in parallel, with appropriate scaling from our servers. The send_batch_request method returns a list of results in the same order as the queries:
sequences = ["MPK<mask><mask>RRL", "M<mask>RL", "MLLM<mask><mask>R"]
queries = [MaskedInferenceQuery(sequence=seq, model=model) for seq in sequences]
predictions = client.send_batch_request(queries)
# predictions[0].sequence == "MPKRRRRL"
For large datasets (say, 100,000 queries), one can also send multiple batches of requests, then iterate over the results as they are ready. Note that the order in which the results are returned is not guaranteed to be the same as the order of the queries, therefore you should make sure the queries have a query_name attribute that will be used to identify the results.
from ginkgo_ai_client import MeanEmbeddingQuery
queries = MeanEmbeddingQuery.iter_from_fasta("sequences.fasta", model=model)
for batch_results in client.send_requests_by_batches(queries, batch_size=1000):
for result in batch_results:
print(result.query_name, result.embedding)
Changing the model parameter to esm2-650M or esm2-3b in this example will perform
masked inference with the ESM2 model.
Example : embedding computation with Ginkgo's 3'UTR language model
from ginkgo_ai_client import GinkgoAIClient, MeanEmbeddingQuery
client = GinkgoAIClient()
model = "ginkgo-maskedlm-3utr-v1"
# SINGLE QUERY
query = MeanEmbeddingQuery(sequence="ATTGCG", model=model)
prediction = client.send_request(query)
# prediction.embedding == [1.05, -2.34, ...]
# BATCH QUERY
sequences = ["ATTGCG", "CAATGC", "GCGCACATGT"]
queries = [MeanEmbeddingQuery(sequence=seq, model=model) for seq in sequences]
predictions = client.send_batch_request(queries)
# predictions[0].embedding == [1.05, -2.34, ...]
Available models
See the example folder and reference docs for more details on usage and parameters.
| Model | Description | Reference | Supported queries | Versions |
|---|---|---|---|---|
| ESM2 | Large Protein language model from Meta | Github | Embeddings, masked inference | 3B, 650M |
| AA0 | Ginkgo's protein language model | Announcement | Embeddings, masked inference | 650M |
| 3UTR | Ginkgo's 3'UTR language model | Preprint | Embeddings, masked inference | v1 |
| Promoter-0 | Ginkgo's promoter activity model | Coming soon | Promoter activity accross tissues | v1 |
| Boltz | Protein structure prediction model | Github | Protein structure prediction | v1 |
| ABdiffusion | Antibody diffusion model | Coming soon | Unmasking | v1 |
| LCDNA | Long-context DNA diffusion model | Coming soon | Unmasking | v1 |
License
This project is licensed under the MIT License. See the LICENSE file for details.
Releases
To release a new version to PyPI:
- Make sure the changelog is up to date and the top section reads
Unreleased. - Increment the version with the
bumpversionworkflow in Actions - it will update the version everywhere in the repo and create a tag. - If all looks good, create a release for the tag, it will automatically publish to PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ginkgo_ai_client-0.7.0.tar.gz.
File metadata
- Download URL: ginkgo_ai_client-0.7.0.tar.gz
- Upload date:
- Size: 20.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
62ebb9dc3b46383eabca23fbefc24778bd385ddace9421b378969b7d42744cd1
|
|
| MD5 |
8ea5b35bfe039cf2622685285d915f92
|
|
| BLAKE2b-256 |
1111edcf0f82b9644a319721dcbb58974604d324b0a1843c43947ec7e83c54ea
|
Provenance
The following attestation bundles were made for ginkgo_ai_client-0.7.0.tar.gz:
Publisher:
publish.yml on ginkgobioworks/ginkgo-ai-client
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ginkgo_ai_client-0.7.0.tar.gz -
Subject digest:
62ebb9dc3b46383eabca23fbefc24778bd385ddace9421b378969b7d42744cd1 - Sigstore transparency entry: 162353308
- Sigstore integration time:
-
Permalink:
ginkgobioworks/ginkgo-ai-client@b4693774f53fcf00518ba4ff32d3aa9747c0131f -
Branch / Tag:
refs/tags/v0.7.0 - Owner: https://github.com/ginkgobioworks
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b4693774f53fcf00518ba4ff32d3aa9747c0131f -
Trigger Event:
release
-
Statement type:
File details
Details for the file ginkgo_ai_client-0.7.0-py3-none-any.whl.
File metadata
- Download URL: ginkgo_ai_client-0.7.0-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8df368e737186919289761483c77ea177458e7c69ca6beeb84e69239d910d6e5
|
|
| MD5 |
77692a264816371cdf23881b1f2d0264
|
|
| BLAKE2b-256 |
8b0c4640b47d4d28b70e4c029cc4129cc8c2a756547f2a7c1e4542b32863803f
|
Provenance
The following attestation bundles were made for ginkgo_ai_client-0.7.0-py3-none-any.whl:
Publisher:
publish.yml on ginkgobioworks/ginkgo-ai-client
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ginkgo_ai_client-0.7.0-py3-none-any.whl -
Subject digest:
8df368e737186919289761483c77ea177458e7c69ca6beeb84e69239d910d6e5 - Sigstore transparency entry: 162353311
- Sigstore integration time:
-
Permalink:
ginkgobioworks/ginkgo-ai-client@b4693774f53fcf00518ba4ff32d3aa9747c0131f -
Branch / Tag:
refs/tags/v0.7.0 - Owner: https://github.com/ginkgobioworks
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b4693774f53fcf00518ba4ff32d3aa9747c0131f -
Trigger Event:
release
-
Statement type: