AfriLink SDK — One-line access to GPUs, models and datasets from your notebook

These details have not been verified by PyPI

Project links

Project description

AfriLink SDK

Version: 0.6.0

Last Updated: April 9, 2026

Finetune LLMs on HPC from your notebook

AfriLink SDK gives you one-line access to GPUs, models and datasets; all ready to use directly from your notebook interface. Authenticate, submit LoRA finetune jobs, download trained weights, and run inference without ever leaving your notebook.

pip install afrilink-sdk

Quick Start

from afrilink import AfriLinkClient

# 1. Authenticate (prompts for DataSpires email/password, then auto-handles HPC)
client = AfriLinkClient()
client.authenticate()

# 2. Prepare your dataset (pandas DataFrame with "text" column)
import pandas as pd
data = pd.DataFrame({"text": [
    "Below is an instruction...\n\n### Response:\nHere is the answer..."
]})

# 3. Submit a finetune job
job = client.finetune(
    model="qwen2.5-0.5b",    # The model you choose to finetune
    training_mode="low",      # How much training: "low", "medium", or "high"
    data=data,                # Your dataset (DataFrame, HF Dataset, or file path)
    gpus=1,                   # Number of A100 GPUs to use
    time_limit="01:00:00",    # Maximum time your job should run for
    backend="cineca",         # HPC backend: "cineca" (default), "eversetech", "agh", or "acf"
)
result = job.run(wait=True)   # blocks until SLURM job finishes

# 4. Download the trained adapter (only if job succeeded)
if result["status"] == "completed":
    client.download_model(result["job_id"], "./my-model")

    # 5. Load & run inference
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
    model = PeftModel.from_pretrained(base, "./my-model")
    tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B")

    out = model.generate(**tokenizer("Hello!", return_tensors="pt"), max_new_tokens=64)
    print(tokenizer.decode(out[0], skip_special_tokens=True))
else:
    print(f"Job failed with status: {result['status']}")
    print(f"Check logs: job.get_logs()")

Installation

pip install afrilink-sdk

The package has zero required dependencies — heavy libraries (requests, torch, transformers, peft) are only needed at the point you actually use them and are pre-installed in most notebook environments.

Authentication

AfriLink uses a two-phase auth flow. Both phases happen inside a single client.authenticate() call:

Phase	What happens	User action
1. DataSpires	Validates your DataSpires account for billing/telemetry	Enter email + password when prompted
2. HPC	Headless Selenium browser automation gets SSH certificates via Smallstep	Fully automatic (org credentials auto-provisioned)

from afrilink import AfriLinkClient

client = AfriLinkClient()
client.authenticate()   # prompts for DataSpires creds, then auto-handles HPC

# Or pass credentials explicitly:
client.authenticate(
    dataspires_email="you@example.com",
    dataspires_password="...",
)

After authentication you get:

SSH certificate valid for ~12 hours (the SDK warns you before it expires — see Session Recovery)
SLURM job manager ready to submit jobs
SCP transfer manager ready to move files
Telemetry tracker logging GPU-minutes to your DataSpires account

Built-in User Guide

The SDK ships with an inline reference manual you can query from any notebook cell using a slash-style syntax:

import afrilink

afrilink/help          # top-level index of all topics
afrilink/quickstart    # step-by-step getting started guide
afrilink/auth          # authentication & session management
afrilink/finetune      # finetune job parameters & training modes
afrilink/specs         # available models and GPU requirements
afrilink/datasets      # dataset formats and upload
afrilink/transfer      # SCP upload/download commands
afrilink/jobs          # SLURM job management
afrilink/inference     # routing inference to HuggingFace endpoints

Each page prints a formatted reference to your notebook output — no internet connection required.

API Reference

`AfriLinkClient`

Main entry point. Created once per notebook session.

Method	Description
`authenticate()`	Full auth flow (DataSpires + HPC)
`finetune(model, training_mode, data, gpus, ...)`	Create a `FinetuneJob`
`download_model(job_id, local_dir)`	Download trained adapter weights
`upload_dataset(local_path, dataset_name)`	Upload dataset to HPC
`list_available_models(size=None)`	List models in the registry
`list_available_datasets()`	List datasets in the registry
`get_model_requirements(model, training_mode)`	GPU/memory recommendations
`list_jobs()`	List SLURM queue
`recover_session(download_dir=None)`	Re-authenticate + check/download tracked jobs
`inference(prompt, model_id, ...)`	Route inference to a HuggingFace endpoint
`cancel_job(job_id)`	Cancel a running job
`run_command(command)`	Run arbitrary shell command on HPC login node
`get_queue_status()`	SLURM partition info
`cert_minutes_remaining`	Minutes until SSH certificate expires

`client.finetune()`

job = client.finetune(
    model="qwen2.5-0.5b",       # model ID from registry
    training_mode="low",          # "low" | "medium" | "high"
    data=my_dataframe,            # pandas DataFrame, HF Dataset, or file path
    gpus=1,                       # number of A100 GPUs
    time_limit="01:00:00",        # max wallclock (HH:MM:SS)
    backend="cineca",             # HPC backend cluster
    output_dir=None,              # default: $WORK/finetune_outputs
)

HPC Backends:

Backend	Provider	Region	Status
`cineca`	CINECA Leonardo (EuroHPC)	Bologna, Italy	Available (default)
`eversetech`	EverseTech	Variable	Coming soon
`agh`	AGH	Variable	Coming soon
`acf`	ACF	Variable	Coming soon

CINECA Leonardo — Hardware Specs:

Each GPU node on the Leonardo Booster partition (where AfriLink jobs run):

Component	Specification
GPU per node	4x NVIDIA A100 (custom)
GPU memory	64 GB HBM2e per GPU (256 GB per node)
FP64 performance	11.2 TFLOPS per GPU
FP32 performance	22.4 TFLOPS per GPU
CPU cores per node	32
System RAM per node	512 GB DDR4
RAM per GPU (effective)	~128 GB (shared, not partitioned)
Node interconnect	200 Gb/s HDR InfiniBand
SLURM partition	`boost_usr_prod`

Per-GPU memory guide:

Model size	Training mode	Min GPUs recommended
0.5B - 1B	low (QLoRA 4-bit)	1
3B - 7B	low	1
3B - 7B	high (bf16)	2-4
13B	low	2
13B	high	4
30B+	low or high	4

Billing: $2.00 / GPU-hour, charged per completed GPU-minute (minimum 1 minute). Credits deducted automatically from your DataSpires balance.

Training modes:

Mode	Strategy	Quantization	Typical GPUs
`low`	QLoRA (rank 8)	4-bit	1
`medium`	LoRA (rank 16)	8-bit / none	1-2
`high`	LoRA (rank 64) + DDP/FSDP	none	2-4+

`FinetuneJob`

Returned by client.finetune().

Method / Property	Description
`run(wait=True)`	Submit to SLURM. `wait=True` polls until done.
`cancel()`	Cancel the SLURM job
`get_logs(tail=100)`	Fetch recent log lines
`status`	Current status string
`job_id`	AfriLink job ID (8-char UUID prefix)
`slurm_job_id`	SLURM numeric job ID (set after `run()`)

run() returns a dict:

{
    "job_id": "a1b2c3d4",
    "slurm_job_id": "12345678",
    "status": "completed",        # or "submitted" if wait=False
    "output_dir": "/path/...",
    "model_path": "/path/...",
}

Session Watchdog

The SDK monitors your SSH certificate in the background and prints a warning as it approaches expiry (at 60, 30, 15, and 5 minutes remaining). You can check time remaining at any point:

print(f"{client.cert_minutes_remaining:.0f} minutes remaining on SSH certificate")

Session Recovery

SSH certificates expire after ~12 hours. The SDK monitors this automatically and warns you before expiry. When you see the warning — or when you return to a notebook after being away — call recover_session() to re-authenticate and pick up where you left off:

# Re-authenticate and check on all tracked jobs
recovery = client.recover_session("./recovered-models")

# recovery.re_authenticated  — True if fresh SSH cert was obtained
# recovery.jobs               — status of each tracked SLURM job
# recovery.files_retrieved    — list of model dirs downloaded for completed jobs

What recover_session() does:

Re-authenticates with CINECA — gets a fresh SSH certificate without re-entering credentials
Checks all tracked SLURM jobs — reports status of every job submitted in this session
Downloads completed models — if you pass a download_dir, finished adapters are pulled automatically
Registers email notification — for jobs still running, you'll get an email when they finish

Your SLURM jobs keep running on the cluster even after your certificate expires — you just need fresh credentials to check on them or download results.

# Minimal usage (just re-auth, no download)
client.recover_session()

# With download directory for completed jobs
client.recover_session("./my-models")

`client.inference()`

Route an inference request to any HuggingFace Inference Endpoint — no CINECA session required.

# Public model (no token needed, rate-limited)
result = client.inference(
    "Explain LoRA fine-tuning in one sentence.",
    model_id="HuggingFaceH4/zephyr-7b-beta",
)
print(result.text)

# Gated model with HF token + generation parameters
result = client.inference(
    "What is transfer learning?",
    model_id="meta-llama/Llama-2-7b-chat-hf",
    hf_token="hf_...",
    parameters={"max_new_tokens": 256, "temperature": 0.7},
)

# Private HuggingFace Inference Endpoint
result = client.inference(
    payload={"inputs": "Hello!"},
    endpoint_url="https://xyz.endpoints.huggingface.cloud",
    hf_token="hf_...",
)

# Check result
if result.success:
    print(result.text)
else:
    print(f"Error {result.status_code}: {result.error}")

InferenceResult fields:

Field	Type	Description
`text`	str	Generated text (first result)
`raw`	Any	Full decoded JSON from HuggingFace
`status_code`	int	HTTP status code
`success`	bool	True if status_code < 400
`error`	str \| None	Error message, or None

`client.download_model()`

client.download_model(result["job_id"], "./my-model")

Downloads adapter files (adapter_config.json, adapter_model.safetensors, tokenizer files) flat into the target directory — ready for PeftModel.from_pretrained().

Working With Your Model

Once you have downloaded adapter weights with client.download_model(), the adapter directory is ready for standard Hugging Face tooling.

GGUF Conversion & Ollama

Convert your adapter to GGUF format for use with Ollama or llama.cpp:

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# 1. Merge adapter into base model
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
model = PeftModel.from_pretrained(base, "./my-model")
merged = model.merge_and_unload()
merged.save_pretrained("./my-model-merged")
AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B").save_pretrained("./my-model-merged")

# 2. Convert to GGUF (requires llama.cpp — see llama.cpp repo for build instructions)
# python convert_hf_to_gguf.py ./my-model-merged --outfile my-model.gguf --outtype f16

# 3. Quantize (optional, 4-bit)
# ./llama-quantize my-model.gguf my-model-q4.gguf Q4_K_M

# 4. Run with Ollama
# Create a Modelfile:
#   FROM ./my-model-q4.gguf
# ollama create my-model -f Modelfile
# ollama run my-model

Publishing to Hugging Face Hub

from huggingface_hub import HfApi

api = HfApi(token="hf_...")
repo_id = "your-username/my-finetuned-model"
api.create_repo(repo_id, exist_ok=True)

# Option A — adapter only (small, loads on top of base model)
api.upload_folder(folder_path="./my-model", repo_id=repo_id)

# Option B — full merged model
api.upload_folder(folder_path="./my-model-merged", repo_id=repo_id)

# Option C — GGUF file
api.upload_file(path_or_fileobj="./my-model-q4.gguf",
                path_in_repo="my-model-q4.gguf",
                repo_id=repo_id)

Model & Dataset Registry

# List all models
client.list_available_models()

# Filter by size
client.list_available_models(size="tiny")   # tiny | small | medium | large

# List datasets
client.list_available_datasets()

# Resource requirements
client.get_model_requirements("qwen2.5-0.5b", "low")

Available models (v0.1.0):

ID	Name	Type	Params	Min VRAM
`qwen2.5-0.5b`	Qwen 2.5 0.5B	text	0.5B	4 GB
`gemma-3-270m`	Gemma 3 270M	text	0.27B	2 GB
`llama-3.2-1b`	Llama 3.2 1B	text	1.0B	4 GB
`deepseek-r1-1.5b`	DeepSeek R1 1.5B	text	1.5B	6 GB
`ministral-3b`	Ministral 3B	text	3.3B	8 GB
`florence-2-base`	Florence 2 Base	vision	0.23B	4 GB
`smolvlm-256m`	SmolVLM 256M	vision	0.26B	2 GB
`moondream2`	Moondream 2	vision	1.9B	8 GB
`internvl2-1b`	InternVL2 1B	vision	1.0B	4 GB
`llava-1.5-7b`	LLaVA 1.5 7B	vision	7.0B	16 GB

Data Transfer

# Upload a dataset
client.upload_dataset("./train.jsonl", dataset_name="my-data")

# Download model weights
client.download_model("a1b2c3d4", "./my-model")

# List remote files
client.transfer.list_remote_files("$WORK/finetune_outputs/")

# Run shell commands on HPC
client.run_command("squeue -u $USER")

Dataset Formats

client.finetune(data=...) accepts:

Type	How it's handled
`pandas.DataFrame`	Serialised to JSONL, uploaded via SCP
`datasets.Dataset`	Saved to disk, uploaded via SCP
`str` (local path)	Uploaded via SCP
`str` (starts with `$`)	Treated as remote HPC path (no upload)

Your DataFrame should have a text column with the full prompt+response formatted as a single string (Alpaca-style or chat template).

Architecture

Notebook Interface                      High Performance Compute
+--------------+      SSH/SCP          +------------------+
| AfriLink SDK | ------------------->  |  Login Node      |
|              |  (Smallstep certs)    |  +- SLURM sbatch |
| DataSpires   |                       |  +- $WORK/       |
| (billing)    |                       |  |  +- containers|
|              |                       |  |  +- datasets  |
+--------------+                       |  |  +- finetune_ |
                                       |  |     outputs/  |
                                       |  |     +- {jobid}|
                                       |  +- Singularity  |
                                       |     container    |
                                       |     (A100 GPUs)  |
                                       +------------------+

Publishing to PyPI

For maintainers:

cd afrilink-sdk
pip install build twine

# Build wheel + sdist
python -m build

# Upload to PyPI (requires PyPI API token)
twine upload dist/*

You'll need a PyPI account at https://pypi.org and an API token configured in ~/.pypirc or passed via --username __token__ --password pypi-....

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.8.16

May 29, 2026

0.8.15

May 29, 2026

0.8.14

May 29, 2026

0.8.13

May 29, 2026

0.8.12

May 29, 2026

0.8.11

May 28, 2026

0.8.10

May 28, 2026

0.8.9

May 28, 2026

0.8.8

May 28, 2026

0.8.7

May 28, 2026

0.8.6

May 28, 2026

0.8.5

May 28, 2026

0.8.4

May 28, 2026

0.8.3

May 28, 2026

0.8.2

May 28, 2026

0.8.1

May 28, 2026

0.8.0

May 28, 2026

0.7.5

May 8, 2026

0.7.4

May 8, 2026

0.7.3

May 4, 2026

0.7.2

May 3, 2026

0.7.1

Apr 13, 2026

0.7.0

Apr 9, 2026

This version

0.6.0

Apr 9, 2026

0.5.9

Apr 1, 2026

0.5.8

Apr 1, 2026

0.5.7

Mar 23, 2026

0.5.6

Mar 21, 2026

0.5.5

Mar 21, 2026

0.5.4

Mar 21, 2026

0.5.3

Mar 21, 2026

0.5.2

Mar 20, 2026

0.5.1

Mar 16, 2026

0.5.0

Mar 9, 2026

0.4.0

Mar 6, 2026

0.3.1

Feb 27, 2026

0.3.0

Feb 27, 2026

0.2.3

Feb 23, 2026

0.2.2

Feb 20, 2026

0.2.1

Feb 20, 2026

0.2.0

Feb 20, 2026

0.1.9

Feb 19, 2026

0.1.8

Feb 19, 2026

0.1.7

Feb 19, 2026

0.1.6

Feb 17, 2026

0.1.5

Feb 17, 2026

0.1.4

Feb 17, 2026

0.1.3

Feb 16, 2026

0.1.2

Feb 16, 2026

0.1.1

Feb 16, 2026

0.1.0

Feb 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

afrilink_sdk-0.6.0.tar.gz (100.8 kB view details)

Uploaded Apr 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

afrilink_sdk-0.6.0-py3-none-any.whl (102.1 kB view details)

Uploaded Apr 9, 2026 Python 3

File details

Details for the file afrilink_sdk-0.6.0.tar.gz.

File metadata

Download URL: afrilink_sdk-0.6.0.tar.gz
Upload date: Apr 9, 2026
Size: 100.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for afrilink_sdk-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`7a2152924abe99e6b9c6864aec4d7d4f083d9d06b2014dbb3fbd8612d346cb7c`
MD5	`b8c72307428fd31d42fd64e44fa191d9`
BLAKE2b-256	`824cd67d9525e774a8a96f5cbace811f5aa049307faec07e49113d248ebe5634`

See more details on using hashes here.

File details

Details for the file afrilink_sdk-0.6.0-py3-none-any.whl.

File metadata

Download URL: afrilink_sdk-0.6.0-py3-none-any.whl
Upload date: Apr 9, 2026
Size: 102.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for afrilink_sdk-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cee513dfee4e57e24b877da07108d2e59369b277626a5a5399fbe82e2113e77e`
MD5	`3d697a40401461f7f749dac3645d0721`
BLAKE2b-256	`0f4ac5f2ac9295f574dda7a837460cf6de5a1ad8b914beaa0d9ccf473c991eaa`

See more details on using hashes here.

afrilink-sdk 0.6.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

AfriLink SDK

Quick Start

Installation

Authentication

Built-in User Guide

API Reference

AfriLinkClient

client.finetune()

FinetuneJob

Session Watchdog

Session Recovery

client.inference()

client.download_model()

Working With Your Model

GGUF Conversion & Ollama

Publishing to Hugging Face Hub

Model & Dataset Registry

Data Transfer

Dataset Formats

Architecture

Publishing to PyPI

License

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`AfriLinkClient`

`client.finetune()`

`FinetuneJob`

`client.inference()`

`client.download_model()`