Skip to main content

TechNode CLI — run inference on compressed open models, rent GPUs, or serve as a provider (RunPod-compatible).

Project description

technode

Run inference on TechNode — a GPU grid serving compressed open models (Qwen, Granite, gpt-oss, Devstral, Gemma, EXAONE…) at consumer-GPU prices.

pip install technode-cli       # the command is `technode`
technode login                 # paste your tn_test_… key (or set TECHNODE_API_KEY)
technode models                # list the compressed catalog
technode infer "Explain quantization in one line." --model qwen2.5-7b

Zero dependencies — pure Python stdlib, runs anywhere Python ≥3.8 does.

Commands

Command What it does
technode login [key] Save your API key to ~/.technode/config.json (chmod 600).
technode logout Remove the saved key.
technode models [--json] List available models (id, quantization, role).
technode infer PROMPT [-m MODEL] [-n MAX_TOKENS] [-t TEMP] [--json] [-q] Text generation. - or piped stdin reads the prompt from stdin.
technode whoami Show the active key (masked) + endpoint.
technode gpu lease/list/status/release Rent a whole GPU (Jupyter lab session).

Become a provider (share your GPU)

Got an NVIDIA Linux box? Join the grid and serve models — outbound-only, works behind any NAT (no Tailscale, no inbound ports):

technode provider register --gpu "RTX 4090" --vram 24
technode provider serve --llama-server /path/to/llama-server   # pull-mode worker (llama.cpp)
technode provider status

Data-center / IDC GPUs (B200·B300·H100, multi-GPU)

For datacenter GPU nodes, use the vLLM backend with tensor-parallel across GPUs:

pip install -U technode-cli vllm        # vLLM needs CUDA GPUs + drivers
technode provider register --gpu "8x B200" --vram 1440
technode provider serve --backend vllm --tp 8     # 8-GPU tensor-parallel
technode provider install --backend vllm --tp 8   # → systemd unit (boot persistence)

One-shot onboarding (detects GPUs, installs, registers, serves):

curl -fsSL https://technode.network/idc.sh | bash -s -- --name "IDC-node-01"

serve polls the broker for jobs it can run, executes them on your GPU, and returns the results. Needs a llama.cpp llama-server binary (CUDA build for NVIDIA) and operator approval before it receives live jobs.

Configuration

Setting Env var Default
API key TECHNODE_API_KEY — (from technode login)
Endpoint TECHNODE_BASE_URL https://technode.network

Get a key (free beta): https://technode.network/developers

Examples

# pick a coder model
technode infer "Write a Python one-liner to flatten a list of lists." -m qwen2.5-coder-7b

# read the prompt from a file / pipe
cat prompt.txt | technode infer -

# machine-readable
technode infer "hi" --json

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

technode_cli-0.3.0.tar.gz (12.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

technode_cli-0.3.0-py3-none-any.whl (13.0 kB view details)

Uploaded Python 3

File details

Details for the file technode_cli-0.3.0.tar.gz.

File metadata

  • Download URL: technode_cli-0.3.0.tar.gz
  • Upload date:
  • Size: 12.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for technode_cli-0.3.0.tar.gz
Algorithm Hash digest
SHA256 affc12478ce6ce45fb2f914afe4ca9aab618b807ebd3e0f4ed41b8cff02be029
MD5 24defc785b783c60f024ce3d04e4b01c
BLAKE2b-256 6e0b230b2dd3a904acf6f1f5c0d517ca5ff4cd74cf6fd8826d1784dca89bb1dc

See more details on using hashes here.

File details

Details for the file technode_cli-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: technode_cli-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 13.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for technode_cli-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7987c9714097f4d8584e159eed76c84852694041e49b2096df90e9841273cbc5
MD5 fb17e788c0f0629cfde48d1dc7a1afdd
BLAKE2b-256 caeaa21e40c3d7f1582515b5d32eff90f57b06a0573986588dbd1d0f68deb102

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page