Metapackage bundling qwen-think and qwen3.6-mtp under a shared namespace

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

AD17

These details have not been verified by PyPI

Project description

forge-infer

Metapackage bundling qwen-think and qwen3.6-mtp under a shared namespace.

Running pip install forge-infer pulls in qwen-think and qwen3.6-mtp as dependencies and re-exports their key APIs under a single forge namespace. The package is packaging and narrative only; it adds no new code.

Why this exists

qwen-think (thinking-mode session control) and qwen3.6-mtp (MTP speculative decoding) are two focused packages that belong together. forge-infer gives them a shared identity so you can recommend, install, and document them as a unit instead of scattering links across READMEs.

Install

pip install forge-infer

This installs both qwen-think and qwen3.6-mtp automatically.
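To confirm that both dependencies were pulled in, one option is to query the installed distribution metadata with the standard library. This is a minimal sketch: the helper name installed_versions is hypothetical and not part of forge-infer, and the distribution names are the ones given in the install text above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    wanted = ["forge-infer", "qwen-think", "qwen3.6-mtp"]
    for name, ver in installed_versions(wanted).items():
        print(f"{name}: {ver or 'not installed'}")
```

A None value for either dependency would suggest the install did not complete as described.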

Quick start

Thinking sessions (qwen-think)

Control when and how Qwen3.6 "thinks": set token budgets, toggle thinking on or off mid-conversation, and route requests by complexity.

from forge.session import ThinkingSession

session = ThinkingSession(model="Qwen/Qwen3.6-27B")
response = session.chat("Explain merge sort", thinking=True)
print(response)
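Routing by complexity can be pictured as choosing the thinking flag per prompt. The sketch below is an illustration only, not qwen-think's routing logic: needs_thinking, routed_chat, and EchoSession are hypothetical names, the keyword heuristic is made up, and it assumes chat also accepts thinking=False (only thinking=True is shown above).

```python
def needs_thinking(prompt, word_limit=12):
    """Hypothetical heuristic: long prompts or reasoning keywords enable thinking."""
    keywords = ("prove", "explain", "derive", "debug", "why")
    text = prompt.lower()
    return len(text.split()) > word_limit or any(k in text for k in keywords)


def routed_chat(session, prompt):
    """Call session.chat with the thinking flag chosen by the heuristic."""
    return session.chat(prompt, thinking=needs_thinking(prompt))


class EchoSession:
    """Stand-in for ThinkingSession so the sketch runs without a model."""

    def chat(self, prompt, thinking=False):
        return f"[thinking={thinking}] {prompt}"


if __name__ == "__main__":
    session = EchoSession()
    print(routed_chat(session, "Explain merge sort"))
    print(routed_chat(session, "Hi there"))
```

With a real session, EchoSession would be replaced by the ThinkingSession instance created above, provided its chat method accepts the same arguments.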

MTP speculative decoding (qwen3.6-mtp)

Tune multi-token prediction for throughput, find crossover points, generate backend configs.

from forge.mtp import recommend, quick_crossover, vllm_mtp_command, sglang_mtp_command
from forge.mtp import UseCase, Objective

# Get a recommendation for your hardware
rec = recommend(use_case=UseCase.SINGLE_USER, objective=Objective.MINIMIZE_LATENCY, gpu_id="rtx-4090")
print(rec.enable, rec.expected_gain)

# Find the batch size where MTP's gain flips from positive to negative
for s in quick_crossover(gpu_id="rtx-3090"):
    print(f"MTP-{s.spec_tokens}: crossover at batch {s.crossover_batch_size}")

# Generate serve commands
print(vllm_mtp_command(model="Qwen/Qwen3.6-27B", num_speculative_tokens=2).command)
print(sglang_mtp_command(model="Qwen/Qwen3.6-27B", num_speculative_tokens=2).command)
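The idea of a crossover point can be made concrete with a small, self-contained sketch. It does not show how quick_crossover works internally, which the README does not describe; find_crossover is a hypothetical helper, and the throughput figures are made-up placeholders rather than measurements.

```python
def find_crossover(baseline, with_mtp):
    """Return the smallest batch size where MTP no longer beats the baseline.

    Both arguments map batch size -> throughput (for example tokens/s).
    Returns None if MTP wins at every batch size present in both mappings.
    """
    for batch in sorted(set(baseline) & set(with_mtp)):
        if with_mtp[batch] <= baseline[batch]:
            return batch
    return None


if __name__ == "__main__":
    # Made-up numbers for illustration only, not benchmark results.
    baseline = {1: 40.0, 4: 150.0, 16: 520.0, 64: 1400.0}
    with_mtp = {1: 62.0, 4: 210.0, 16: 540.0, 64: 1250.0}
    print(find_crossover(baseline, with_mtp))
```

In practice the two mappings would come from your own benchmark runs with and without speculative tokens on the target GPU.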

Architecture

How the packages relate:

+---------------------------------------------+
|             forge (metapackage)             |
+------------------+--------------------------+
|   forge.session  |       forge.mtp          |
|  (qwen-think)    |   (qwen3.6-mtp)          |
|                  |                          |
|  Thinking-mode   |  MTP speculative decode  |
|  session control |  tuning & backend config |
+------------------+--------------------------+
|              Qwen3.6 model family           |
+---------------------------------------------+

forge.session -- Re-exports ThinkingSession from qwen-think.
forge.mtp -- Re-exports recommend, quick_crossover, vllm_mtp_command, sglang_mtp_command, UseCase, Objective from qwen3.6-mtp.
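A re-export means the name in the facade module is bound to the same object as in the underlying package, not to a copy or wrapper. The sketch below shows the pattern with the standard library's json module standing in for an underlying package; make_facade and demo_facade are hypothetical names, and this is not forge-infer's actual source.

```python
import json
import types


def make_facade(name, source, exported):
    """Build a module object that re-exports selected names from `source`."""
    facade = types.ModuleType(name)
    for attr in exported:
        # Bind the same object under the same name; nothing is wrapped or copied.
        setattr(facade, attr, getattr(source, attr))
    facade.__all__ = list(exported)
    return facade


# Stand-in: json plays the role of an underlying package such as qwen-think.
demo = make_facade("demo_facade", json, ["dumps", "loads"])

if __name__ == "__main__":
    print(demo.dumps is json.dumps)  # identity check: the very same function
    print(demo.__all__)
```

Because the objects are identical, behavior and documentation of a re-exported name are whatever the underlying package provides.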

Individual packages

Package	What it does
qwen-think	Thinking-mode session management
qwen3.6-mtp	MTP speculative decoding tuner

What this package does NOT do

No new functionality -- strictly re-exports from the underlying packages
No CLI -- the libraries are Python-first
No model generalization -- wraps Qwen3.6-specific versions as-is

License

Apache 2.0

Release history

Version	Released
0.3.0	May 1, 2026
0.2.1	May 1, 2026
0.2.0	Apr 30, 2026
0.1.1	Apr 30, 2026
0.1.0 (this version)	Apr 29, 2026

Download files

Source Distribution

forge_infer-0.1.0.tar.gz (7.1 kB view details)

Uploaded Apr 29, 2026 Source

Built Distribution

forge_infer-0.1.0-py3-none-any.whl (7.4 kB view details)

Uploaded Apr 29, 2026 Python 3

File details

Details for the file forge_infer-0.1.0.tar.gz.

File metadata

Download URL: forge_infer-0.1.0.tar.gz
Upload date: Apr 29, 2026
Size: 7.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for forge_infer-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`4260852ea72fc60c11dac8691b1a3b2f2d4d5c595c4a9fe6214aed0cd2c7990c`
MD5	`6c4e9e014d216ae72b626f776151dc00`
BLAKE2b-256	`f533f86f18113a3925422586576d7b2ffc3664b4b9d60bec9b09a6edc87ce280`
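A downloaded file can be checked against the published SHA256 digest with the standard library. This is a minimal sketch: sha256_of and matches are hypothetical helper names, and the usage part assumes the sdist was saved under its original file name in the current directory.

```python
import hashlib
import os


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Compare a file's SHA256 with a published digest, ignoring case and whitespace."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Digest copied from the table above; the local path is an assumption.
    expected = "4260852ea72fc60c11dac8691b1a3b2f2d4d5c595c4a9fe6214aed0cd2c7990c"
    path = "forge_infer-0.1.0.tar.gz"
    if os.path.exists(path):
        print("match" if matches(path, expected) else "MISMATCH")
    else:
        print(f"{path} not found; download it first")
```

The same helpers apply to the wheel by substituting its file name and digest.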

Provenance

The following attestation bundles were made for forge_infer-0.1.0.tar.gz:

Publisher: publish.yml on ArkaD171717/Forge-LLM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: forge_infer-0.1.0.tar.gz
- Subject digest: 4260852ea72fc60c11dac8691b1a3b2f2d4d5c595c4a9fe6214aed0cd2c7990c
- Sigstore transparency entry: 1405943287
- Sigstore integration time: Apr 29, 2026
Source repository:
- Permalink: ArkaD171717/Forge-LLM@e38f1584f47ae3f9faccdc021d8e70a40d4033c1
- Branch / Tag: refs/tags/v0.1.0a
- Owner: https://github.com/ArkaD171717
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e38f1584f47ae3f9faccdc021d8e70a40d4033c1
- Trigger Event: release

File details

Details for the file forge_infer-0.1.0-py3-none-any.whl.

File metadata

Download URL: forge_infer-0.1.0-py3-none-any.whl
Upload date: Apr 29, 2026
Size: 7.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for forge_infer-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`440f9c772451e8a942ca67e8df3566edee5556289683c1fc956d7ace3a9fb731`
MD5	`96b14d84924d063fd5728c2d673569a0`
BLAKE2b-256	`c0ccaafac9e95248dcb5eb6bb3d583e7732128a0332c2045b786002b2990abf9`

Provenance

The following attestation bundles were made for forge_infer-0.1.0-py3-none-any.whl:

Publisher: publish.yml on ArkaD171717/Forge-LLM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: forge_infer-0.1.0-py3-none-any.whl
- Subject digest: 440f9c772451e8a942ca67e8df3566edee5556289683c1fc956d7ace3a9fb731
- Sigstore transparency entry: 1405943389
- Sigstore integration time: Apr 29, 2026
Source repository:
- Permalink: ArkaD171717/Forge-LLM@e38f1584f47ae3f9faccdc021d8e70a40d4033c1
- Branch / Tag: refs/tags/v0.1.0a
- Owner: https://github.com/ArkaD171717
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e38f1584f47ae3f9faccdc021d8e70a40d4033c1
- Trigger Event: release

forge-infer 0.1.0
