Skip to main content

ct-transformer punctuation model for fasr

Project description

fasr-punc-ct-transformer

基于 CT-Transformer 的标点恢复模型插件,为 fasr 提供中英文混合标点恢复能力。

安装

pip install fasr-punc-ct-transformer

注册模型

注册名 默认 checkpoint 说明
ct_transformer CTTransformerForPunc iic/punc_ct-transformer_zh-cn-common-vocab272727-pytorch CT-Transformer 标点恢复,支持中英文混合

模型权重默认从 ModelScope 自动下载。

使用方式

在流水线中使用

标点模型通常作为 sentencizer 组件在流水线末端使用:

from fasr import AudioPipeline

pipeline = (
    AudioPipeline()
    .add_pipe("detector", model="fsmn")
    .add_pipe("recognizer", model="paraformer")
    .add_pipe("sentencizer", model="ct_transformer")
)

单独使用模型

模型实例化时会自动执行 download_checkpoint() + load_checkpoint()

from fasr.config import registry

model = registry.punc_models.get("ct_transformer")()
spans = model.restore("今天天气真好我想出去玩你觉得呢")
for span in spans:
    print(span.text)  # 带标点的句子

# 使用自定义权重目录
model.load_checkpoint("/path/to/custom/ct_transformer")

运行期 / 会话参数

参数 类型 默认值 说明
checkpoint str | None "iic/punc_ct-transformer_zh-cn-common-vocab272727-pytorch" 远程 repo_id;非空时实例化会自动下载
cache_dir str | Path | None None 缓存目录,None 使用 fasr.utils.get_cache_dir()
endpoint Literal["modelscope", "huggingface", "hf-mirror"] "modelscope" 下载端点
disable_update bool True 禁用 FunASR 自动更新检查
disable_log bool True 禁用 FunASR 日志输出
disable_pbar bool True 禁用进度条

依赖

  • fasr
  • funasr
  • Python 3.10–3.12

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fasr_punc_ct_transformer-0.5.1.tar.gz (3.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fasr_punc_ct_transformer-0.5.1-py3-none-any.whl (5.3 kB view details)

Uploaded Python 3

File details

Details for the file fasr_punc_ct_transformer-0.5.1.tar.gz.

File metadata

  • Download URL: fasr_punc_ct_transformer-0.5.1.tar.gz
  • Upload date:
  • Size: 3.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.11 {"installer":{"name":"uv","version":"0.10.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for fasr_punc_ct_transformer-0.5.1.tar.gz
Algorithm Hash digest
SHA256 e7c8830f8460aa378dd8a191f7769ba0456921dc889dcdd090bc327c0817ed9c
MD5 13c9d9b9bffa72c1436bb691165b11a7
BLAKE2b-256 b51872bb9efabaeee30df71fd4ba569256be1c421d026c2f9e798b8f93e3631b

See more details on using hashes here.

File details

Details for the file fasr_punc_ct_transformer-0.5.1-py3-none-any.whl.

File metadata

  • Download URL: fasr_punc_ct_transformer-0.5.1-py3-none-any.whl
  • Upload date:
  • Size: 5.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.11 {"installer":{"name":"uv","version":"0.10.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for fasr_punc_ct_transformer-0.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d5e0bf45f5c809bdd11e9d748f46a255b91785efef9d595c0f8d272164a0810b
MD5 14778c378b1a60906c8c59383f4be0f4
BLAKE2b-256 66c5c97b6391ed59028fac7a475b760ced85ebfe2039a7267baf5236d9deaaef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page