
Gemini STT CLI

iGSTT (Gemini STT)

iGSTT (Gemini STT) is a free, open-source command-line tool for speech-to-text (STT).
It transcribes audio by calling the Gemini API.

Add the API key

# Edit .zshrc:
vim ~/.zshrc

# Add this line to export the environment variable:
export GEMINI_API_KEY=<YOUR_API_KEY_HERE>

# Save the file, then apply the change:
source ~/.zshrc
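
Before running the tool, it can help to confirm that the variable is visible to new processes. The following is a minimal Python sketch of such a check; the helper name `require_api_key` is hypothetical and is not part of igstt.

```python
import os


def require_api_key(env=os.environ, name="GEMINI_API_KEY"):
    """Return the API key from the environment, or raise if it is missing or blank."""
    # Hypothetical helper for illustration; igstt's own lookup may differ.
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(
            f"{name} is not set; export it in ~/.zshrc and run 'source ~/.zshrc'"
        )
    return key
```

Calling `require_api_key()` with no arguments reads the real environment; passing a plain dict makes the check easy to exercise without touching the shell.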

Install iGSTT

pip install igstt

Run iGSTT

usage: igstt [-h] [-i INPUT] [-o OUTPUT] [text]

🎧 Gemini speech-to-text (STT) tool

positional arguments:
  text                 prompt text

options:
  -h, --help           show this help message and exit
  -i, --input INPUT    path to the audio file
  -o, --output OUTPUT  path to the output file (default: same name as the input, with a .lrc extension)
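
An argument parser with this interface could look roughly like the sketch below. It is reconstructed from the help text only and is not igstt's actual source: `build_parser` and `resolve_output` are hypothetical names, and the default-output rule is an assumption inferred from the `-o` description.

```python
import argparse
from pathlib import Path


def build_parser():
    """Build a parser that mirrors the documented usage line (a sketch, not igstt's code)."""
    parser = argparse.ArgumentParser(
        prog="igstt", description="Gemini speech-to-text (STT) tool"
    )
    # Optional positional prompt, matching "[text]" in the usage line.
    parser.add_argument("text", nargs="?", help="prompt text")
    parser.add_argument("-i", "--input", help="path to the audio file")
    parser.add_argument("-o", "--output", help="path to the output file")
    return parser


def resolve_output(input_path, output_path=None):
    """Assumed default: reuse the input name with a .lrc extension."""
    if output_path:
        return Path(output_path)
    return Path(input_path).with_suffix(".lrc")
```

For example, `resolve_output("input.mp3")` yields `input.lrc`, while an explicit `-o` value is returned unchanged.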

Examples

# Pass a prompt as the positional argument
igstt "Describe this audio clip"

# Specify the audio file with -i
igstt -i "input.mp3"

# Specify the audio file with -i
# and the output file with -o
igstt -i "input.mp3" -o "output.lrc"

Example: converting to LRC

igstt -i input.wav
Prompt: Generate LRC, LRC content format: [minutes:seconds.milliseconds] English | Simplified Chinese
Audio file: input.wav
Output file: input.lrc
🤖 Gemini response:
[00:00.060] Hello, |
[00:00.550] | 这是一个 Gemini TTS CLI 工具。
[00:04.650] | 记得关注哦。
#################### Written to input.lrc
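
To post-process the generated file, each line can be split into a timestamp and its two language columns. The sketch below assumes the `[minutes:seconds.milliseconds] English | Simplified Chinese` layout shown in the prompt above; `parse_lrc_line` is a hypothetical helper, not something igstt provides.

```python
import re

# [mm:ss.fff] followed by free text; the fraction may have 1 to 3 digits.
LINE_RE = re.compile(r"^\[(\d+):(\d{2})\.(\d{1,3})\]\s*(.*)$")


def parse_lrc_line(line):
    """Return (start_ms, english, chinese), or None for lines without a timestamp."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    minutes, seconds, fraction, text = match.groups()
    # Right-pad the fraction so ".06" and ".060" both mean 60 ms.
    start_ms = (int(minutes) * 60 + int(seconds)) * 1000 + int(fraction.ljust(3, "0"))
    # Split on the first "|" only; either column may be empty.
    english, _, chinese = text.partition("|")
    return start_ms, english.strip(), chinese.strip()
```

Status lines such as the trailing `####################` line do not match the pattern and come back as `None`, so they can be filtered out.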

Model

gemini-2.5-flash

Get an API key

API key: https://aistudio.google.com/apikey

Quota parameters:

Requests per minute (RPM)
Tokens per minute, input (TPM)
Requests per day (RPD)

Free tier:

RPM: 10
TPM: 250,000
RPD: 250

Quota details: https://ai.google.dev/gemini-api/docs/rate-limits?hl=zh-cn
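
When transcribing many files in a row, a script can stay under the RPM limit by spacing its requests evenly. The sketch below shows one simple way to do that, assuming the free-tier figure of 10 RPM listed above (which may change); `MinIntervalThrottle` is a hypothetical class and does not describe how igstt itself behaves.

```python
import time


class MinIntervalThrottle:
    """Enforce a minimum gap between calls so a requests-per-minute cap is respected."""

    def __init__(self, requests_per_minute, clock=time.monotonic, sleep=time.sleep):
        self.interval = 60.0 / requests_per_minute  # 10 RPM gives 6 seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time at which the previous request was released

    def wait(self):
        """Block until the next request may start; return the delay applied in seconds."""
        now = self._clock()
        delay = 0.0
        if self._last is not None:
            delay = max(0.0, self._last + self.interval - now)
            if delay > 0:
                self._sleep(delay)
        self._last = now + delay
        return delay
```

A caller would create `MinIntervalThrottle(10)` once and call `wait()` before each request. This only covers RPM; the daily limit (RPD) and token limit (TPM) would need separate counters.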

API details: https://ai.google.dev/gemini-api/docs/files?hl=zh-cn

Software information

Website

https://stt.ichochy.com

GitHub

https://github.com/ichochy/igstt

PyPI

https://pypi.org/project/igstt
