Skip to main content

Gemini STT CLI

Project description

iGSTT(Gemini STT)

iGSTT(Gemini STT) 开源免费的语音转文本(STT)的命令行工具。
iGSTT(Gemini STT) 是通过调用 Gemini STT 的接口,实现语音转文本(STT)的命令行工具。

添加 API key

# 编辑 .zshrc:
vim ~/.zshrc

# 添加信息(导入环境变量):
export GEMINI_API_KEY=<YOUR_API_KEY_HERE>

# 保存后应用更改:
source ~/.zshrc

安装 iGSTT

pip install igstt

运行 iGSTT

usage: igstt [-h] [-i INPUT] [-o OUTPUT] [text]

🎧 Gemini 语音转文本(STT)工具

positional arguments:
  text                 提示信息

options:
  -h, --help           show this help message and exit
  -i, --input INPUT    音频文件路径
  -o, --output OUTPUT  输出文件路径(默认与输入同名 .lrc)

实例

# 指定语音文本的内容
igstt "Describe this audio clip"

# 指定语音文本的文件 -i "input.mp3"
igstt -i "input.mp3"

# 指定语音文本的文件 -i "input.mp3"
# 指定语音保存的文件 -o "output.lrc"
igstt -i "input.mp3" -o "output.lrc"

实例转LRC

igstt -i input.wav
提示信息: Generate LRC, LRC content format: [minutes:seconds.milliseconds] English | Simplified Chinese
音频文件: input.wav
输出文件: input.lrc
🤖 Gemini 返回:
[00:00.060] Hello, |
[00:00.550] | 这是一个 Gemini TTS CLI 工具。
[00:04.650] | 记得关注哦。
#################### 已写入到 input.lrc

模型

gemini-2.5-flash

获取 API key

API key:https://aistudio.google.com/apikey

额度参数:

每分钟请求数 (RPM)
每分钟 token 数(输入)(TPM)
每日请求数 (RPD)

免费额度:

RPM:10
TPM:250000
RPD:250

额度详情:https://ai.google.dev/gemini-api/docs/rate-limits?hl=zh-cn

接口详情:https://ai.google.dev/gemini-api/docs/files?hl=zh-cn

软件信息

网站

https://stt.ichochy.com

GitHub

https://github.com/ichochy/igstt

PyPI

https://pypi.org/project/igstt

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

igstt-0.1.0.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

igstt-0.1.0-py3-none-any.whl (4.3 kB view details)

Uploaded Python 3

File details

Details for the file igstt-0.1.0.tar.gz.

File metadata

  • Download URL: igstt-0.1.0.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.8 Darwin/25.0.0

File hashes

Hashes for igstt-0.1.0.tar.gz
Algorithm Hash digest
SHA256 657be68e6350d7e19c5bbf6d908881328db778e442944710235a55a5674dc93b
MD5 2083e3372ccc791ce5609a0b8bf57cbc
BLAKE2b-256 f7491d82988b085e4d1508a1db40bc41a3002c884d1ddd1c2cc32b0b323f0664

See more details on using hashes here.

File details

Details for the file igstt-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: igstt-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.8 Darwin/25.0.0

File hashes

Hashes for igstt-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8345848647b9e8f5ac41b0f0704c378f8e30c2add51f6554e8ee2aba36415ffa
MD5 cfbb3aadb6be14dbaff907475fe3204d
BLAKE2b-256 1f7045b32e13ae06552b742b09fc74b218d2408815db6454d5479c81fd8cfbec

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page