MCP服务：从抖音分享链接提取音频、进行语音识别并整理对话内容

These details have not been verified by PyPI

Project links

Project description

抖音语音识别MCP服务

这是一个MCP（Model Context Protocol）服务，用于从抖音分享链接中提取音频、进行语音识别，并使用AI大模型整理对话内容。

功能特性

解析抖音分享文本，自动提取短链接
通过RapidAPI获取抖音视频的MP3音频地址
使用阿里云通义千问语音识别模型进行音频转文字
使用阿里千问大模型整理对话内容，识别说话人角色
支持异步任务处理和自动轮询
返回结构化的对话JSON数组

项目结构

douyin-summary-mcp/
├── src/
│   ├── __init__.py
│   ├── douyin_parser.py          # 解析抖音分享文本，提取短链接
│   ├── rapidapi_client.py        # RapidAPI客户端，获取MP3地址
│   ├── aliyun_asr.py             # 阿里云语音识别客户端
│   ├── qwen_client.py            # 阿里千问大模型客户端，整理对话内容
│   ├── mcp_server.py             # 命令行入口（旧版）
│   └── mcp_protocol_server.py    # MCP协议服务器实现
├── mcp_server.py                 # MCP服务器启动脚本（主入口）
├── requirements.txt              # Python依赖
├── config.py                     # 配置文件（API Keys）
├── claude_desktop_config.example.json  # Claude Desktop配置示例
├── MCP_SETUP.md                  # MCP服务配置指南
└── README.md                     # 项目说明文档

安装

方式1：通过 pip 安装（推荐）

pip install douyin-summary-mcp

安装后，MCP服务器命令 douyin-summary-mcp 将可用。

方式2：从源码安装

1. 克隆项目

git clone <repository-url>
cd douyin-summary-mcp

2. 安装依赖

pip install -r requirements.txt

或者直接安装为开发模式：

pip install -e .

3. 配置环境变量

创建 .env 文件（参考 .env.example）：

# RapidAPI配置
RAPIDAPI_KEY=your_rapidapi_key_here

# 阿里云配置
DASHSCOPE_API_KEY=your_dashscope_api_key_here

获取API Keys：

RapidAPI Key: 访问 RapidAPI Douyin API New，订阅服务并获取API Key
阿里云DashScope API Key: 访问阿里云DashScope，获取API Key

使用方法

命令行使用

python3 -m src.mcp_server "7.46 z@G.vs YZZ:/ 08/31 随机进家做菜：叔叔吃了我烧的肉，觉得之前老伴做的不香了！# 抖音美食推荐官 # 随机进家做菜 # 美食 # 红烧肉 # 排骨汤 https://v.douyin.com/ybL9NO9RjKA/ 复制此链接，打开Dou音搜索，直接观看视频！"

Python代码使用

from src.mcp_server import process_douyin_share

share_text = "7.46 z@G.vs YZZ:/ 08/31 随机进家做菜... https://v.douyin.com/ybL9NO9RjKA/ 复制此链接..."

result = process_douyin_share(share_text)

if result["success"]:
    print(f"识别文本: {result['data']['recognized_text']}")
    if result['data'].get('formatted_dialogue'):
        import json
        print(f"格式化对话: {json.dumps(result['data']['formatted_dialogue'], ensure_ascii=False, indent=2)}")
else:
    print(f"处理失败: {result['message']}")

API说明

`process_douyin_share(share_text: str) -> Dict[str, Any]`

处理抖音分享链接的主函数。

参数：

share_text (str): 抖音分享文本

返回：

{
    "success": bool,           # 是否成功
    "message": str,            # 状态消息
    "data": {
        "douyin_url": str,      # 提取的抖音短链接
        "mp3_url": str,         # 获取的MP3地址
        "recognized_text": str, # 识别出的原始文本
        "formatted_dialogue": [ # 格式化后的对话数组（可选）
            {
                "speaker": "说话人1",  # 说话人角色
                "text": "说话内容"     # 说话内容
            },
            ...
        ]
    } or None
}

工作流程

解析分享文本: 使用正则表达式从分享文本中提取抖音短链接（如 https://v.douyin.com/ybL9NO9RjKA）
获取MP3地址: 调用RapidAPI的Douyin API接口，从短链接提取视频ID，获取视频的MP3音频地址
语音识别:
- 提交异步任务到阿里云语音识别服务（使用 qwen3-asr-flash-filetrans 模型）
- 轮询任务状态直到完成
- 从 transcription_url 下载识别结果JSON文件
- 提取识别出的文本内容（从 transcripts[0].text）
对话整理:
- 调用阿里千问大模型（qwen-plus）分析对话内容
- 识别不同的说话人角色
- 将对话分段并格式化为JSON数组
- 每个元素包含 speaker（说话人角色）和 text（说话内容）

技术细节

阿里云语音识别

模型: qwen3-asr-flash-filetrans
地域: 中国大陆（北京）
接口类型: 异步接口
支持时长: 最长12小时音频
API文档: 阿里云录音文件识别

RapidAPI

接口: https://douyin-api-app-web.p.rapidapi.com/web/aweme/detail
方法: POST
文档: RapidAPI Douyin API (APP - WEB)
说明: 需要先从抖音短链接提取视频ID，然后调用接口获取视频详情

阿里千问大模型

模型: qwen-plus
用途: 整理对话内容，识别说话人角色
输出格式: JSON数组，包含 speaker 和 text 字段

错误处理

服务包含完整的错误处理机制：

配置验证：启动时检查必要的环境变量
网络异常：处理API调用失败和超时
任务超时：设置最大轮询次数，避免无限等待
日志记录：详细记录处理过程，便于调试

MCP使用方式

本项目已实现完整的MCP（Model Context Protocol）协议支持，可以直接作为MCP服务器使用。

快速开始

安装包
```
pip install douyin-summary-mcp
```

配置Claude Desktop

编辑Claude Desktop配置文件（位置见下方），添加：

{
  "mcpServers": {
    "douyin-summary": {
      "command": "douyin-summary-mcp",
      "env": {
        "RAPIDAPI_KEY": "your_rapidapi_key_here",
        "DASHSCOPE_API_KEY": "your_dashscope_api_key_here"
      }
    }
  }
}

注意： 如果通过pip安装，直接使用 douyin-summary-mcp 命令即可，无需指定完整路径。

如果从源码安装，需要使用：

{
  "mcpServers": {
    "douyin-summary": {
      "command": "python3",
      "args": ["/绝对路径/douyin-summary-mcp/mcp_server.py"],
      "env": {
        "RAPIDAPI_KEY": "your_rapidapi_key_here",
        "DASHSCOPE_API_KEY": "your_dashscope_api_key_here"
      }
    }
  }
}

配置文件位置：

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Linux: ~/.config/Claude/claude_desktop_config.json

重启Claude Desktop

配置完成后重启Claude Desktop，即可使用 @douyin-summary 工具。

详细配置指南

详细的配置步骤和故障排除，请参考 MCP_SETUP.md

其他使用方式

方式1：命令行调用

python3 -m src.mcp_server "抖音分享文本..."

方式2：Python模块调用

from src.mcp_server import process_douyin_share

share_text = "抖音分享文本..."
result = process_douyin_share(share_text)

作为Python模块调用

from src.mcp_server import process_douyin_share

# 处理抖音分享链接
share_text = "抖音分享文本..."
result = process_douyin_share(share_text)

if result["success"]:
    # 获取原始识别文本
    recognized_text = result['data']['recognized_text']
    
    # 获取格式化后的对话
    formatted_dialogue = result['data'].get('formatted_dialogue')
    if formatted_dialogue:
        for item in formatted_dialogue:
            print(f"{item['speaker']}: {item['text']}")

返回数据示例

{
  "success": true,
  "message": "处理成功",
  "data": {
    "douyin_url": "https://v.douyin.com/ybL9NO9RjKA",
    "mp3_url": "https://example.com/audio.mp3",
    "recognized_text": "完整的识别文本...",
    "formatted_dialogue": [
      {
        "speaker": "主持人",
        "text": "这是一期东北大丰收的保姆级教程..."
      },
      {
        "speaker": "主持人",
        "text": "这道菜需要用到一口大铁锅..."
      }
    ]
  }
}

注意事项

确保音频文件URL是公网可访问的
阿里云语音识别是异步接口，可能需要等待较长时间（取决于音频长度）
RapidAPI接口可能需要订阅才能使用
千问大模型调用需要消耗API额度，请合理使用
请妥善保管API Keys，不要提交到版本控制系统
如果步骤4（对话整理）失败，仍会返回步骤3的识别文本，保证流程不中断

许可证

[根据项目需要添加许可证信息]

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.1

Dec 25, 2025

This version

1.0.0

Dec 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

douyin_summary_mcp-1.0.0.tar.gz (18.8 kB view details)

Uploaded Dec 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

douyin_summary_mcp-1.0.0-py3-none-any.whl (17.1 kB view details)

Uploaded Dec 25, 2025 Python 3

File details

Details for the file douyin_summary_mcp-1.0.0.tar.gz.

File metadata

Download URL: douyin_summary_mcp-1.0.0.tar.gz
Upload date: Dec 25, 2025
Size: 18.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for douyin_summary_mcp-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`18d39b469ba171f852b04ce6e9e6f751c0e5aceab36fa32697df93ca0b8759e3`
MD5	`223e728f0d9c95f97b4c33983ffdfd2a`
BLAKE2b-256	`c8746b4a1806dc107d44185387dfc9c45b1445007b0771a55370ff5756a522da`

See more details on using hashes here.

File details

Details for the file douyin_summary_mcp-1.0.0-py3-none-any.whl.

File metadata

Download URL: douyin_summary_mcp-1.0.0-py3-none-any.whl
Upload date: Dec 25, 2025
Size: 17.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for douyin_summary_mcp-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`64bbebc78819111f0a019ca1817dcda15e3d7a2314e2ea55196a7baa79825f4f`
MD5	`270ee01be6272f95843cc31d93596fc5`
BLAKE2b-256	`c2c0ea086988aa4c42bb0253ea9bd2a086725be0e172c1de54ba8b38a5bd993a`

See more details on using hashes here.

douyin-summary-mcp 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

抖音语音识别MCP服务

功能特性

项目结构

安装

方式1：通过 pip 安装（推荐）

方式2：从源码安装

1. 克隆项目

2. 安装依赖

3. 配置环境变量

使用方法

命令行使用

Python代码使用

API说明

process_douyin_share(share_text: str) -> Dict[str, Any]

工作流程

技术细节

阿里云语音识别

RapidAPI

阿里千问大模型

错误处理

MCP使用方式

快速开始

详细配置指南

其他使用方式

方式1：命令行调用

方式2：Python模块调用

作为Python模块调用

返回数据示例

注意事项

许可证

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`process_douyin_share(share_text: str) -> Dict[str, Any]`