ParseHub

A social media aggregation parser with AI summarization support

Video summarization uses the whisper-1 model.

Telegram bot built on this project:
@ParsehubBot | https://github.com/z-mio/parse_hub_bot

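Supported platforms:
Twitter: video | image/text posts
Instagram: video | image/text posts
Weibo: video | image/text posts
Tieba: video | image/text posts
Xiaohongshu: video | image/text posts
YouTube: video | music
Facebook: video
Bilibili: video | feed posts
Douyin | TikTok: video | image/text posts
WeChat Official Accounts: image/text posts
Zuiyou: video | image/text posts
Coolapk: video | image/text posts
...and more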

Installation

pip install parsehub


Important

Linux users may see the following error when importing the skia-python package:

libGL.so.1: cannot open shared object file: No such file or directory

Windows users who do not have the Microsoft Visual C++ Runtime installed may see the following error:

ImportError: DLL load failed while importing skia: The specified module could not be found.

Solution

Ubuntu users

# Ubuntu 22
apt install libgl1-mesa-glx
# Ubuntu 24
apt install libgl1 libglx-mesa0

Arch Linux users

pacman -S libgl

CentOS users

yum install mesa-libGL -y

Windows users

Download and install: Microsoft Visual C++ 2015 Redistributable Update 3 RC
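Before installing system packages, it can help to confirm which prerequisite is actually missing. The following is a minimal diagnostic sketch using only the Python standard library; the function name `check_skia_prerequisites` is hypothetical and not part of parsehub or skia-python. It only reports whether a `skia` module can be located and, on Linux, whether the dynamic linker can find a GL library. It does not check the Visual C++ Runtime on Windows.

```python
import ctypes.util
import importlib.util
import platform


def check_skia_prerequisites() -> dict:
    """Collect a few facts relevant to the skia-python import errors above.

    Hypothetical helper for illustration; not part of parsehub.
    """
    report = {
        "system": platform.system(),
        # True if a module named "skia" can be located (it is not imported here).
        "skia_installed": importlib.util.find_spec("skia") is not None,
    }
    if report["system"] == "Linux":
        # find_library returns a library file name if one is found, else None.
        report["libgl"] = ctypes.util.find_library("GL")
    return report


if __name__ == "__main__":
    for key, value in check_skia_prerequisites().items():
        print(f"{key}: {value}")
```

If `libgl` is reported as `None` on Linux, installing the package listed for your distribution above is the likely fix.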

Usage

from parsehub import ParseHub
from parsehub.config import ParseConfig, DownloadConfig
import asyncio


async def main():
    ph = ParseHub(config=ParseConfig())
    result = await ph.parse('https://twitter.com/aobuta_anime/status/1827284717848424696')
    print(result)

    # dr = await result.download()
    # print(dr.media)
    # sr = await dr.summary()
    # await dr.delete()

    sr = await result.summary(download_config=DownloadConfig())
    print(sr.content)


if __name__ == '__main__':
    asyncio.run(main())
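Since `parse` is a coroutine, several links can be parsed concurrently. The sketch below is one possible way to do that with `asyncio.gather` and a semaphore. The helper `parse_many` and its `limit` parameter are hypothetical and not part of parsehub; it accepts any async callable, so it makes no assumptions about the library beyond the `await ph.parse(url)` call shown above. Whether a single `ParseHub` instance is safe to share across concurrent calls is an assumption you should verify.

```python
import asyncio
from typing import Any, Awaitable, Callable


async def parse_many(
    parse: Callable[[str], Awaitable[Any]],
    urls: list[str],
    limit: int = 3,
) -> dict[str, Any]:
    """Run `parse` over several URLs concurrently, at most `limit` at a time.

    Returns a dict mapping each URL to its result, or to the exception
    that was raised while parsing it. Hypothetical helper for illustration.
    """
    semaphore = asyncio.Semaphore(limit)

    async def guarded(url: str) -> Any:
        # The semaphore caps how many parse calls are in flight at once.
        async with semaphore:
            return await parse(url)

    # return_exceptions=True keeps one failing URL from cancelling the rest.
    results = await asyncio.gather(
        *(guarded(url) for url in urls), return_exceptions=True
    )
    return dict(zip(urls, results))
```

Inside `main()` this could be called as `results = await parse_many(ph.parse, urls)`, then each value checked with `isinstance(value, Exception)` before use.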

Environment variables

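| Name | Description | Default |
| --- | --- | --- |
| PROVIDER | Model provider; supported: openai | openai |
| API_KEY | API key | |
| BASE_URL | API base URL | https://api.openai.com/v1 |
| MODEL | Model used for AI summarization | gpt-4o-mini |
| PROMPT | Prompt used for AI summarization | You are a useful assistant to summarize the main points of articles and video captions. Summarize 3 to 8 points in "Simplified Chinese" and summarize them all at the end. |
| TRANSCRIPTIONS_PROVIDER | Speech-to-text provider; supported: openai, azure, fast_whisper | openai |
| AZURE_SPEECH_REGION | Azure speech-to-text endpoint | |
| AZURE_SPEECH_KEY | Azure speech-to-text key | |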
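These variables can be exported in the shell or set from Python. The sketch below shows one way to set them from a script without overwriting values that are already present. The helper `apply_settings` and the `SETTINGS` dict are hypothetical, and the API key is a placeholder. When parsehub reads these variables is not stated here, so calling the helper before importing `parsehub` is an assumption made to be safe.

```python
import os
from typing import MutableMapping

# Example values taken from the defaults listed above; API_KEY is a placeholder.
SETTINGS = {
    "PROVIDER": "openai",
    "API_KEY": "sk-your-key",  # placeholder, replace with your own key
    "BASE_URL": "https://api.openai.com/v1",
    "MODEL": "gpt-4o-mini",
    "TRANSCRIPTIONS_PROVIDER": "openai",
}


def apply_settings(
    settings: dict[str, str],
    environ: MutableMapping[str, str] = os.environ,
) -> dict[str, str]:
    """Copy settings into the environment, keeping values that already exist.

    Returns only the entries that were actually written.
    Hypothetical helper for illustration; not part of parsehub.
    """
    applied = {}
    for name, value in settings.items():
        if name not in environ:
            environ[name] = value
            applied[name] = value
    return applied


if __name__ == "__main__":
    # Assumption: run this before `import parsehub` so the values are visible
    # whenever the library reads them.
    print(sorted(apply_settings(SETTINGS)))
```

Passing a plain dict as `environ` makes the helper easy to try out without touching the real process environment.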

