Skip to main content

Add your description here

Project description

Pyetool

A personal efficiency enhancing tool.

Sprint 1 — Resource objects (done)

  • pexel api
  • video downloader
  • video resource object, understand method
  • audio resource object, transcribe method
  • bilibili page object, data & action

Sprint 2 — More resource types

  • GIF resource object, understand method
  • music resource object, understand method
  • resource management

Sprint 3 — Video composition

  • use Remotion to make descriptive video clip

Sprint 4 — Video topic research toolkit

为选题决策提供数据支撑:探索新主题或验证已有灵感。Pipeline:

[关键词/种子] → [找视频] → [视频+评论分析] → [角度卡]

P1: 标题相关性过滤 - done

搜索结果噪声率高(实测「结构主义」:1391 条中真正相关约 24%)。在拉评论 / 转录 / 分析之前,按标题用 LLM 判定与目标主题的相关性。

  • 三档输出(高度相关 / 弱相关 / 不相关)+ 一句理由
  • CLI 子命令,jsonl → jsonl,可与 search 管道串联

P2: 评论 & 弹幕收集

  • 评论:B 站 API 拉 top N 热评(按点赞排序),反映"观众关注什么"
  • 弹幕:反映"瞬时反应点",跟评论是不同维度
  • 两者用途不同,可分两步实现

P3: 单点深挖原语

"好主题来自迭代深挖"——需要的物理操作:

  • 给定 UP 主 → 拉全部投稿(最高频,先做这一个)
  • 给定视频 → 拉 B 站相关推荐
  • 给定视频 → 拉同分区 / 同 tag 热门

Misc: ASR 段落化

SeedASR / SeedASRFlash 当前输出只有标点 + utterance(句子级)切分,无段落结构, 长文阅读体验差。两个 API 都没有段落参数(已核对参数列表),需要客户端后处理 (基于语义 / 话题切分归并段落,LLM 或规则均可)。

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyetool-0.29.0-py3-none-any.whl (103.2 kB view details)

Uploaded Python 3

File details

Details for the file pyetool-0.29.0-py3-none-any.whl.

File metadata

  • Download URL: pyetool-0.29.0-py3-none-any.whl
  • Upload date:
  • Size: 103.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for pyetool-0.29.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6a325b8656164e2f0f8c745c82a548fea7b8e8750d5208fe6c28d0fb0be20cc5
MD5 c806fb297b0b98230aa2a5c1ac0a3e19
BLAKE2b-256 7ac76683d7ec2d573323cc938a35b222c0fed68c78e6ca49fdfdd90c911870ba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page