Skip to main content

WeChat Middleware for EH Forwarder Bot to convert voice to text, based on Baidu and Microsoft API.

Project description

VoiceRecogMiddleware: A middleware for EFB

Notice

Middleware ID: catbaron.voice_recog

VoiceRecogMiddleware is a middleware for EFB to recognize content of voice message. This middleware is modified based on ehForwarderBot 1.0

example

  • Baidu
    • Deprecated for now because my Baidu account is limited by Baidu
    • 由于我的Baidu账号不能正常登陆,因此对百度引擎的支持暂时停止。
    • API from Baidu is available for free.
    • You need to get API_KEY and SECRET_KEY following http://ai.baidu.com/docs#/ASR-API-PRO/top
  • Azure Cognitive Service
    • Azure Cognitive Service offers 5 audio hours per month for free.
    • You need to get Key1 and Endpoint from https://portal.azure.com/
  • IFlyTek
    • IFlyTek offers 500 queries per day fro free to new users. Additional free bundles are available from 50,000 queries/90 days to 100,000 queries/year.
    • You need to get APPID, APISecret and APIKey from https://console.xfyun.cn/services/iat
  • Tencent
  • You need to use VoiceRecogMiddleware on top of EFB. Please check the document and install EFB first.

Dependense

  • Python >= 3.6
  • EFB >= 2.0.0b15
  • pydub

Install and configuration

Install

pip install efb-voice_recog-middleware

Enable

Register to EFB Following this document to edit the config file. The config file by default is $HOME/.ehforwarderbot/profiles/default. It should look like:

master_channel: foo.demo_master
slave_channels:
- foo.demo_slave
- bar.dummy
middlewares:
- foo.other_middlewares
- catbaron.voice_recog

You only need to add the last line to your config file.

Configure the middleware

The config file by default is $HOME/.ehforwarderbot/profiles/default/catbaron.voice_recog. Please create the config file if thers is not one. Edit it as:

speech_api:
    baidu:
        api_key: API_KEY
        secret_key: SECRET_KEY
        # supported language:
        #   zh, zh-x-en, en, zh-yue, zh-x-sichuan, zh-x-farfield
        lang: zh
    azure:
        key1: KEY_1
        endpoint: ENDPOINT
        # supported language:
        #   ar-EG, ar-SA, ar-AE, ar-KW, ar-QA, ca-ES,
        #   da-DK, de-DE, en-AU, en-CA, en-GB, en-IN,
        #   en-NZ, en-US, es-ES, es-MX, fi-FI, fr-CA,
        #   fr-FR, gu-IN, hi-IN, it-IT, ja-JP, ko-KR,
        #   mr-IN, nb-NO, nl-NL, pl-PL, pt-BR, pt-PT,
        #   ru-RU, sv-SE, ta-IN, te-IN, zh-CN, zh-HK,
        #   zh-TW, th-TH, tr-TR
        lang: zh
    tencent:
        secret_id: SECRET_ID
        secret_key: SECRET_KEY
        # supported language: en, zh
        lang: en
    iflytek:
        app_id: APP_ID
        api_secret: API_SECRET
        api_key: APP_KEY
        # supported language: zh, en
        lang: en
auto: true

Replace the section with all-caps to your own ones.

Note that you may omit the section that you do not want to enable.


Turn off auto if you want to disable auto recognition to all voice messages. Alternatively, you may reply recog` to a voice message to recognise it.

Restart EFB.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

efb-voice_recog-middleware-1.0.1.tar.gz (12.0 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file efb-voice_recog-middleware-1.0.1.tar.gz.

File metadata

  • Download URL: efb-voice_recog-middleware-1.0.1.tar.gz
  • Upload date:
  • Size: 12.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.6.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.6.9

File hashes

Hashes for efb-voice_recog-middleware-1.0.1.tar.gz
Algorithm Hash digest
SHA256 0299a7e146628191db9725383b21cd6c33765401a1cb46cd8b5966c46bfed57b
MD5 fb621e835841cd0438022d4a934d70ca
BLAKE2b-256 360d2e3ebd88f440c1a7c05f3b7a1615830f1b76751dde70b6c75085d7933996

See more details on using hashes here.

File details

Details for the file efb_voice_recog_middleware-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: efb_voice_recog_middleware-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 13.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.6.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.6.9

File hashes

Hashes for efb_voice_recog_middleware-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 be7e86d7aff0a56e8c4fdb8c34ab683704bd38529e0875fa726510e43741597c
MD5 7855041d9fef1b22de97c1f1ed9d20b7
BLAKE2b-256 3e09944af3b7b24e0398038c8498b2a786996e841187a8bd3407356c63a894ba

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page