Skip to main content

Text Correction wth deep learning

Project description

deepcorrect

Pre-trained models for punctuation correction (trained on google news, wikipedia and tatoeba) are available at https://drive.google.com/open?id=1Yd8cJaqfQkrJMbRVWIWtuyo4obTDYu-e

Demo of the punctuation model trained on google news corpus is available at http://bpraneeth.com/projects

This repo uses a seq2seq model written by me in keras with tensorflow backend. The multi-purpose seq2seq model can be found at https://github.com/bedapudi6788/txt2txt/

Usage:

from deepcorrect import DeepCorrect
corrector = DeepCorrect('params_path', 'checkpoint_path')
corrector.correct('hey')
'Hey!'

Installation:

pip install deepcorrect

Points to Note:

Max input and output lengths are 200

Segment text into sentences using https://github.com/bedapudi6788/deepsegment and run punctuation correction on each sentence seperately.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepcorrect-1.0.5.tar.gz (3.1 kB view details)

Uploaded Source

Built Distribution

deepcorrect-1.0.5-py2.py3-none-any.whl (14.6 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file deepcorrect-1.0.5.tar.gz.

File metadata

  • Download URL: deepcorrect-1.0.5.tar.gz
  • Upload date:
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.29.1 CPython/3.6.8

File hashes

Hashes for deepcorrect-1.0.5.tar.gz
Algorithm Hash digest
SHA256 0acea8565ec56de41b2eaa3f093d6b95590ff21c9b7fdae8a597358d3580c787
MD5 1e86239ca972811a1b0ba53966ab22a9
BLAKE2b-256 3a14ced14daa245ffaa51ffb37146550fdbb509d2e867e2e0ff9b7da22322f25

See more details on using hashes here.

File details

Details for the file deepcorrect-1.0.5-py2.py3-none-any.whl.

File metadata

  • Download URL: deepcorrect-1.0.5-py2.py3-none-any.whl
  • Upload date:
  • Size: 14.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.29.1 CPython/3.6.8

File hashes

Hashes for deepcorrect-1.0.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 7408c19c133d02c98a5d287564693223ff3b339f5f606d8893ee65d16c6a178f
MD5 b6bdb07612a43e6a67bc16350a8fb61b
BLAKE2b-256 1725e131f4ee1b2c2077613d5d764bfc88d0e0330523746a0aafb21ed712f06d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page