A helper tool that processes text copied from PDF, removing newlines, replacing punctuation and more.
AfterCopy
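Sometimes you need to copy large passages of text out of a reader window for PDF, CAJ, or similar files, but the result is often unsatisfactory once pasted into Word: every line of the original layout becomes its own paragraph, and some Chinese punctuation marks turn into English punctuation with extra spaces. Fixing this by hand is tedious, so I wrote a simple script to handle it.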
Quick Start
pip install aftercopy
aftercopy -v
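Then copy text in your reader. When you paste, you get the already-processed result (line breaks removed, punctuation replaced). Because the tool cannot detect paragraph boundaries, you need to separate paragraphs manually.
Remember to close the tool when you are done, so that it does not interfere with normal copy and paste.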
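The processing described above (dropping line breaks, swapping punctuation) might look roughly like the sketch below. The rule table `CN_RULES` and the function name `clean_copied_text` are assumptions made for illustration; aftercopy's actual hard-coded rules are not listed on this page and may differ.

```python
import re

# Hypothetical rule set: English punctuation followed by a space is mapped
# back to the full-width Chinese mark. The real rules in aftercopy may differ.
CN_RULES = {
    ", ": "，",
    ". ": "。",
    "; ": "；",
    ": ": "：",
    "? ": "？",
    "! ": "！",
}


def clean_copied_text(text, rules=CN_RULES):
    """Join hard-wrapped lines, then apply the punctuation replacements."""
    # Remove every line break (\r\n, \r and \n) so the text becomes one paragraph.
    joined = re.sub(r"[\r\n]+", "", text)
    # Apply each replacement in turn.
    for old, new in rules.items():
        joined = joined.replace(old, new)
    return joined


if __name__ == "__main__":
    sample = "这是第一行, 它在这里\r\n被阅读器折断了. "
    print(clean_copied_text(sample))
```

Joining lines with an empty string suits Chinese text. For English text, a rule set would more likely join lines with a single space.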
Usage
aftercopy --help
Usage: aftercopy [OPTIONS]

Options:
  -p, --passive       Disable active reading from clipboard. Instead you can
                      paste into and copy from terminal. End your input with
                      Ctrl-Z + Enter (Windows) or Ctrl-D + Enter.
  -v, --verbose       Display the concrete re-copied text and more info.
  -l, --lang [cn|en]  Switch type of language in text. This will influence the
                      rule set used. (Chinese by default)
  --help              Show this message and exit.
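How it works
The clipboard is read every 0.01 seconds (the performance impact is negligible). If its contents have changed, the newly read text is processed and the result is written back to the clipboard.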
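A minimal sketch of such a polling loop follows. It is not aftercopy's actual source: the function name `watch_clipboard`, its parameters, and the `max_polls` stop condition are assumptions made so the example can run without a real clipboard.

```python
import time
from typing import Callable, Optional


def watch_clipboard(
    read: Callable[[], str],
    write: Callable[[str], None],
    process: Callable[[str], str],
    interval: float = 0.01,
    max_polls: Optional[int] = None,
) -> None:
    """Poll the clipboard and rewrite its contents whenever they change.

    read/write are injected so any clipboard backend can be plugged in.
    max_polls is a hypothetical stop condition; None means run forever.
    """
    last = read()  # Text already present at start-up is left untouched.
    polls = 0
    while max_polls is None or polls < max_polls:
        time.sleep(interval)
        polls += 1
        current = read()
        if current == last:
            continue  # Nothing new was copied.
        result = process(current)
        write(result)
        # Remember our own output so the next poll does not process it again.
        last = result
```

With a clipboard library such as the third-party `pyperclip`, the call could be `watch_clipboard(pyperclip.paste, pyperclip.copy, some_cleanup_function)`. Whether aftercopy itself uses that library is not stated here.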
TODO
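- Replacement rules. The punctuation replacement rules are currently hard-coded, which clearly makes the tool much less flexible. However, I have not yet found a good way to specify a rules file on each run or at install time.
- Typo detection. I have not found a convenient library for this.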
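As one possible direction for the first item, replacement rules could live in a JSON file that overrides built-in defaults. The sketch below is purely illustrative: `load_rules`, `DEFAULT_RULES`, and the JSON layout (an object mapping each string to its replacement) are all hypothetical and are not part of aftercopy.

```python
import json
from pathlib import Path
from typing import Dict, Optional

# Hypothetical built-in defaults, used when no rules file is given.
DEFAULT_RULES: Dict[str, str] = {", ": "，", ". ": "。"}


def load_rules(path: Optional[str] = None) -> Dict[str, str]:
    """Return the default rules, optionally overridden by a JSON file."""
    if path is None:
        return dict(DEFAULT_RULES)
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    # Accept only a flat object whose keys and values are all strings.
    if not isinstance(data, dict) or not all(
        isinstance(k, str) and isinstance(v, str) for k, v in data.items()
    ):
        raise ValueError("rules file must be a JSON object mapping strings to strings")
    # Entries from the file win over the defaults.
    return {**DEFAULT_RULES, **data}
```

How the path would reach the tool (a command-line option, an environment variable, or a per-user config file) is exactly the open question in the item above, so the sketch leaves it out.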
One more thing...
Do not use this tool for plagiarism or any other purpose that infringes on others' copyright.