Syntax Error Data Enhancement
Project description
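Grammatical Error Data Augmentation
Grammatical error data is scarce in specialized industries. To address this, we propose a grammatical error substitution method that generates errors from domain-specific data, making it possible to build customized industry models.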
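Substitution is currently supported for 14 fine-grained error types: missing characters, wrong characters (typos), missing punctuation, misused punctuation, unclear subject, incomplete predicate, incomplete object, other incomplete components, redundant subject, redundant function words, other redundant components, improper word order, improper verb-object collocation, and other improper collocations, as shown in the figure below:
[Figure: examples of the supported error types]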
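To make the idea of error substitution concrete, here is a minimal sketch of three of the simpler, character-level error types (missing character, missing punctuation, misused punctuation). It is not the GrammarEnhancer API: the function names, the `PUNCTUATION` set, and the sample sentence are all assumptions made for illustration. The syntactic types (subject, predicate, object, and so on) would need linguistic analysis and are not attempted here.

```python
import random

# Assumed set of full-width punctuation marks; illustrative only.
PUNCTUATION = ",。!?;:、"


def _positions(sentence, want_punctuation):
    """Indices of punctuation (or non-punctuation) characters."""
    return [i for i, ch in enumerate(sentence)
            if (ch in PUNCTUATION) == want_punctuation]


def drop_character(sentence, rng):
    """Missing-character error: delete one non-punctuation character."""
    candidates = _positions(sentence, want_punctuation=False)
    if not candidates:
        return sentence
    i = rng.choice(candidates)
    return sentence[:i] + sentence[i + 1:]


def drop_punctuation(sentence, rng):
    """Missing-punctuation error: delete one punctuation mark."""
    candidates = _positions(sentence, want_punctuation=True)
    if not candidates:
        return sentence
    i = rng.choice(candidates)
    return sentence[:i] + sentence[i + 1:]


def swap_punctuation(sentence, rng):
    """Misused-punctuation error: replace one mark with a different one."""
    candidates = _positions(sentence, want_punctuation=True)
    if not candidates:
        return sentence
    i = rng.choice(candidates)
    replacement = rng.choice([p for p in PUNCTUATION if p != sentence[i]])
    return sentence[:i] + replacement + sentence[i + 1:]


rng = random.Random(0)  # seeded so repeated runs give the same corruption
sentence = "今天天气很好,我们去公园散步。"
for inject in (drop_character, drop_punctuation, swap_punctuation):
    print(inject.__name__, inject(sentence, rng))
```

Passing the random generator in explicitly keeps each corruption reproducible, which helps when the corrupted sentences are used as training data.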
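Multiple error types can also be applied to a single sentence, as follows:
[Figure: a sentence with several error types applied]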
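One way to picture applying several errors to one sentence is to chain independent corruption functions. The sketch below is an illustration under assumptions, not the package's implementation: `swap_adjacent`, `duplicate_character`, and `apply_errors` are hypothetical names, and the two corruptions are crude character-level stand-ins for the word-order and redundant-component error types.

```python
import random


def swap_adjacent(sentence, rng):
    """Crude word-order error: swap two neighbouring characters."""
    if len(sentence) < 2:
        return sentence
    i = rng.randrange(len(sentence) - 1)
    chars = list(sentence)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


def duplicate_character(sentence, rng):
    """Crude redundancy error: repeat one character in place."""
    if not sentence:
        return sentence
    i = rng.randrange(len(sentence))
    return sentence[:i + 1] + sentence[i] + sentence[i + 1:]


def apply_errors(sentence, injectors, rng):
    """Apply each corruption in order, feeding the output of one into the next."""
    for inject in injectors:
        sentence = inject(sentence, rng)
    return sentence


rng = random.Random(1)  # seeded for reproducibility
original = "我们去公园散步"
corrupted = apply_errors(original, [swap_adjacent, duplicate_character], rng)
print(original, "->", corrupted)
```

Because each corruption takes and returns a plain string, any subset can be combined in any order, and an empty list leaves the sentence unchanged.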
Model file download
Place ltp_small under pre_model. Download it from: https://huggingface.co/LTP/small
Awarded first prize in CCL 2024 Task 7
CCL 2024 Task 7: https://github.com/cubenlp/2024CCL_CEFE
Blog post sharing lessons learned: https://www.cnblogs.com/twnlp/p/18208637
Evaluation paper: forthcoming
Download files
File details
Details for the file grammarenhancer-1.1.tar.gz.
File metadata
- Download URL: grammarenhancer-1.1.tar.gz
- Upload date:
- Size: 7.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.19
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | c5e0a03a7dfa07dfa20a53835ea8c08d44b429fb6795c0b540f1b963c40501bf |
| MD5 | aab1522e7daac8c57e692517674b5311 |
| BLAKE2b-256 | 76d5cbf074521653063632cc57bb86e704626ac6000e02d294118e5e2ff94310 |
File details
Details for the file GrammarEnhancer-1.1-py3-none-any.whl.
File metadata
- Download URL: GrammarEnhancer-1.1-py3-none-any.whl
- Upload date:
- Size: 7.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.19
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | c71be35bb06e74effce511b7b6635ee8941b6eba2a2b5a2e599001cdf28385e1 |
| MD5 | b919d6431847d852b8ad2510bd1c67bf |
| BLAKE2b-256 | bc7902e31530f675c31ba79ea367b129204c7a94bfc20535fc9f6c0ef87a7682 |