My short description for my project.
Project description
nlp-qa-tfidf
介绍
tfidf 余弦短文本求相似度
package
gensim jieba pandas
实例化参数
Tfidf = Tfidf(stopwords_file="", work_dir="", work_file_prefix="")
#stopwords_file 停用词文件 可不填 包里自带有
#work_dir 工作目录 模型存储的路径,不填则在当前项目根目录下 建议填
#work_file_prefix 文件名字前缀, 不填则训练的模型名字为 _.model 可以填时间戳,或者用户id,建议填写 不然模型会相互覆盖
模型训练
# QA 场景
# 问题列表
question_list = [
"test1-question","test2-question","test3-question"
]
# 答案列表
answer_list = [
"test1-answer","test2-answer","test3-answer"
]
Tfidf = Tfidf(stopwords_file="", work_dir="", work_file_prefix="")
# 训练模型并保存
Tfidf.save_model(question_list=question_list,answer_list=answer_list)
# 输入问题,并去计算相似度
Tfidf.run(question="test question")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
nlp-tfidf-1.5.3.tar.gz
(6.2 kB
view hashes)
Built Distribution
Close
Hashes for nlp_tfidf-1.5.3-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b56b25ca6b5b353a31ba02554a9c2184b6750afcb17976c1f3b9a9c9b96b5f21 |
|
MD5 | 8b0acf29e648a1b723c899973faba449 |
|
BLAKE2b-256 | f43a74302992a5b9424324a45362e0fc82d273a8fadcad69a691dd905fa281c5 |