A simple, effective sentence mining tool.
Project description
VocabSieve - a simple sentence mining tool
VocabSieve (formerly Simple Sentence Mining, ssmtool
) is a program for sentence mining, in which sentences with target vocabulary words are collected and added into a spaced repetition system (SRS) for language learning.
Features
- Double-click lookups from sentences and even faster lookups from integrated applications
- Lemmatization of words on lookup
- No internet is required if you use downloaded resources
- Online and local dictionaries in multiple formats
- Frequency lists and pronunciations
- Web reader (epub, fb2, plaintext) allowing one-click lookup
- Kindle highlights to Anki sentence cards (KOReader support is planned too)
For a detailed list of features and language support data, please consult the blog post on my blog
Tutorials
Text tutorial (The text originally on this document has since been moved there.)
Old video tutorial (Basic, somewhat outdated)
USERS: If you want to install it, go to Releases and from the latest release, download the appropriate file for your operating system.
Linux distro packages
Development
To run from source, simply use pip3 -r requirements.txt
and then python3 vocabsieve.py
.
Alternatively, you can also install a live version to your python package library with pip3 install .
(Add --user if there is a permission error)
For debugging purposes, set the environmental variable VOCABSIEVE_DEBUG
to any value. This will create a separate profile (settings and databases for records and dictionaries) so you may perform tests without affecting your normal profile. For each different value of VOCABSIEVE_DEBUG
, a separate profile is generated. This can be any number or string.
Note that VocabSieve is unable to delete old profiles. You must do so yourself based on your operating system's locations.
API documentation
If you want to leverage VocabSieve to build your own plugins/apps, you can refer to the API Documentation
Note that VocabSieve is still alpha software. API is not guaranteed to be stable at this point.
Feedback
You are welcome to report bugs, suggest features/enhancements, or ask for clarifications by opening a GitHub issue.
Donations
Send me some Monero to support this work!
XMR Address: 89AZiqM7LD66XE9s5G7iBu4CU3i6qUu2ieCq4g3JKacn7e1xKuwe2tvWApLFvhaMR47kwNzjC4B5VL3N32MCokE2U9tGXzX
Monero is a private, censorship-resistant cryptocurrency. Transactions are anonymous and essentially impossible to track by authorities or third-party analytics companies.
If you do not have any Monero, a good way to get it is through ChangeNow or SimpleSwap.
Credits
The definitions provided by the program by default come from English Wiktionary, without which this program would never have been created.
App icon is made from icons by Freepik available on Flaticon.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for vocabsieve-0.7.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5a7c072c7e4be0002d2d49db40b2e6426a7719b4ee3dc33e01afbcdee4d63e1e |
|
MD5 | aaaaf128c166b4a03ff3bd260bdfbb99 |
|
BLAKE2b-256 | 1b41ca676f14158386d485aedd68f14ec4701952bc3e1dcf994f90520990cf86 |