Skip to main content

A simple, effective sentence mining tool.

Project description

VocabSieve - a simple sentence mining tool

https://ci.appveyor.com/api/projects/status/32r7s2skrgm9ubva?svg=true https://img.shields.io/pypi/v/vocabsieve.svg Downloads

Join our chat on Matrix or Telegram

VocabSieve (formerly Simple Sentence Mining, ssmtool) is a program for sentence mining, in which sentences with target vocabulary words are collected and added into a spaced repetition system (SRS, e.g. Anki) for language learning.

Demo

Features

  • Double-click lookups from copied sentences and even faster lookups from integrated applications
  • Lemmatization of words on lookup
  • No internet is required at all if you use downloaded resources
  • Online and local dictionaries in multiple formats (StarDict, Migaku, plain JSON)
  • Frequency lists and pronunciations
  • Web reader for ebooks and long articles (epub, fb2, plaintext), allowing one-click lookup
  • Kindle highlights to Anki sentence cards (KOReader support is planned)

For a detailed list of features and language support data, please consult the wiki page.

Tutorials

wiki page (The text originally on this document or the blog post has since been moved there, with some updates)

New video tutorial

Windows and Mac users: If you want to install this program, go to Releases and from the latest release, download the appropriate file for your operating system.

Linux distro packages

Packaging status

Gentoo

First, you need to add the ::guru overlay. Skip this section if you already had done so.

# eselect repository enable guru
# emaint -r guru sync

Install the package: # emerge -av app-misc/vocabsieve

Arch

Use your favorite AUR helper (or manually) to install the pacakge vocabsieve

Other distros

At this time, there are no packages for other distributions. If you are able to create packages for them, please tell me!

In the meantime, users should simply use pip3 to install VocabSieve: pip3 install --user vocabsieve.

This should install an executable and a desktop icon and behave like any other GUI application you may have.

Development

To run from source, simply use pip3 -r requirements.txt and then python3 vocabsieve.py.

Alternatively, you can also install a live version to your python package library with pip3 install . (Add --user if there is a permission error)

For debugging purposes, set the environmental variable VOCABSIEVE_DEBUG to any value. This will create a separate profile (settings and databases for records and dictionaries) so you may perform tests without affecting your normal profile. For each different value of VOCABSIEVE_DEBUG, a separate profile is generated. This can be any number or string.

Note that VocabSieve is unable to delete old profiles. You must do so yourself based on your operating system's locations.

API documentation

If you want to leverage VocabSieve to build your own plugins/apps, you can refer to the API Documentation

Note that VocabSieve is still alpha software. API is not guaranteed to be stable at this point.

Feedback

You are welcome to report bugs, suggest features/enhancements, or ask for clarifications by opening a GitHub issue.

Donations

Send me some Monero to support this work!

XMR Address: 89AZiqM7LD66XE9s5G7iBu4CU3i6qUu2ieCq4g3JKacn7e1xKuwe2tvWApLFvhaMR47kwNzjC4B5VL3N32MCokE2U9tGXzX

Monero is a private, censorship-resistant cryptocurrency. Transactions are anonymous and essentially impossible to track by authorities or third-party analytics companies.

Learn more about Monero

If you do not have any Monero, a good way to get it is through ChangeNow or SimpleSwap.

Credits

The definitions provided by the program by default come from English Wiktionary, without which this program would never have been created.

App icon is made from icons by Freepik available on Flaticon.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vocabsieve-0.8.0.tar.gz (182.9 kB view details)

Uploaded Source

Built Distribution

vocabsieve-0.8.0-py3-none-any.whl (190.4 kB view details)

Uploaded Python 3

File details

Details for the file vocabsieve-0.8.0.tar.gz.

File metadata

  • Download URL: vocabsieve-0.8.0.tar.gz
  • Upload date:
  • Size: 182.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for vocabsieve-0.8.0.tar.gz
Algorithm Hash digest
SHA256 5e29af389f1944d00b4ecae7e38d8b44d8d45f83ad9b720d38fac2fc605bf969
MD5 6b6579c6f46a26f0245e8b81e325c316
BLAKE2b-256 c67eb0b66d628b0881a8034583489ed7c856cb4d6f59ab17e53768353d81f305

See more details on using hashes here.

Provenance

File details

Details for the file vocabsieve-0.8.0-py3-none-any.whl.

File metadata

  • Download URL: vocabsieve-0.8.0-py3-none-any.whl
  • Upload date:
  • Size: 190.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for vocabsieve-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 63e1cef65478bd516aca6071dc5322df45ffffe2988bcdb4cf0f0ce255aa6e85
MD5 d3f87ef0c167dc7a661bb8ddd8ddb761
BLAKE2b-256 1462655990dc70c93a182c8369c2e9b042b69065a014d77d823c6d1eeccaf70e

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page