Skip to main content

Automatically make Anki Decks for Chinese text

Project description

❗️
Pull requests welcome! If you think you can improve AA in any way, you are welcome to send me a message!

AutoAnki

Auto-Generate Anki Flashcards from a text file.

Converts Chinese books from text files to Anki decks.

Text to Anki

AutoAnki was created to help Chinese learners get into more advanced texts. It can be hard transitioning from graded readers to full books, and this tool was designed to help ease that process.

Usage

AutoAnki is both a library and a terminal tool. To get started, run pip install auto-anki This should install all the requirements. Then, in a Python file dofrom AutoAnki import AutoAnki, cli

To get started, first, create a database for AutoAnki to use

db_path = "AutoAnki.db"
if not AutoAnki.is_database(db_path):
    AutoAnki.create_autoanki_db(db_path)

Then create an instance of AutoAnki using the database

aa = AutoAnki(db_path)

Add whatever books you want in your deck. These can be text files, or a folder full of text files

bookpath = 'short-story.txt'
aa.add_book(bookpath, 'My first book😆')

Once all of your books are added, the definitions need to be found, and then you can create a deck!

aa.complete_unfinished_definitions()
aa.create_deck("AutoAnki Deck", "output")

This will automatically have the .apkg extension, which Anki uses. Go into Anki, and import this file. You should see your deck in the list:

Anki deck view

When getting definitions for words, it uses selenium to scrape definitions from a dictionary website

Check your version of Chrome, and put the same version driver in the same directory as main. Drivers can be found here ---> https://chromedriver.chromium.org/downloads

Terminal

If you would like to run AutoAnki from the terminal, run AutoAnki/terminal.py
This will run a terminal interface where you will be able to run

How it works

AutoAnki interfaces has 4 components on the back end:

  1. BookCleaner: Cleans the input coming in from files that the user supplies
  2. DatabaseManager: Takes the cleaned input and puts it into the database
  3. Dictionary: Finds definitions for words in the database
  4. DeckManager: takes information from the user and database, and makes an Anki deck out of it

Database:

There are 3 different types of tables in the DB,

  1. dictionary, which holds all the definitions and other information
  2. book_list, which holds the titles of the books in the database
  3. book, based off of the name of the book, and holds the words in the book to find in the dictionary

dictionary:

  • word_id
  • word
  • word_traditional
  • word_type (noun, verb, etc.)
  • pinyin
  • pinyin_numbers
  • number_of_strokes
  • sub_components
  • frequency
  • hsk_level
  • top_level
  • audio_path
  • image_path
  • definition

Dictionary table Dictionary table

book_list:

  • book_name
  • book_table_name
  • language

Book list table Book list table

book_example

  • book_table_word_id
  • dictionary_word_id
  • number_of_appearances

Book table Book table

Planned features

  • Currently, all definitions are collected from the internet. This is really slow, so using a dictionary from a file would greatly improve performance. It could fall back on the web dictionary for words it can't find
  • More customization for how the cards in the deck look, and what they have on them
  • The ability to hear what a word sounds like. The database supports this, I just need combine the sounds, and upload the sounds with the cards.
  • There are a few TODOs in the code that I would like to fix

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoanki-1.0.0.tar.gz (15.4 kB view details)

Uploaded Source

Built Distribution

autoanki-1.0.0-py3-none-any.whl (16.4 kB view details)

Uploaded Python 3

File details

Details for the file autoanki-1.0.0.tar.gz.

File metadata

  • Download URL: autoanki-1.0.0.tar.gz
  • Upload date:
  • Size: 15.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.6

File hashes

Hashes for autoanki-1.0.0.tar.gz
Algorithm Hash digest
SHA256 2c3eea704e10a666e783ad3db054828cd76e52dc346c8053d7087c1277cae320
MD5 e6cd4d80ce41dae6307cab75212df46b
BLAKE2b-256 adad7a5e62e715118e73583b8a9853a7e5c1a8bb10c9ea3686abb9a6fb2c9161

See more details on using hashes here.

File details

Details for the file autoanki-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: autoanki-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 16.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.6

File hashes

Hashes for autoanki-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fd62223c3283add01931e78f7da3d418c20b44a41a23aa719afc607b7638ee97
MD5 b4150e6f45242e00eb0e21b57322d9ff
BLAKE2b-256 9203dc50bd5b832ddb4eac7405469ffcb3092e215949c75cb53b9181055c31ab

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page