Wordle AI with SQL Backend


Usage

# Install this library via PyPI
pip install wordleaisql
# Then run the executable that comes with the library
wordleai-sql

# Alternatively, clone this repository and run without pip-install
python wordleai-sql.py

Notes:

  • By default, the program uses SQLite as the backend engine and creates a database file named "wordleai.db" in the working directory. For the default vocabfile of about 13K words, the file size is about 8.4GB.
  • For the first run, it will take a while to set up the database.
  • The setup time is significantly reduced if a C++ compiler (e.g., g++ or clang++) is available.

Session example

Hello! This is Wordle AI (SQLite backend).

12947 remaining candidates: ['yarco', 'knars', 'loamy', 'barps', 'dozed', 'yerks',
'reggo', 'rowth', 'spoom', 'rewin', '...']

Type:
  '[s]uggest <criterion>'     to let AI suggest a word
  '[u]pdate <word> <result>'  to provide new information
  '[e]xit'                    to finish the session

where
  <criterion>  is either 'max_n', 'mean_n', or 'mean_entropy'
  <result>     is a string of 0 (no match), 1 (partial match), and 2 (exact match)

> s
Start AI evaluation (2022-02-21 20:59:06.659721)
End AI evaluation (2022-02-21 20:59:22.748949, elapsed: 0:00:16.089228)
* Top 20 candidates ordered by mean_entropy
--------------------------------------------------------------------
  input_word         max_n        mean_n  mean_entropy  is_candidate
--------------------------------------------------------------------
       tares           856         301.3         7.464             1
       lares           830         287.7         7.509             1
       rales           830         291.0         7.544             1
       rates           856         310.2         7.562             1
       teras           856         335.8         7.582             1
       nares           822         304.5         7.592             1
       soare           767         302.9         7.598             1
       tales           864         324.4         7.604             1
       reais           766         303.7         7.609             1
       tears           856         348.4         7.627             1
       arles           830         313.6         7.629             1
       tores          1098         357.0         7.640             1
       salet           864         332.5         7.642             1
       aeros           799         308.9         7.646             1
       dares           985         351.2         7.649             1
       reals           830         327.8         7.660             1
       saner           837         316.9         7.660             1
       lears           830         330.2         7.671             1
       lores           983         338.9         7.683             1
       serai           695         314.2         7.685             1
--------------------------------------------------------------------
12947 remaining candidates: ['yarco', 'knars', 'loamy', 'barps', 'dozed', 'yerks',
'reggo', 'rowth', 'spoom', 'rewin', '...']

Type:
  '[s]uggest <criterion>'     to let AI suggest a word
  '[u]pdate <word> <result>'  to provide new information
  '[e]xit'                    to finish the session

where
  <criterion>  is either 'max_n', 'mean_n', or 'mean_entropy'
  <result>     is a string of 0 (no match), 1 (partial match), and 2 (exact match)

> u tares 10120
38 remaining candidates: ['inter', 'noter', 'ether', 'voter', 'roted', 'citer',
'luter', 'enter', 'oxter', 'cruet', '...']

Type:
  '[s]uggest <criterion>'     to let AI suggest a word
  '[u]pdate <word> <result>'  to provide new information
  '[e]xit'                    to finish the session

where
  <criterion>  is either 'max_n', 'mean_n', or 'mean_entropy'
  <result>     is a string of 0 (no match), 1 (partial match), and 2 (exact match)

> s
Start AI evaluation (2022-02-21 20:59:30.849712)
End AI evaluation (2022-02-21 20:59:32.386516, elapsed: 0:00:01.536804)
* Top 20 candidates ordered by mean_entropy
--------------------------------------------------------------------
  input_word         max_n        mean_n  mean_entropy  is_candidate
--------------------------------------------------------------------
       notum             8           4.2         1.691             0
       ontic             8           4.6         1.843             0
       mount             8           4.5         1.859             0
       motif             7           4.4         1.860             0
       moult            10           4.9         1.874             0
       muton             8           4.7         1.893             0
       potin             8           4.7         1.895             0
       lotic             8           4.7         1.914             0
       onium             8           4.7         1.914             0
       mohur             9           4.7         1.916             0
       optic             7           4.7         1.916             0
       minor             8           4.8         1.925             0
       humor             9           5.1         1.939             0
       pitot             7           5.1         1.960             0
       rutin             9           4.8         1.993             0
       milor             9           5.1         1.994             0
       piton             8           5.3         2.024             0
       oubit             7           4.7         2.027             0
       point             8           5.1         2.040             0
       opium             8           5.2         2.042             0
--------------------------------------------------------------------
38 remaining candidates: ['inter', 'noter', 'ether', 'voter', 'roted', 'citer',
'luter', 'enter', 'oxter', 'cruet', '...']

Type:
  '[s]uggest <criterion>'     to let AI suggest a word
  '[u]pdate <word> <result>'  to provide new information
  '[e]xit'                    to finish the session

where
  <criterion>  is either 'max_n', 'mean_n', or 'mean_entropy'
  <result>     is a string of 0 (no match), 1 (partial match), and 2 (exact match)

> u notum 01100
'other' should be the answer!

Thank you!
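The <result> strings in the session above (e.g. "10120" for the guess "tares" against the hidden word "other") follow the standard Wordle feedback rules. As an illustration (this is not the package's actual judge code), such a string can be computed like this:

```python
def judge(guess: str, answer: str) -> str:
    """Return a Wordle result string: '2' exact, '1' partial, '0' no match.
    Repeated letters are handled by letting each answer letter be
    consumed by at most one guess letter (exact matches first)."""
    result = ["0"] * len(guess)
    remaining = list(answer)
    # First pass: exact matches consume their answer letters.
    for i, (g, a) in enumerate(zip(guess, answer)):
        if g == a:
            result[i] = "2"
            remaining.remove(g)
    # Second pass: partial matches from the letters not yet consumed.
    for i, g in enumerate(guess):
        if result[i] == "0" and g in remaining:
            result[i] = "1"
            remaining.remove(g)
    return "".join(result)

print(judge("tares", "other"))  # → 10120, as in the first update above
print(judge("notum", "other"))  # → 01100, as in the second update above
```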

Suggestion criteria

There are three criteria for the candidate evaluation:

  • "max_n": Maximum number of candidate words that would remain.
  • "mean_n": Average number of candidate words that would remain.
  • "mean_entropy": Average of the log2 of the number of candidate words that would remain.

Note that if there are n candidate words with equal probability, then each word i has probability p_i = 1/n, and the entropy is -sum(p_i log2(p_i)) = -n * (1/n) * log2(1/n) = log2(n). Hence averaging log2(n) over the possible outcomes gives the average entropy.

"mean_entropy" is the criterion most often used in practice and is therefore the program's default. "max_n" can be seen as a pessimistic criterion since it optimizes against the worst case. "mean_n" may seem the most intuitive criterion but does not work as well as "mean_entropy", perhaps due to the skewed distribution of outcome sizes.
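To make the three criteria concrete, here is a minimal sketch (toy numbers, not the program's actual implementation): a guess partitions the remaining candidates into groups by the result pattern it would produce, and each criterion summarizes the group sizes.

```python
import math

def score_guess(group_sizes):
    """Score a guess from the sizes of the candidate groups induced by
    each distinct result pattern. Returns (max_n, mean_n, mean_entropy)."""
    total = sum(group_sizes)
    max_n = max(group_sizes)
    # A group of size k occurs with probability k/total and leaves k candidates.
    mean_n = sum(k * k for k in group_sizes) / total
    # A group of size k (uniform probability) contributes entropy log2(k).
    mean_entropy = sum(k * math.log2(k) for k in group_sizes) / total
    return max_n, mean_n, mean_entropy

# Toy example: a guess that splits 10 candidates into groups of 5, 3, and 2.
max_n, mean_n, mean_entropy = score_guess([5, 3, 2])
print(max_n, mean_n, round(mean_entropy, 3))  # → 5 3.8 1.836
```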

Using a custom answer set

  • By default, the program uses "vocab.txt" as the word candidate list, which is presumably compatible with the New York Times version.
  • You may supply a different list with the --vocabfile option. You should then also specify a --dbfile other than the default ("wordleai.db"), because the existing database would otherwise be overwritten.
  • The file should contain words of the same length, one per line (separated by "\n").
  • Although not tested, the program should work with words containing multibyte characters (UTF-8 encoded) or digits.
  • See vocab-examples/ folder for examples.
# Example
wordleai-sql --dbfile custom-wordle.db --vocabfile my-vocab.txt
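A custom vocabfile is plain text with one word per line. A quick sketch of building one by filtering a larger word list (the input words here are just placeholders):

```python
# Keep only 5-letter alphabetic words, lowercased and deduplicated.
raw_words = ["apple", "Mango", "crane", "slate", "ab", "tares", "sl8te"]
length = 5
vocab = sorted({w.lower() for w in raw_words if len(w) == length and w.isalpha()})
with open("my-vocab.txt", "w") as f:
    f.write("\n".join(vocab) + "\n")
```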

Play and challenge mode

  • wordleai-sql --play starts a self-play game.
  • wordleai-sql --challenge starts a competition against an AI.

Use Google BigQuery backend

  • Google BigQuery can be used as the backend SQL engine via the -bbq option.
  • You need to supply a credential JSON file for a GCP service account with the following permissions:
    bigquery.datasets.create
    bigquery.datasets.get
    bigquery.jobs.create
    bigquery.jobs.get
    bigquery.routines.create
    bigquery.routines.delete
    bigquery.routines.get
    bigquery.routines.update
    bigquery.tables.create
    bigquery.tables.delete
    bigquery.tables.get
    bigquery.tables.getData
    bigquery.tables.list
    bigquery.tables.update
    bigquery.tables.updateData
    
# minimal command to run with bigquery engine
# --vocabname is used as the dataset name
wordleai-sql -bbq --bq_credential "gcp_credentials.json" --vocabname "wordle_dataset"

Other options

See wordleai-sql -h for other options, which should mostly be self-explanatory.

Download files

Download the file for your platform.

Source Distribution

wordleaisql-0.1.4.tar.gz (55.6 kB)

Uploaded Source

Built Distribution

wordleaisql-0.1.4-py3-none-any.whl (56.0 kB)

Uploaded Python 3
