A command-line interface for creating and interacting with Distant Reader data sets (a.k.a. study carrels)
Project description
Distant Reader Toolbox
A command-line interface for creating and interacting with Distant Reader study carrels
Installation
pip install reader-toolbox
Quick start
# configure; accept the default
rdr set -s local
# add an item to your library
rdr download homer
# read homer
rdr read homer
# list all words
rdr ngrams homer
# list all bigrams
rdr ngrams homer -s 2
# list all bigrams and count them
rdr ngrams homer -s 2 -c
# search
rdr concordance homer
# search again, but specify a query
rdr concordance homer -q war
# list subject-verb-object fragments; please be patient
rdr grammars homer
# list noun phrases
rdr grammars homer -g nouns
# cluster; do the items in the carrel group themselves?
rdr cluster homer
# topic model; similar to cluster but with more detail
rdr tm homer
# page through additional carrels for downloading
rdr catalog -l remote -h
# download another carrel
rdr download pride
# download yet another carrel
rdr download sonnets
# list your carrels
rdr catalog
Description and background
The Reader Toolbox -- run from the command-line as rdr
-- is designed to create and interact with Distant Reader study carrels. Using the Toolbox you can do things such as but not limited to:
- search and browse the collection of more than 3,000 publicly available study carrels
- download study carrels from the public collection and add them to your own collection
- count & tabulate the most frequent ngrams (one-word, two-word, etc. phrases) occurring in study carrels
- apply concordancing (keyword-in-context searching) against study carrels
- apply topic modeling (extracting latent themes) against study carrels
- extract information from your study carrels matching specific grammars
- create your own study carrels
- and more
In the end, the Toolbox empowers you to read, use, and understand large volumes of text quickly and easily.
Links
- download: https://pypi.org/project/reader-toolbox
- documentation: https://reader-toolbox.readthedocs.io
- source code: https://github.com/ericleasemorgan/reader-toolbox
- bug tracker: https://github.com/ericleasemorgan/reader-toolbox/issues
Eric Lease Morgan <emorgan@nd.edu>
January 5, 2023
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file reader_toolbox-0.9.tar.gz
.
File metadata
- Download URL: reader_toolbox-0.9.tar.gz
- Upload date:
- Size: 65.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5e99c946168076f86c39d378ee010f2979e9e9364dd0be19888b4183b05edad6 |
|
MD5 | 4cbae329b560c7c7d17c463927c83571 |
|
BLAKE2b-256 | 21b34db81cb49f29feccb2337ada109969216b230faf41cfb3cbd1a1233a6b38 |
File details
Details for the file reader_toolbox-0.9-py3-none-any.whl
.
File metadata
- Download URL: reader_toolbox-0.9-py3-none-any.whl
- Upload date:
- Size: 65.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | dc67363ea0cb5094bf0deb0f908d67c0522ab8ecff24b55509e162ea24fcf39c |
|
MD5 | 9802ae4fa6f9ef22c6f8091d5886e629 |
|
BLAKE2b-256 | f153896549b7cd2378ef1e7e65606e9cee3473438291afd3b682d7ad40fc4f6a |