Skip to main content
Help the Python Software Foundation raise $60,000 USD by December 31st!  Building the PSF Q4 Fundraiser

A reading text analysis tool

Project description

Text analysis tool written in Python 3.

Features

Drat is a tool that analyzes reading texts and produces a brief report which gives a readability score (according to the Dale-Chall readability formula) and the number of uncommon words (based on the General Service List) in the text. It also lists all of these uncommon words (if you choose the verbose option).

Drat supports docx, odt, ods, odp files as well as text files and urls.

You can also analyze multiple texts / links by creating a text file with the filenames / links written in it.

Use

drat [-h] [-v] [-w wordlist-file] file-or-url

  • multiple files, or urls, can be analyzed
  • the files can be text files, docx files, or odf files
  • a list of urls, or files, written in a text file can be redirected to drat: e.g., drat < list-of-urls.txt
  • -w use an additional wordlist(s)
  • This wordlist needs to be a text file with each word on a separate line
  • More than one wordlist can be used

For example, the following command will analyze an office (Writer) document called ‘NorwegianBlue.odt’:

drat NorwegianBlue.odt

The -w option will let you filter out additional words:

drat -w euphemisms_death.txt NorwegianBlue.odt

And this command will download the following url and analyze the text in it:

drat url_you_want_analyzed.html

It is also possible to analyze several files in one command:

drat text_file.txt word_document.docx open_office_file.odt

The following command uses a text file with links in it (one link per line) and parses the links in it:

drat < list-of-urls.txt

Adding the -v option to any of the above commands will give you a list of the uncommon words.

Todo

  • Improve html parsing.
  • More support for analyzing the complexity of texts.
  • Support other languages.

Dependencies

Python3 and requests.

Name

Drat is a recursive acronym which either stands for Drat is a Reading Analysis Tool or Drat is a Rabid Assessor of Texts.

Author

This program has been developed by David Whitlock.

License

Drat is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for drat, version 0.1.7
Filename, size File type Python version Upload date Hashes
Filename, size drat-0.1.7.tar.gz (47.2 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page