Skip to main content

Search string from files recursively

Project description

findstring

This program, findstring, allows you to search for a specific string within files in a directory, including support for searching within PDF and DOCX files. It works similarly to the grep -rI command but adds the ability to read and search through PDF and DOCX files.

Usage

findstring [OPTIONS] search_string

Options

  • search_string: The string to search for in the files.

Optional Arguments

  • -b, --binary:
    Scan binary files as well. If this option is used, the tool will attempt to read binary files and search for the specified string.

  • -d, --directory:
    Root directory to start searching from. If not specified, the current directory (.) is used by default.

  • -l, --max_length:
    Maximum number of characters to be shown as a result. The default is 0, which means no limit is set.

  • -t, --text:
    Show the matched lines containing the search string in the output.

  • -v, --verbose:
    Enable verbose output. The program will provide more detailed information about its operation, including which directories and files are being checked.

Features

  • PDF Support:
    The program can search within PDF files using the pdfminer library. It extracts text from the PDF and searches for the specified string.

  • DOCX Support:
    DOCX files are also supported, with text extraction handled by the docx library.

  • Binary File Scanning:
    When the --binary flag is enabled, the program will attempt to read binary files and search for the specified string. This is done by decoding the binary content into ASCII.

  • Context Display:
    When the --text option is enabled, the tool will display the lines containing the search string, with maximum numbers of characters specified by the --max_length option.

Examples

Search for a string in the current directory

findstring "example_string"

Search for a string in a specific directory

findstring -d /path/to/directory "example_string"

Search in binary files and display matched files

findstring -b -t "example_string"

Search with verbose output

findstring -v "example_string"

Limit the length of the output context to 50 characters

findstring -l 50 "example_string"

Highlighting Search Results

The search script is configured to highlight matching search results in bold red text by default. This highlighting is controlled by the GREP_COLORS environment variable, specifically using the mt= option.

If you wish to change the color or style of the highlighted text, you can modify the mt= setting in the GREP_COLORS environment variable. The mt= value is a color code that specifies the style and color used for matching text.

For example:

  • Bold Red (default): mt=1;31
  • Bold Green: mt=1;32
  • Underline Blue: mt=4;34

To apply a custom color, you can set the GREP_COLORS environment variable in your shell as follows:

export GREP_COLORS='mt=1;32'

This example would change the highlighted text to bold green.

The script automatically detects if the output is directed to a terminal (TTY). If not, it will print the plain text without any colorization.

Error Handling

The program will attempt to handle errors such as unreadable files gracefully. If an error occurs while reading a file, the program will skip the file and continue processing the rest, optionally displaying an error message if verbose mode is enabled.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

findstring-0.3.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

findstring-0.3-py3-none-any.whl (6.3 kB view details)

Uploaded Python 3

File details

Details for the file findstring-0.3.tar.gz.

File metadata

  • Download URL: findstring-0.3.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.9

File hashes

Hashes for findstring-0.3.tar.gz
Algorithm Hash digest
SHA256 85c06309c37cfc1ebdaea93361a12d49f6dcd711a1ba149c7dc04b6fdd2f4721
MD5 0ed19a4c7ce4f3905dd02d2a69c7fe42
BLAKE2b-256 e3ea92754f9aa844ebc31962cf73c287ba35950f7a6d407c8123ee556dea70be

See more details on using hashes here.

File details

Details for the file findstring-0.3-py3-none-any.whl.

File metadata

  • Download URL: findstring-0.3-py3-none-any.whl
  • Upload date:
  • Size: 6.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.9

File hashes

Hashes for findstring-0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6b7a19a58af2076417fa5786deb6564125cad8e8942d8946a793a30580ce9fe8
MD5 08c9b035f2a4999cc4d6f7376cd44fc3
BLAKE2b-256 43b4a8cdd645eaddea964822ce1f76cac9e28e10d1d06117ac1c9420e148a7c3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page