Skip to main content

Search a Git repository's full revision history (commit messages and diffs) for text patterns.

Project description

Poirot helps you investigate your repositories. Give him a set of clues (e.g. strings or regular expressions) and he will report back any place they appear in your repository’s revision history.

When used as a pre-commit hook, Poirot can warn you if you’re about to commit something you might not intend (think passwords, private keys, tokens, and other bits of sensitive or personally identifiable information).

Poirot began as a fork of CFPB’s fellow gumshoe, Clouseau.

Dependencies

  • git

  • Python 2.7 or 3.3+

  • a UNIX-based OS (e.g. Mac or Linux) or a UNIX-y shell on Windows (e.g. Cygwin, Babun, or Git-Bash). It will not work with the default Windows Command Prompt (cmd).

Poirot uses these Python packages:

  • Jinja2 to format its console output

  • tqdm to display a progress bar

  • regex to allow for POSIX ERE regular expressions

  • requests to read remote pattern files

Installation

Poirot is available on PyPi and can be installed with pip as:

pip install poirot

You may want to install it in a virtual environment, unless you plan on using Poirot in a global commit hook.

In that case, you will have to ensure that you have done a global pip install for any Python versions you are using. E.g., if you want to run it on Python 2.7, 3.3, and 3.5 installed, install Poirot as follows:

pip2.7 install poirot
pip3.3 install poirot
pip3.5 install poirot

Running Poirot from the Command Line

To invoke Poirot and see his findings, call him from the command line with poirot and the following optional arguments:

  • –url: The repository’s URL, e.g. https://github.com/DCgov/poirot.git or git@github.com:DCgov/poirot.git. When included, you will be given the choice to clone or pull from the remote URL. Default value: none.

  • –dir: The local path to your repository’s base directory or the directory you would like to clone or pull to. Default value: the current working directory.

  • –term: A single term or regular expression to search for. Default value: none.

  • –patterns: The path to a .txt file with strings or regular expression patterns, each on its own line. These can be the file’s URL or its relative or absolute local path. You can give a comma-separated list of pattern files, if you wish to include more than one. Default value: default.txt.

  • –staged: A flag, which when included, restricts search to staged revisions. This is helpful, along with --dir, as part of a pre-commit hook.

  • –revlist: A range of revisions to inspect. Default value: The last commit (i.e. HEAD^!) if --staged is not included, otherwise none.

  • –verbose: A flag to output verbose, colorful output and pattern-match highlighting. The GIF above gives an example with –verbose included.

  • –before: Date restriction on revisions. Default value: none.

  • –after: Date restriction on revisions. Default value: none.

  • –author: Authorship restriction on revisions. Default value: none.

  • –output: File to output results as JSON. Default value: none.

Examples

The most basic command Poirot will accept is:

poirot

That will search the current git directory’s last commit (i.e. HEAD^!) for the patterns in the default pattern file.

To specify one or more different patterns files (each separated by a comma), do this instead:

poirot --patterns='../path/to/thisisapatternfile.txt,/Users/myusername/anotherpatternfile.txt'

The –patterns option also allows files accessible over HTTP, like this one here:

poirot --patterns='https://raw.githubusercontent.com/DCgov/poirot/master/poirot/patterns/default.txt'

To search for a single term (like password):

poirot --term="password"

Say you want to search for password in the whole revision history of all branches. Then do:

poirot --term="password" --revlist="all"

You can further restrict the set of revisions Poirot looks through with the before, after, and author options (which correspond to the same flags in git). E.g.:

poirot --term="password" --revlist=40dc6d1...3e4c011 --before="2015-11-28" --after="2015-10-01" --author="me@poirot.com"

Perhaps you don’t have the repository available locally or you would like to update it from a remote URL. Just add the url to your command and it will allow you to clone or pull to the current folder.

poirot --url https://github.com/foo/baz.git --term="password"

You can also specify a different directory than the current one with dir. The following command will clone/pull to the folder thisotherfolder, which sits inside of the current directory. If it does not yet exist, it will be created.

poirot --url https://github.com/foo/baz.git --term="password" --dir="thisotherfolder"

To search changes that have been staged for commit, but not yet committed, use the staged flag:

poirot --term="password" --staged

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

poirot-1.0.1.tar.gz (17.2 kB view details)

Uploaded Source

Built Distribution

poirot-1.0.1-py2.py3-none-any.whl (20.9 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file poirot-1.0.1.tar.gz.

File metadata

  • Download URL: poirot-1.0.1.tar.gz
  • Upload date:
  • Size: 17.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for poirot-1.0.1.tar.gz
Algorithm Hash digest
SHA256 3437901a1e6368b367a2412c7c6f53c962cb2e941f22866bdc41eba2700c1fc3
MD5 dca4d74edbad5553beeb74c030b9d18d
BLAKE2b-256 041b50b77d35d438c0db30aef4404d3e82894bd980d2dba88774874e95c331aa

See more details on using hashes here.

File details

Details for the file poirot-1.0.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for poirot-1.0.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 fab31c089b99dabd4fd5bea7616a4475063455ae64c8b8b70d46888cebe7ae88
MD5 cd7f199889b4d186938bc4a56bee9b22
BLAKE2b-256 9c5bea5405ee6c03ef77fd06c281d20c57577e2da2615097a5f88817a6e3ca30

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page