Skip to main content

Version of the glob module that can capture patterns and supports recursive wildcards

Project description

This is an extended version of Python’s builtin glob module (http://docs.python.org/library/glob.html) which adds:

  • The ability to capture the text matched by glob patterns, and return those matches alongside the filenames.

  • A recursive ‘**’ globbing syntax, akin for example to the globstar option of the bash shell.

  • The ability to replace the filesystem functions used, in order to glob on virtual filesystems.

  • Compatible with Python 2 and Python 3 (tested with 3.3).

Examples

Matches being returned:

import glob2

for filename, (version,) in glob2.iglob('./binaries/project-*.zip', with_matches=True):
    print version

Recursive glob:

>>> import glob2
>>> all_header_files = glob2.glob('src/**/*.h')
['src/fs.h', 'src/media/mp3.h', 'src/media/mp3/frame.h', ...]

Note that ** must appear on it’s own as a directory element to have its special meaning. **h will not have the desired effect.

** will match “.”, so **/*.py returns Python files in the current directory. If this is not wanted, */**/*.py should be used instead.

Custom Globber:

from glob2 import Globber

class VirtualStorageGlobber(Globber):
    def __init__(self, storage):
        self.storage = storage
    def listdir(self, path):
        # Must raise os.error if path is not a directory
        return self.storage.listdir(path)
    def exists(self, path):
        return self.storage.exists(path)
    def isdir(self, path):
        # Used only for trailing slash syntax (``foo/``).
        return self.storage.isdir(path)
    def islink(self, path):
        # Used only for recursive glob (``**``).
        return self.storage.islink(path)

globber = VirtualStorageGlobber(sftp_storage)
globber.glob('/var/www/**/*.js')

If isdir and/or islink cannot be implemented for a storage, you can make them return a fixed value, with the following consequences:

  • If isdir returns True, a glob expression ending with a slash will return all items, even non-directories, if it returns False, the same glob expression will return nothing.

  • Return islink True, the recursive globbing syntax ** will follow all links. If you return False, it will not work at all.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glob2-0.4.tar.gz (9.5 kB view details)

Uploaded Source

File details

Details for the file glob2-0.4.tar.gz.

File metadata

  • Download URL: glob2-0.4.tar.gz
  • Upload date:
  • Size: 9.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for glob2-0.4.tar.gz
Algorithm Hash digest
SHA256 cf9a89f299a171e9f95e99b1dbe2ed963780a8496e3c0abed0b137a64359dc41
MD5 e9aa9ad7ec070af910d5df5df9b48201
BLAKE2b-256 765ed219714134046888fa66dfe5f2f9aa2b61ee6acdcbdfc309a16f518f4879

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page