dirstree

Another library for iterating through the contents of a directory

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

pomponchik

These details have not been verified by PyPI

Project description

ⓘ

logo

There are many libraries for traversing directories. You can also do this using the standard library. What makes this library different:

💎 Beautiful, laconic syntax.
⚗️ Filtering by file extensions, text patterns in .gitignore format, and using custom callables.
🐍 Natively works with both Path objects from the standard library and strings.
❌ Support for cancellation tokens.
👯‍♂️ Combining multiple crawling methods in one object.

Installation
Basic usage
Applying a function to each path
Filtering
Working with Cancellation Tokens
Combination
Transactionality

Installation

You can install dirstree with pip:

pip install dirstree

You can also use instld to quickly try out this package and others without installing them.

Basic usage

The library is easy to use:

Create a crawler object, passing the path to the base directory and, if necessary, additional arguments.
Iterate through it.

The simplest example would look like this:

from dirstree import Crawler

crawler = Crawler('.')

for file in crawler:
    print(file)

↑ This recursively prints all files in the current directory, including files in nested directories. At each iteration, we get a new Path object.

Applying a function to each path

If you just want to run a function for each file the crawler finds, you don't have to write the loop yourself — every crawler has an apply() method:

Crawler('src', exclude=['tests/**']).apply(print)

↑ This will print the entire contents of the directory, except for the excluded locations.

ⓘ All of the crawler's settings are respected, exactly as they would be during normal iteration.

Filtering

By default, crawlers iterate over files only. If you need every filesystem entity found under the base directory, pass only_files=False:

crawler = Crawler('.', only_files=False)

Iterating through the files in the directory, you may not want to view all files, but only files of a certain type. To do this, ignore all other files. How to do it? There are three ways:

Bypass only files with the specified extensions, such as .txt, .doc, or .py.
Bypass files whose paths follow a specific text pattern.
Use an arbitrary function to determine whether you need each specific path or not.

To select a specific method, you need to pass a specific parameter when creating the crawler object. Of course, all the methods can be combined with each other.

To set the file extensions you are interested in, use the extensions parameter:

crawler = Crawler('.', extensions=['.txt'])  # Iterate only on .txt files.

ⓘ The extensions parameter is available only in the default file-only mode, so it cannot be combined with only_files=False.

Also, if you only need Python files, you can use a special class to bypass them only, without specifying extensions:

from dirstree import PythonCrawler

crawler = PythonCrawler('.')  # Iterate only on .py files.

ⓘ PythonCrawler is always file-only.

To specify which files and directories you do NOT want to iterate over, use the exclude parameter:

crawler = Crawler('.', exclude=['.git', 'venv'])  # Exclude ".git" and "venv" directories.

↑ Please note that we use the .gitignore format here.

If you need a universal way to filter out unnecessary paths, pass your function as the filter parameter:

crawler = Crawler('.', filter=lambda path: len(str(path)) == 7)  # Iterate only on paths that are 7 characters long.

Working with Cancellation Tokens

You can set an arbitrary condition under which file traversal will stop using cancellation tokens from the cantok library.

There are two ways to do this ↓

If you use the crawler as a one-time object for a single iteration, set the token when creating it:

for path in Crawler('.', token=TimeoutToken(0.0001)): # Limit the iteration time to 0.0001 seconds.
    print(path)

If you plan to use the crawler object several times, use the go() method for iteration and pass a new token to it every time:

crawler = Crawler('.')

for path in crawler.go(token=TimeoutToken(0.0001)): # Limit the iteration time to 0.0001 seconds.
    print(path)

↑ Follow these rules to avoid accidentally "baking" an expired token inside a crawler object.

Combination

You can combine multiple crawler objects into one using the usual addition operator, like this:

for path in Crawler('../dirstree') + Crawler('../cantok'):
    print(path)

↑ The paths that you will iterate over will be automatically deduplicated.

↑ You can also impose arbitrary restrictions on each of the summed objects, all of them will be taken into account.

You can also pass multiple paths to a single crawler object:

for path in Crawler('../dirstree', '../cantok'):
    print(path)

↑ In this case, there is no deduplication of paths.

Transactionality

If you plan to modify the directory while iterating over it — for example, deleting or moving files inside an apply() callback — pass freeze=True to take a snapshot of every matching path up front, then iterate that snapshot instead of the live filesystem:

Crawler('path/to/directory', freeze=True).apply(lambda p: p.unlink())

↑ The snapshot is built on the first step of iteration, with every filter and cancellation token already applied. After that, any creation, renaming or deletion happening in the directory does not affect what is yielded — each call to go() or iter() produces its own fresh snapshot.

↑ Without freeze=True the order of yielded paths depends on the live state of the filesystem, so mid-iteration mutation may silently skip or duplicate entries.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

pomponchik

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.0.12

May 31, 2026

This version

0.0.11

May 31, 2026

0.0.10

May 29, 2026

0.0.9

May 29, 2026

0.0.8

May 29, 2026

0.0.7

May 28, 2026

0.0.6

May 21, 2026

0.0.5

Mar 17, 2026

0.0.4

Feb 13, 2026

0.0.3

Nov 13, 2025

0.0.2

Oct 7, 2025

0.0.1

Sep 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dirstree-0.0.11.tar.gz (29.1 kB view details)

Uploaded May 31, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

dirstree-0.0.11-py3-none-any.whl (9.6 kB view details)

Uploaded May 31, 2026 Python 3

File details

Details for the file dirstree-0.0.11.tar.gz.

File metadata

Download URL: dirstree-0.0.11.tar.gz
Upload date: May 31, 2026
Size: 29.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dirstree-0.0.11.tar.gz
Algorithm	Hash digest
SHA256	`49dfe6adc9b2e080593ad7df6e2b32ed11b79a489c618922525838698cb636ae`
MD5	`66fe7292cdbfd5f95c04aa630b3dc636`
BLAKE2b-256	`8ec2df7ff1a869ca2686e2a42bf52f1f2ffcd539a1da00a3f3d80090b6879cbc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for dirstree-0.0.11.tar.gz:

Publisher: release.yml on mutating/dirstree

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dirstree-0.0.11.tar.gz
- Subject digest: 49dfe6adc9b2e080593ad7df6e2b32ed11b79a489c618922525838698cb636ae
- Sigstore transparency entry: 1676995653
- Sigstore integration time: May 31, 2026
Source repository:
- Permalink: mutating/dirstree@60f0819c15a3bde5844ac5dc3be738b4b5d21861
- Branch / Tag: refs/heads/main
- Owner: https://github.com/mutating
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@60f0819c15a3bde5844ac5dc3be738b4b5d21861
- Trigger Event: push

File details

Details for the file dirstree-0.0.11-py3-none-any.whl.

File metadata

Download URL: dirstree-0.0.11-py3-none-any.whl
Upload date: May 31, 2026
Size: 9.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dirstree-0.0.11-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3a6383bb1fc1c6689e639529d44c55a8c146fa62b097f92880985763c3a0a268`
MD5	`807a9c4949a4a4bd942a5a5267de26ab`
BLAKE2b-256	`96a7c3023b3c298e5d6a5e7760a2beb7eaf5acd6f03dfeb9dfc7d91c3c87535a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for dirstree-0.0.11-py3-none-any.whl:

Publisher: release.yml on mutating/dirstree

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dirstree-0.0.11-py3-none-any.whl
- Subject digest: 3a6383bb1fc1c6689e639529d44c55a8c146fa62b097f92880985763c3a0a268
- Sigstore transparency entry: 1676995659
- Sigstore integration time: May 31, 2026
Source repository:
- Permalink: mutating/dirstree@60f0819c15a3bde5844ac5dc3be738b4b5d21861
- Branch / Tag: refs/heads/main
- Owner: https://github.com/mutating
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@60f0819c15a3bde5844ac5dc3be738b4b5d21861
- Trigger Event: push

dirstree 0.0.11

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Table of contents

Installation

Basic usage

Applying a function to each path

Filtering

Working with Cancellation Tokens

Combination

Transactionality

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance