
A user-space filesystem that lets you hide files from the mounted view using glob patterns


PassthroughSupportExcludeGlobFS: A Union Filesystem with Glob Pattern Exclusion


PassthroughSupportExcludeGlobFS is a user-space filesystem (FUSE) written in Python that provides union mount functionality with glob pattern exclusion. It merges the contents of two directories (a lower "root" directory and an upper "cache" directory) into one view, while keeping files or folders that match the given glob patterns out of the root directory.

Key Features

  • Union Mount: Combine the contents of two directories into a single, unified view.
  • Glob Pattern Exclusion: Fine-grained control over which files and directories are kept in the root directory and which are redirected to the cache directory.
  • Cross-Platform: Works on Linux and Windows. It should also work on macOS in theory, but this is untested.
  • Easy to Use: Simple CLI interface and Python API for integration into your projects.

Use Cases

  • Configuration Management: Overlay custom configurations on top of default settings.
  • Data Versioning: Track changes to files and directories by maintaining a separate "cache" directory.
  • Selective Syncing: Synchronize only specific files or folders between two locations.
  • Sandboxing: Isolate applications or processes by redirecting specific files or directories to a controlled environment.
  • Development Workflows: Manage different versions of code or assets by merging and excluding specific files.

Installation

Prerequisites

  • Python 3.6 or higher
  • FUSE library:
    • Linux: Install the fuse package using your distribution's package manager (e.g., apt-get install fuse on Debian/Ubuntu).
    • macOS: Install macFUSE (formerly OSXFUSE).
    • Windows: Install WinFsp.

Installing PassthroughSupportExcludeGlobFS

pip install passthrough-support-excludeglob-fs

Usage

Command-Line Interface

passthrough_support_excludeglob_fs <mountpoint> -o root=<root_directory>,[options]

Options:

  • root=<root_directory>: The path to the lower directory (required). Use \ to escape , and =.
  • patterns=<pattern1:pattern2:patternN>: A colon-separated list of glob patterns to exclude from root. All files and directories matching these patterns will be stored in the cache directory (default none). Use \ to escape :.
  • cache_dir=<cache_directory>: The path to the upper directory (defaults to a cache directory within the user's cache folder). Use \ to escape , and =.
  • uid=<user_id>: The user ID to own the mounted filesystem (defaults to the current user).
  • gid=<group_id>: The group ID to own the mounted filesystem (defaults to the current group).
  • foreground=<True|False>: Run PassthroughSupportExcludeGlobFS in the foreground (default true).
  • nothreads=<True|False>: Disable multi-threading (default True, because multi-threaded operation is untested).
  • overwrite_rename_dest=<True|False>: When renaming, if True, overwrite the destination file if it already exists. If False, the rename operation will fail if the destination file already exists. The default behavior is False on Windows and True on Linux and macOS.
  • debug=<True|False>: Enable logging. Default is False. It must be enabled for the log_in_file, log_in_console and log_in_syslog options to take effect. It is independent of the fusedebug option. Be careful: it can generate a lot of log output.
    • log_in_syslog=<True|False>: Log to the system log. Default is False. On Windows, the program must run as an administrator for the log to be visible in the Windows Event Viewer. This option is not recommended on Windows because it can saturate the Windows system log.
    • log_in_file=<log_file_path|None>: Log to a file instead of the console. Default is None which means no log file.
    • log_in_console=<True|False>: Log to the console. Default is True.
  • fusedebug=<True|False>: Enable native FUSE debugging. Default is False. It is independent of the debug, log_in_file, log_in_console and log_in_syslog options and always prints to the console. Be careful: it can also generate a lot of log output.

Example:

passthrough_support_excludeglob_fs /mnt/union -o root=/path/to/lower,patterns='**/*.log:**/*.tmp'

This command mounts a union filesystem at /mnt/union, merging the contents of /path/to/lower with a cache directory. Because cache_dir is not given, the default cache directory inside the user's cache folder is used. All files matching the patterns **/*.log and **/*.tmp are kept out of the lower directory and stored in that cache directory instead.
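When the paths or patterns contain the separator characters mentioned in the option list, the -o string has to be escaped. The following is a minimal sketch of how such a string could be assembled from Python. The helper names escape_value and build_mount_options are hypothetical and not part of the package. The sketch only applies the escaping rules stated above (\ before , and = in root and cache_dir, \ before : in patterns); how a literal backslash or a comma inside a pattern should be written is not specified in this document, so the sketch leaves them untouched.

```python
def escape_value(value, specials):
    """Backslash-escape every character of `value` that appears in `specials`."""
    return "".join("\\" + ch if ch in specials else ch for ch in value)


def build_mount_options(root, patterns=(), cache_dir=None):
    """Return the comma-separated string to pass after -o (hypothetical helper)."""
    parts = ["root=" + escape_value(root, ",=")]
    if patterns:
        # Escape ':' inside each pattern, then join the patterns with ':'.
        joined = ":".join(escape_value(p, ":") for p in patterns)
        parts.append("patterns=" + joined)
    if cache_dir is not None:
        parts.append("cache_dir=" + escape_value(cache_dir, ",="))
    return ",".join(parts)


print(build_mount_options("/path/to/lower", ["**/*.log", "**/*.tmp"]))
```

For the example command above, this prints root=/path/to/lower,patterns=**/*.log:**/*.tmp, which still needs shell quoting when passed on the command line.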

Python API

from passthrough_support_excludeglob_fs import start_passthrough_fs

# Start the filesystem
start_passthrough_fs(mountpoint='/mnt/union', root='/path/to/lower', patterns=['**/*.log', '**/*.tmp'], cache_dir='/path/to/cache')

Glob Pattern Syntax

PassthroughSupportExcludeGlobFS uses the globmatch library for glob pattern matching. The following wildcards are supported:

  • *: Matches any number of characters (including zero).
  • ?: Matches any single character.
  • [abc]: Matches any character within the brackets.
  • [a-z]: Matches any character within the range.
  • {a,b,c}: Matches any of the patterns within the braces.
  • **: Matches any number of directories recursively.
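To get a feel for these wildcards, here is a small illustrative matcher that translates a glob pattern into a regular expression. It is not the globmatch implementation and may differ from it in edge cases. It assumes / as the path separator, that * and ? do not cross directory boundaries, and that braces are not nested. The function names glob_to_regex and matches are made up for this sketch.

```python
import re


def glob_to_regex(pattern):
    """Translate a glob pattern into a regular expression source string."""
    out = []
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if pattern.startswith("**/", i):
            out.append("(?:[^/]+/)*")  # zero or more whole directory levels
            i += 3
        elif pattern.startswith("**", i):
            out.append(".*")  # anything, including '/'
            i += 2
        elif ch == "*":
            out.append("[^/]*")  # any run of characters within one path segment
            i += 1
        elif ch == "?":
            out.append("[^/]")  # exactly one character within a segment
            i += 1
        elif ch == "[" and pattern.find("]", i + 2) != -1:
            end = pattern.find("]", i + 2)
            body = pattern[i + 1:end]
            if body.startswith("!"):
                body = "^" + body[1:]  # glob negation becomes regex negation
            out.append("[" + body + "]")
            i = end + 1
        elif ch == "{" and pattern.find("}", i + 1) != -1:
            end = pattern.find("}", i + 1)
            alternatives = pattern[i + 1:end].split(",")
            out.append("(?:" + "|".join(glob_to_regex(a) for a in alternatives) + ")")
            i = end + 1
        else:
            out.append(re.escape(ch))  # every other character matches itself
            i += 1
    return "".join(out)


def matches(path, pattern):
    """Return True if the whole relative path matches the glob pattern."""
    return re.fullmatch(glob_to_regex(pattern), path) is not None
```

With this sketch, matches("var/log/app.log", "**/*.log") is true, while matches("var/app.log", "*.log") is false because a single * stays within one path segment.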

Contributing

Contributions are welcome! Please see the CONTRIBUTING.md file for guidelines.

License

PassthroughSupportExcludeGlobFS is licensed under the CC BY-NC-ND 4.0 License. See the LICENCE file for details.

FAQ

Q: What sets passthrough_support_excludeglob_fs apart from other union filesystems like UnionFS, OverlayFS, and mergerfs?

A: passthrough_support_excludeglob_fs combines union mount capabilities with glob pattern exclusion. This lets you merge directories while controlling precisely which files and folders stay in the root directory and which are redirected elsewhere. It is particularly useful for selective syncing and selective mounting, which other union filesystems might not offer.

Q: How do I unmount the filesystem?

A: On Linux, you can use the fusermount -u <mountpoint> command. On macOS, where fusermount is usually not available, umount <mountpoint> is the usual equivalent. On Windows, you can use the "Unmount" option in the WinFsp context menu for the mountpoint.

Q: Can I use multiple glob patterns for exclusion?

A: Yes, you can specify multiple glob patterns separated by colons (:) in the patterns option.
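As an illustration of the colon-separated format, the sketch below splits a patterns value into individual patterns while honoring the \: escape described in the option list. The function split_patterns is hypothetical and is not taken from the package. It also does not handle an escaped backslash directly followed by a colon, since that case is not described here.

```python
import re
from typing import List


def split_patterns(value: str) -> List[str]:
    """Split a patterns option value on ':' characters that are not escaped."""
    # The lookbehind skips any ':' that is directly preceded by a backslash.
    parts = re.split(r"(?<!\\):", value)
    # Drop empty entries and turn the escaped '\:' back into a plain ':'.
    return [p.replace("\\:", ":") for p in parts if p]
```

For example, split_patterns("**/*.log:**/*.tmp") yields the two patterns **/*.log and **/*.tmp.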

Q: What happens if a file exists in both the root and cache directories?

A: The most recent file takes precedence.
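A rough sketch of such a rule follows. It assumes that "most recent" means the newer modification time (mtime), which this document does not state explicitly. The function pick_most_recent is a hypothetical name; on a tie it returns the root copy, which is an arbitrary choice made for the sketch.

```python
import os


def pick_most_recent(root_path, cache_path):
    """Return whichever existing path has the newer mtime, or None if neither exists."""
    candidates = [p for p in (root_path, cache_path) if os.path.lexists(p)]
    if not candidates:
        return None
    # lstat() is used so that a symbolic link is judged by its own timestamp.
    return max(candidates, key=lambda p: os.lstat(p).st_mtime)
```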

Q: What happens if an excluded file already exists in the root directory?

A: The file is moved automatically to the cache directory on first access.

Q: Can I exclude entire directories?

A: Yes, you can use glob patterns that match directory names, such as **/logs to exclude the entire logs directory and its contents.

Q: Is PassthroughSupportExcludeGlobFS compatible with symbolic links?

A: Yes, PassthroughSupportExcludeGlobFS supports symbolic links. However, links with relative targets may behave unexpectedly. Note that mklink is untested on Windows.

Q: Can I use PassthroughSupportExcludeGlobFS in a production environment?

A: PassthroughSupportExcludeGlobFS is intended for testing and development purposes. While it is stable, it may not be suitable for production use.

Q: Why are the CLI options weird?

A: The CLI options are designed to be consistent with the mount options. This allows you to use PassthroughSupportExcludeGlobFS with existing FUSE tools and libraries.

Q: Why is it slow to access files?

A: The first access to a misplaced file triggers a move to the right directory. This move can be slow for large files or directories. Subsequent accesses should be fast.
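The idea can be sketched as follows. This is an assumption about the general approach, not the package's actual code: a file is treated as misplaced when it matches an exclusion pattern but lives in the root directory, or when it does not match but lives in the cache directory. The function relocate_if_misplaced and its is_excluded callback are hypothetical names.

```python
import os
import shutil


def relocate_if_misplaced(rel_path, root, cache_dir, is_excluded):
    """Move rel_path to the tree where it belongs and return its final path."""
    if is_excluded(rel_path):
        right, wrong = cache_dir, root  # excluded files belong in the cache
    else:
        right, wrong = root, cache_dir  # everything else belongs in root
    src = os.path.join(wrong, rel_path)
    dst = os.path.join(right, rel_path)
    if os.path.lexists(src) and not os.path.lexists(dst):
        os.makedirs(os.path.dirname(dst), exist_ok=True)
        # shutil.move falls back to copy-and-delete across devices,
        # which is where the cost for large files comes from.
        shutil.move(src, dst)
    return dst
```

After the first call has moved the file, later calls find nothing at the source path and return immediately, which matches the fast subsequent accesses described above.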

Q: What is the default cache directory?

A: The default cache directory is a subdirectory within the user's cache folder. On Linux and macOS, this is typically ~/.cache/passthrough-support-excludeglob-fs. On Windows, it is %LOCALAPPDATA%\passthrough-support-excludeglob-fs. Inside that folder, the subdirectory is named after the Base64-encoded root directory path.

For example, if the root directory is /home/user/doc, the cache directory will be ~/.cache/passthrough-support-excludeglob-fs/L2hvbWUvdXNlci9kb2M=.
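The directory name in this example can be reproduced with a few lines of Python. This is a sketch that assumes the standard Base64 alphabet and UTF-8 encoding of the path. The function default_cache_dir and the constant APP_DIR are illustrative names, not part of the package API.

```python
import base64
import posixpath

APP_DIR = "passthrough-support-excludeglob-fs"


def default_cache_dir(root, user_cache="~/.cache"):
    """Build the Linux-style default cache path for a given root directory."""
    encoded = base64.b64encode(root.encode("utf-8")).decode("ascii")
    return posixpath.join(user_cache, APP_DIR, encoded)


print(default_cache_dir("/home/user/doc"))
# prints ~/.cache/passthrough-support-excludeglob-fs/L2hvbWUvdXNlci9kb2M=
```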

Q: What are common bugs?

A: Commonly known bugs:

  • File timestamps (ctime, atime, mtime) are sometimes updated even if the file is not accessed or modified. This can happen during the first access of a misplaced file.
  • The filesystem is not thread safe. It is recommended to keep the nothreads option set to True.
  • Symbolic links are not tested on Windows and can be buggy.
  • Exclusion glob patterns must never be relative to the root directory. It is recommended to always prefix them with **/.
  • The rename operation is implemented internally as a copy followed by a delete, so it can be slow for large files or directories. It is implemented this way to mitigate the non-deterministic order of operations. For example, the kernel or FUSE may reorder the operations and block the rename operation.
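To make the cost of the last point concrete, here is a minimal sketch of a copy-and-delete rename for regular files, including the behavior of the overwrite_rename_dest option described earlier. It is only an illustration under these assumptions. The function rename_by_copy is a hypothetical name, and the real implementation, which also has to deal with directories, may differ.

```python
import os
import shutil


def rename_by_copy(src, dst, overwrite_rename_dest=True):
    """Rename a regular file by copying it to dst and then deleting src."""
    if os.path.lexists(dst):
        if not overwrite_rename_dest:
            # Mirrors the documented behavior when overwriting is disabled.
            raise FileExistsError(dst)
        os.remove(dst)
    # copy2 reads and rewrites the whole file (and copies its metadata),
    # so the time taken grows with the file size.
    shutil.copy2(src, dst)
    os.remove(src)
```

A plain os.rename within one filesystem only updates directory entries, which is why the copy-based approach is noticeably slower for large files.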

Q: How can I contribute to PassthroughSupportExcludeGlobFS?

A: We welcome contributions! Please see our CONTRIBUTING.md file for guidelines on reporting issues, submitting pull requests, and contributing to the project.

Q: Where can I get help or ask questions?

A: You can open an issue on our GitHub repository.

File details

Hashes for passthrough_support_excludeglob_fs-1.16.tar.gz (source distribution)
Algorithm Hash digest
SHA256 d86082f73c18008d5d6c64d07915ed223b42acc1152b0e1945c73973b9c87289
MD5 1c8ba078433bde99b0096086fabb319b
BLAKE2b-256 0c71c82fda585ca875990535929197542792bbce98f08186980c4c000e14b50a

Hashes for passthrough_support_excludeglob_fs-1.16-py3-none-any.whl (built distribution)
Algorithm Hash digest
SHA256 1a9819bf91706ee6fad466a1a7910e12d31783346b4c951cb8273ca09dac4eb5
MD5 39d80ca281b02982375495366bec3de7
BLAKE2b-256 c79f7736eefe2dcc1be4681120bd511b3a5e2cedb7f0a72fde39d8d1a33eac4e
