Skip to main content

Uses os.walk to find and catalogue a given file.

Project description

As stated in the short description this project simply uses os.walk to traverse a filesystem, starting in the users home directory, looking for a given file. It was created for internal use as there are several of us in the office running python scripts which are located on a onedrive and each time a new machine was initialised onedrive would have wildly different paths. Originally we were just using a simple function at the start of each script which would return the onedrive path but as different parts of the path started changing with new devices I realised this would quickly become an incredibly intricate and inneficient solution so I wrote this to help solve it. Published here for easier distribution around the office.

use:

import pathconf

pathconf.find('filename.filetype')

Alternatively you can give it a partial directory.

pathconf.find('folder/filename.filetype')

or

pathconf.find('foldera/folderb/filename.filetype')

If no filetype is specified and the folder param is not passed, eg:

pathconf.find('filename')	

Then the search will be done without taking file extension into account. If a file name has a . in it, like filename.filename.txt, it should still work. The workaround was to take the part after the . as an extension only if it is 5 characters or less. This however causes problems with files such as file.name.txt as if you do not specify the extension .name will be taken as the extension instead. This issue exists only in the defining of the search term as when searching for files I just looked for a second . and split there.

This functionality is available but not recommended, you're better off just giving the extension.

Passing the optional folder= parameter will search for a folder instead of a file.

pathconf.find('folderName', folder=True)

Passing the optional starting_dir= parameter will change the starting directory from the users home directory.

pathconf.find('filename.filetype', starting_dir='C:\\')
# for windows

#or

pathconf.find('filename.filetype', starting_dir='/')
# for unix

Will start the search from the specified directory, in the example above this is the root directory.

This will create a .file_paths json file in ~/.config/pathconf/ if it doesn't already exist and then will search for your file. Once found it will add that file to the pathconf file for quicker access in future runs. If the file does already exist it will lookup your desired file and if it exists in the pathconf.json it will then check that the file exists where stated. If the file exists where the pathconf.json says it does then it will use that file, if the file doesn't exist there or the file doesn't appear in the pathconf.json then it will search for the file and append it to the pathconf.json.

Other functions: remove(), reset(), get_paths(), list_paths(), and index().

pathconf.list_paths()

Will list all key and value pairs in the pathconf.json.

pathconf.get_paths()	

Will return a dictionary of paths held in the pathconf.json.

pathconf.reset()

Will reset the pathconf.json to a blank file.

pathconf.remove('filename.filetype')

Will remove the entry for filename.filetype from the pathconf.json.

pathconf.index('PathToIndex')

Will return a dictionary of all files and their full paths in that directory as well as writing that to its own json with the lowest level folder as the name. For example:

pathconf.index('/home/joe/Documents')

Will return a dictionary of files within /home/joe/Documents/ and their full paths. This dictionary is also written to Documents.json in ~/.config/pathconf/

Optional arguments for index() are: depth (int): index() Will only search folders up to this depth, so for depth=3 it will index 3 down from the starting directory. exceptions (list): You can pass a list of folders to be ignored when indexing. Note: if passed with folders=True then the folders in exceptions will not be returned or added to the .json folders (bool): If True then index will return a dictionary of folders int the given directory.

Installation:

Easiest way to install is using pip

pip install pathconf

Current issues:

Haven't yet run into but understand that this will always take the first match it finds, if there are files with the same name and type elsewhere in your home directory then it will use them if found first, even if they're not the desired file. Current attempts to work around this are a default argument of deprecated=False, which will skip the search of any folder containing deprecated in the name (since most of our duplicates exist in 'deprecated' folders, this can be overridden by passing deprecated=True), the other attempt for a workaround was the the ability to give partial directories as Documents/file is a little more precise than file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pathconf-1.0.10.5.tar.gz (5.9 kB view details)

Uploaded Source

Built Distribution

pathconf-1.0.10.5-py3-none-any.whl (5.9 kB view details)

Uploaded Python 3

File details

Details for the file pathconf-1.0.10.5.tar.gz.

File metadata

  • Download URL: pathconf-1.0.10.5.tar.gz
  • Upload date:
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.8

File hashes

Hashes for pathconf-1.0.10.5.tar.gz
Algorithm Hash digest
SHA256 1b9b7766b109a55125695edab22aff78fa6ee1ae9a1e2fc2709cda31605cc7c3
MD5 9979d9cd77bd47f728ff0928a9fe3476
BLAKE2b-256 f79aa5e4156efb7cb8218f9ddd56ef825a65832fb089abd107547ff0210503fd

See more details on using hashes here.

File details

Details for the file pathconf-1.0.10.5-py3-none-any.whl.

File metadata

  • Download URL: pathconf-1.0.10.5-py3-none-any.whl
  • Upload date:
  • Size: 5.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.8

File hashes

Hashes for pathconf-1.0.10.5-py3-none-any.whl
Algorithm Hash digest
SHA256 66eb1da91d2ae1559b712fe424390954b33dbe602ed3431c5aa4f910f9a96b6c
MD5 8ccd44223a5a640347b03946c5be4112
BLAKE2b-256 47d43e4b74de78aaeee21976c4a09b0b272a71e2880d081e140070abc91b6a11

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page