Skip to main content

mft to parquet (pyarrow dtypes)

Project description

mft to parquet (pyarrow dtypes)

Tested against Windows 10 / Python 3.11 / Anaconda

pip install mft2parquet

Reads HDD (Hard Disk Drive) information from a specified drive and returns it as a pandas DataFrame.

Args:
drive (str, optional): The drive path to read from. Default is "c:\\".
outputfile (str, optional): If provided, the DataFrame will be saved as a Parquet file at this path.
					  Default is None.

Returns:
pd.DataFrame: A DataFrame with pyarrow dtypes containing HDD information with the specified columns.

Raises:
subprocess.CalledProcessError: If the external command fails to execute.

Note:
- This function uses an external command-line utility https://github.com/githubrobbi/Ultra-Fast-File-Search to retrieve HDD information.
- The DataFrame will have the following columns:
- aa_path
- aa_name
- aa_path_only
- aa_size
- aa_size_on_disk
- aa_created
- aa_last_written
- aa_last_accessed
- aa_descendents
- aa_read-only
- aa_archive
- aa_system
- aa_hidden
- aa_offline
- aa_not_content_indexed_file
- aa_no_scrub_file
- aa_integrity
- aa_pinned
- aa_unpinned
- aa_directory_flag
- aa_compressed
- aa_encrypted
- aa_sparse
- aa_reparse
- aa_attributes

Example:
df = read_hdd(drive="d:\\", outputfile="hdd_info.parquet")
# Reads HDD information from the 'D:' drive and saves it as 'hdd_info.parquet'.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mft2parquet-0.11.tar.gz (679.2 kB view details)

Uploaded Source

Built Distribution

mft2parquet-0.11-py3-none-any.whl (681.5 kB view details)

Uploaded Python 3

File details

Details for the file mft2parquet-0.11.tar.gz.

File metadata

  • Download URL: mft2parquet-0.11.tar.gz
  • Upload date:
  • Size: 679.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for mft2parquet-0.11.tar.gz
Algorithm Hash digest
SHA256 38eb1b4fea7f7397809d3cf1dba04afee50a6151bff7b5e9c3d3e02222147761
MD5 c235d79266314ab7661d0449f653e9c5
BLAKE2b-256 89803ad5d0ae0df238fcc0108cb5531012047741a1f0a6aec23862f6989e16f7

See more details on using hashes here.

File details

Details for the file mft2parquet-0.11-py3-none-any.whl.

File metadata

  • Download URL: mft2parquet-0.11-py3-none-any.whl
  • Upload date:
  • Size: 681.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for mft2parquet-0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 b7bcad862194241311b99cc0e571a03ebebdf0493d164cbef0358096ce718ecb
MD5 c4ba995b68b1a2dc9bb87c9691f33a4e
BLAKE2b-256 5b611ce7c4389039c4ff723171dc32e0c0c996f34fa3a4a0a30dfd8717308516

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page