mft to parquet (pyarrow dtypes)
Project description
mft to parquet (pyarrow dtypes)
Tested against Windows 10 / Python 3.11 / Anaconda
pip install mft2parquet
Reads HDD (Hard Disk Drive) information from a specified drive and returns it as a pandas DataFrame.
Args:
drive (str, optional): The drive path to read from. Default is "c:\\".
outputfile (str, optional): If provided, the DataFrame will be saved as a Parquet file at this path.
Default is None.
Returns:
pd.DataFrame: A DataFrame with pyarrow dtypes containing HDD information with the specified columns.
Raises:
subprocess.CalledProcessError: If the external command fails to execute.
Note:
- This function uses an external command-line utility https://github.com/githubrobbi/Ultra-Fast-File-Search to retrieve HDD information.
- The DataFrame will have the following columns:
- aa_path
- aa_name
- aa_path_only
- aa_size
- aa_size_on_disk
- aa_created
- aa_last_written
- aa_last_accessed
- aa_descendents
- aa_read-only
- aa_archive
- aa_system
- aa_hidden
- aa_offline
- aa_not_content_indexed_file
- aa_no_scrub_file
- aa_integrity
- aa_pinned
- aa_unpinned
- aa_directory_flag
- aa_compressed
- aa_encrypted
- aa_sparse
- aa_reparse
- aa_attributes
Example:
df = read_hdd(drive="d:\\", outputfile="hdd_info.parquet")
# Reads HDD information from the 'D:' drive and saves it as 'hdd_info.parquet'.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
mft2parquet-0.11.tar.gz
(679.2 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
mft2parquet-0.11-py3-none-any.whl
(681.5 kB
view details)
File details
Details for the file mft2parquet-0.11.tar.gz.
File metadata
- Download URL: mft2parquet-0.11.tar.gz
- Upload date:
- Size: 679.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
38eb1b4fea7f7397809d3cf1dba04afee50a6151bff7b5e9c3d3e02222147761
|
|
| MD5 |
c235d79266314ab7661d0449f653e9c5
|
|
| BLAKE2b-256 |
89803ad5d0ae0df238fcc0108cb5531012047741a1f0a6aec23862f6989e16f7
|
File details
Details for the file mft2parquet-0.11-py3-none-any.whl.
File metadata
- Download URL: mft2parquet-0.11-py3-none-any.whl
- Upload date:
- Size: 681.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b7bcad862194241311b99cc0e571a03ebebdf0493d164cbef0358096ce718ecb
|
|
| MD5 |
c4ba995b68b1a2dc9bb87c9691f33a4e
|
|
| BLAKE2b-256 |
5b611ce7c4389039c4ff723171dc32e0c0c996f34fa3a4a0a30dfd8717308516
|