An extension to tarfile to allow adding compressed-on-the-fly files to a tarfile, allowing files too large to fit into memory or data that is generated on the fly. Note that the output file object must support "seek()", hence must be a regular uncompressed tar file.
Project description
Version: 1.0
Summary
This module provides an extension to the standard tarfile.TarFile class which provides the ability to add files to a TarFile that are compressed on-the-fly. It will only work on an uncompressed output tarfile, since after the data is written it will overwrite the header for the file with the correct size data.
Limitations
The object to which the tarfile is being written must support “seek()”, so this cannot work over a socket, nor presumably with a compressed tarfile. Note: re-compressing contents is not very useful.
The “close_gz_file” method must be called when the data is finished, and calling “close” on the GzipStream object is not sufficient.
Example Usage
#!/usr/bin/env python3
import os, sys, shutil
import targzstream
# USAGE: ./foo.py TARFILE INPUT [ INPUT2 ... ]
# Eg: ./foo.py myoutput.tar *.cpp *.h
with targzstream.TarFile(sys.argv[1], mode='w') as tarball:
for fname in sys.argv[2:]:
st = os.stat(fname)
obj = tarball.add_gz_file(name=fname + '.gz', mtime=st.st_mtime,
uid=st.st_uid, gid=st.st_gid, mode=st.st_mode)
# Copy the data.
with open(fname, 'rb') as fin:
shutil.copyfileobj(fin, obj)
# REMEMBER: close_gz_file() is required
tarball.close_gz_file()
# The end.
TODO
Wrap add_gz_file and close_gz_file as a context manager, allowing simply:
with tarball.gzstream(name=fname + '.gz', mtime=mtime, ...) as obj: with open(fname, 'rb') as fin: shutil.copyfileobj(fin, obj)
Have add_gz_file handle the result of an os.stat. Eg:
with tarball.gz_file(name=fname + '.gz', stat=os.stat(fname)) as obj: with open(fname, 'rb') as fin: shutil.copyfileobj(fin, obj)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.