libarchive-c

Python interface to libarchive

Project description

A Python interface to libarchive. It uses the standard ctypes module to dynamically load and access the C library.

Installation

pip install libarchive-c

Compatibility

Python

python-libarchive-c is currently tested with Python 3.12 and 3.13.

If you find an incompatibility with older versions you can send us a small patch, but we won’t accept big changes.

libarchive

python-libarchive-c may not work properly with obsolete versions of libarchive, such as the ones included in macOS. In that case you can install a recent version of libarchive (e.g. with brew install libarchive on macOS) and use the LIBARCHIVE environment variable to point python-libarchive-c to it:

export LIBARCHIVE=/usr/local/Cellar/libarchive/3.3.3/lib/libarchive.13.dylib
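To check which library path would be picked up before importing libarchive, the lookup can be sketched in Python. This is a minimal sketch, assuming that LIBARCHIVE takes precedence and that the fallback is the platform search done by ctypes.util.find_library; the helper name resolve_libarchive_path is hypothetical and is not part of python-libarchive-c.

```python
import os
from ctypes.util import find_library


def resolve_libarchive_path(environ=os.environ):
    """Return the libarchive path that would be tried, or None if none is found.

    Hypothetical helper: it assumes the LIBARCHIVE variable wins over the
    platform's default library search.
    """
    return environ.get('LIBARCHIVE') or find_library('archive')
```

If you set os.environ['LIBARCHIVE'] from Python instead of the shell, it should be set before the first import libarchive, since ctypes loads the C library when it is first accessed.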

Usage

Import:

import libarchive

Extracting archives

To extract an archive, use the extract_file function:

import os

os.chdir('/path/to/target/directory')
libarchive.extract_file('test.zip')

Alternatively, the extract_memory function can be used to extract from a buffer, and extract_fd from a file descriptor.

The extract_* functions all have an integer flags argument which is passed directly to the C function archive_write_disk_set_options(). You can import the EXTRACT_* constants from the libarchive.extract module and see the official description of each flag in the archive_write_disk(3) man page.

By default, when the flags argument is None, the SECURE_NODOTDOT, SECURE_NOABSOLUTEPATHS and SECURE_SYMLINKS flags are passed to libarchive, unless the current directory is the root (/).
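Because extraction happens in the current directory, a small wrapper that changes directory and then restores it can keep the rest of a program unaffected. The following is a minimal sketch; the name extract_into and its extract parameter are hypothetical (the parameter exists only so the helper can be exercised with a stand-in function), and it assumes extract_file accepts the flags value as its second argument.

```python
import os


def extract_into(archive_path, target_dir, flags=None, extract=None):
    """Extract archive_path into target_dir, then restore the working directory."""
    if extract is None:
        # Imported lazily so the helper can be tried with a stand-in extractor.
        import libarchive
        extract = libarchive.extract_file
    # Resolve the archive path before changing directory.
    archive_path = os.path.abspath(archive_path)
    previous = os.getcwd()
    os.makedirs(target_dir, exist_ok=True)
    os.chdir(target_dir)
    try:
        # flags=None leaves the choice of flags to the library's defaults.
        extract(archive_path, flags)
    finally:
        os.chdir(previous)
```

To pass explicit flags, combine EXTRACT_* constants from libarchive.extract with the | operator. Since the secure defaults are described as applying when flags is None, an explicit value presumably needs to include any SECURE_* flags you still want.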

Reading archives

To read an archive, use the file_reader function:

with libarchive.file_reader('test.7z') as archive:
    for entry in archive:
        for block in entry.get_blocks():
            ...

Alternatively, the memory_reader function can be used to read from a buffer, fd_reader from a file descriptor, stream_reader from a stream object (which must support the standard readinto method), and custom_reader from anywhere using callbacks.

To learn about the attributes of the entry object, see the libarchive/entry.py source code or run help(libarchive.entry.ArchiveEntry) in a Python shell.
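As an illustration of combining entry attributes with get_blocks(), the sketch below computes a SHA-256 digest for each regular file. It assumes the entry attributes isfile, pathname and size; the function names digest_entries and print_digests are hypothetical. Values are copied out during iteration rather than keeping references to the entry objects.

```python
import hashlib


def digest_entries(entries):
    """Yield (pathname, size, sha256 hex digest) for each regular file entry."""
    for entry in entries:
        if not entry.isfile:
            continue  # skip directories, symbolic links, etc.
        sha256 = hashlib.sha256()
        for block in entry.get_blocks():
            sha256.update(block)
        yield entry.pathname, entry.size, sha256.hexdigest()


def print_digests(archive_path):
    """Print one line per regular file found in the archive."""
    import libarchive
    with libarchive.file_reader(archive_path) as archive:
        for pathname, size, digest in digest_entries(archive):
            print(digest, size, pathname)
```

Calling print_digests('test.7z') would list the regular files of the archive used in the example above.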

Displaying progress

If your program processes large archives, you can keep track of its progress with the bytes_read attribute. Here’s an example of a progress bar using tqdm:

import os

from tqdm import tqdm

with tqdm(total=os.stat(archive_path).st_size, unit='bytes') as pbar, \
     libarchive.file_reader(archive_path) as archive:
    for entry in archive:
        ...
        pbar.update(archive.bytes_read - pbar.n)
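If you would rather not depend on tqdm, the same bytes_read attribute can drive any callback. This is a minimal sketch; iter_with_progress and its report parameter are hypothetical names, and the only thing it relies on from the archive object is iteration and bytes_read.

```python
def iter_with_progress(archive, total_size, report):
    """Yield entries from archive, calling report(fraction) after each one."""
    for entry in archive:
        yield entry
        # Execution resumes here once the caller is done with the entry.
        fraction = archive.bytes_read / total_size if total_size else 1.0
        report(min(fraction, 1.0))
```

It would be used as for entry in iter_with_progress(archive, os.stat(archive_path).st_size, callback), where callback receives a value between 0 and 1.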

Creating archives

To create an archive, use the file_writer function:

from libarchive.entry import FileType

with libarchive.file_writer('test.tar.gz', 'ustar', 'gzip') as archive:
    # Add the `libarchive/` directory and everything in it (recursively),
    # then the `README.rst` file.
    archive.add_files('libarchive/', 'README.rst')
    # Add a regular file defined from scratch.
    data = b'foobar'
    archive.add_file_from_memory('../escape-test', len(data), data)
    # Add a directory defined from scratch.
    early_epoch = (42, 42)  # 1970-01-01 00:00:42.000000042
    archive.add_file_from_memory(
        'metadata-test', 0, b'',
        filetype=FileType.DIRECTORY, permission=0o755, uid=4242, gid=4242,
        atime=early_epoch, mtime=early_epoch, ctime=early_epoch, birthtime=early_epoch,
    )

Alternatively, the memory_writer function can be used to write to a memory buffer, fd_writer to a file descriptor, and custom_writer to a callback function.

For each of those functions, the mandatory second argument is the archive format, and the optional third argument is the compression format (called “filter” in libarchive). The acceptable values are listed in libarchive.ffi.WRITE_FORMATS and libarchive.ffi.WRITE_FILTERS.
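One way to choose the second and third arguments is to derive them from the output file name. The table below is a hypothetical sketch, not part of python-libarchive-c: the suffix mapping and the name writer_args are assumptions, and the format and filter names should be checked against libarchive.ffi.WRITE_FORMATS and libarchive.ffi.WRITE_FILTERS on your installation.

```python
# Hypothetical suffix table mapping file names to (format, filter) pairs.
SUFFIXES = {
    '.tar.gz': ('ustar', 'gzip'),
    '.tar.bz2': ('ustar', 'bzip2'),
    '.tar.xz': ('ustar', 'xz'),
    '.tar': ('ustar', None),  # no compression filter
    '.zip': ('zip', None),
}


def writer_args(filename):
    """Return the (format, filter) pair to pass after the file name."""
    for suffix, names in SUFFIXES.items():
        if filename.endswith(suffix):
            return names
    raise ValueError('no archive format known for %r' % filename)
```

The result can then be unpacked into the call, as in libarchive.file_writer(name, *writer_args(name)).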

Symbolic links

By default, libarchive preserves symbolic links. If you want it to resolve the links and archive the files they point to instead, pass symlink_mode='logical' when calling the add_files method. If you do that, an ArchiveError exception will be raised when a symbolic link points to a nonexistent file.

File metadata codecs

By default, UTF-8 is used to read and write file attributes from and to archives. A different codec can be specified through the header_codec argument of the *_reader and *_writer functions. Example:

with libarchive.file_writer('test.tar', 'ustar', header_codec='cp037') as archive:
    ...
with libarchive.file_reader('test.tar', header_codec='cp037') as archive:
    ...

In addition to file paths (pathname and linkpath), the specified codec is used to encode and decode user and group names (uname and gname).
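The reason the reader and the writer need the same codec can be seen with Python's own str.encode and bytes.decode. The sketch below does not call libarchive; header_bytes is a hypothetical helper that only illustrates how one name maps to different bytes under UTF-8 and cp037.

```python
def header_bytes(name, header_codec='utf-8'):
    """Encode a path, user name or group name with the given codec."""
    return name.encode(header_codec)


name = 'README.rst'
ebcdic = header_bytes(name, 'cp037')

# The same name produces different bytes under each codec.
assert ebcdic != header_bytes(name)

# Decoding with the codec used for writing restores the name.
assert ebcdic.decode('cp037') == name

# Decoding with the wrong codec does not: here UTF-8 rejects the bytes.
try:
    ebcdic.decode('utf-8')
    mismatch_detected = False
except UnicodeDecodeError:
    mismatch_detected = True
```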

License

CC0 Public Domain Dedication

Release history

5.3 (this version): May 22, 2025
5.2: Mar 14, 2025
5.1: Mar 5, 2024
5.0: Jul 4, 2023
4.0: Jan 22, 2022
3.2: Nov 26, 2021
3.1: Jun 1, 2021
3.0: May 25, 2021
2.9: Oct 20, 2019
2.8: Jun 10, 2018
2.7: Dec 17, 2016
2.6: Nov 29, 2016
2.5: Jul 23, 2016
2.4: May 29, 2016
2.3: Mar 9, 2016
2.2: Nov 24, 2015
2.1: May 28, 2015
2.0: Apr 28, 2015
1.1: Feb 6, 2015
1.0: Aug 24, 2014

Download files

Source Distribution

libarchive_c-5.3.tar.gz (54.3 kB)

Uploaded May 22, 2025 Source

Built Distribution

libarchive_c-5.3-py3-none-any.whl (17.0 kB)

Uploaded May 22, 2025 Python 3

File details

Details for the file libarchive_c-5.3.tar.gz.

File metadata

Download URL: libarchive_c-5.3.tar.gz
Upload date: May 22, 2025
Size: 54.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for libarchive_c-5.3.tar.gz
Algorithm	Hash digest
SHA256	`5ddb42f1a245c927e7686545da77159859d5d4c6d00163c59daff4df314dae82`
MD5	`0f6b2d96b936c9b7f26dfec3255f7aaf`
BLAKE2b-256	`2623e72434d5457c24113e0c22605cbf7dd806a2561294a335047f5aa8ddc1ca`

File details

Details for the file libarchive_c-5.3-py3-none-any.whl.

File metadata

Download URL: libarchive_c-5.3-py3-none-any.whl
Upload date: May 22, 2025
Size: 17.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for libarchive_c-5.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`651550a6ec39266b78f81414140a1e04776c935e72dfc70f1d7c8e0a3672ffba`
MD5	`5b1c5d0770f23ca56c430660735eb237`
BLAKE2b-256	`883fff00c588ebd7eae46a9d6223389f5ae28a3af4b6d975c0f2a6d86b1342b9`
