File type identification using libmagic
Project description
mimetypes-magic
mimetypes-magic is a Python interface to the libmagic file type
identification library. libmagic identifies file types by checking
their headers according to a predefined list of file types. This
functionality is exposed to the command line by the Unix command
file
.
Usage
>>> import magic
>>> magic.from_file("testdata/test.pdf")
'PDF document, version 1.2'
# recommend using at least the first 2048 bytes, as less can produce incorrect identification
>>> magic.from_buffer(open("testdata/test.pdf", "rb").read(2048))
'PDF document, version 1.2'
>>> magic.from_file("testdata/test.pdf", mime=True)
'application/pdf'
You can also use it this way:
>>> from magic import detect_from_filename
>>> detect_from_filename("setup.py")
FileMagic(mime_type='text/x-script.python', encoding='us-ascii', name='Python script, ASCII text executable', extension='.py')
>>>
If you want to see the file extension by name only (it is not necessary for the file to exist in the directory), the option below will be useful:
>>> from magic import mime_type
>>>
>>> mime_type("Example.mp4")
'video/mp4'
>>>
There is also a Magic
class that provides more direct control,
including overriding the magic database file and turning on character
encoding detection. This is not recommended for general use. In
particular, it's not safe for sharing across multiple threads and
will fail throw if this is attempted.
>>> f = magic.Magic(uncompress=True)
>>> f.from_file('testdata/test.gz')
'ASCII text (gzip compressed data, was "test", last modified: Sat Jun 28
21:32:52 2008, from Unix)'
You can also combine the flag options:
>>> f = magic.Magic(mime=True, uncompress=True)
>>> f.from_file('testdata/test.gz')
'text/plain'
Installation
The current stable version of mimetypes-magic is available on PyPI and
can be installed by running pip install mimetypes-magic
.
Other sources:
- PyPI: http://pypi.python.org/pypi/mimetypes-magic/
- GitHub: https://github.com/ElissonRodrigues/mimetypes-magic
This module is a simple wrapper around the libmagic C library, and that must be installed as well:
Debian/Ubuntu
sudo apt-get install libmagic1
Windows
You'll need DLLs for libmagic. @julian-r maintains a pypi package with the DLLs, you can fetch it with:
pip install python-magic-bin
OSX
- When using Homebrew:
brew install libmagic
- When using macports:
port install file
Troubleshooting
-
'MagicException: could not find any magic files!': some installations of libmagic do not correctly point to their magic database file. Try specifying the path to the file explicitly in the constructor:
magic.Magic(magic_file="path_to_magic_file")
. -
'WindowsError: [Error 193] %1 is not a valid Win32 application': Attempting to run the 32-bit libmagic DLL in a 64-bit build of python will fail with this error. Here are 64-bit builds of libmagic for windows: https://github.com/pidydx/libmagicwin64. Newer version can be found here: https://github.com/nscaife/file-windows.
-
'WindowsError: exception: access violation writing 0x00000000 ' This may indicate you are mixing Windows Python and Cygwin Python. Make sure your libmagic and python builds are consistent.
Bug Reports
mimetypes-magic is a thin layer over the libmagic C library. Historically, most bugs that have been reported against mimetypes-magic are actually bugs in libmagic; libmagic bugs can be reported on their tracker here: https://bugs.astron.com/my_view_page.php. If you're not sure where the bug lies feel free to file an issue on GitHub and I can triage it.
Running the tests
To run the tests across a variety of linux distributions (depends on Docker):
./test_docker.sh
To run tests locally across all available python versions:
./test/run.py
To run against a specific python version:
LC_ALL=en_US.UTF-8 python3 test/test.py
libmagic python API compatibility
The python bindings shipped with libmagic use a module name that conflicts with this package. To work around this, mimetypes-magic includes a compatibility layer for the libmagic API. See COMPAT.md for a guide to libmagic / mimetypes-magic compatibility.
Versioning
Minor version bumps should be backwards compatible. Major bumps are not.
Author
Written by Adam Hupp in 2001 for a project that never got off the ground. It originally used SWIG for the C library bindings, but switched to ctypes once that was part of the python standard library.
You can contact me via my website or GitHub.
License
mimetypes-magic is distributed under the MIT license. See the included LICENSE file for details.
I am providing code in the repository to you under an open source license. Because this is my personal repository, the license you receive to my code is from me and not my employer (Facebook).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file mimetypes-magic-0.4.30.tar.gz
.
File metadata
- Download URL: mimetypes-magic-0.4.30.tar.gz
- Upload date:
- Size: 16.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.7.1 requests/2.26.0 setuptools/57.4.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.10.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ca78fc718a7ea5b886b5380e3a4189d479abf496c7f618b0c21e1cf2140ce3d3 |
|
MD5 | f86c31bb6d6637d97cac1add630d4f00 |
|
BLAKE2b-256 | 4217f1d9e65a3808cd83b85d0665264fbe1a8db5ba7faf283c1888a7468dc854 |