Skip to main content

A handy tool to trash your metadata

Project description

 _____ _____ _____ ___
|     |  _  |_   _|_  |  Keep your data,
| | | |     | | | |  _|     trash your meta!
|_|_|_|__|__| |_| |___|

This software is currently in beta, please don't use it for anything critical.

Metadata and privacy

Metadata consist of information that characterizes data. Metadata are used to provide documentation for data products. In essence, metadata answer who, what, when, where, why, and how about every facet of the data that are being documented.

Metadata within a file can tell a lot about you. Cameras record data about when a picture was taken and what camera was used. Office documents like PDF or Office automatically adds author and company information to documents and spreadsheets. Maybe you don't want to disclose those information on the web.

This is precisely the job of MAT2: getting rid, as much as possible, of metadata.

Requirements

  • python3-mutagen for audio support
  • python3-gi-cairo and gir1.2-poppler-0.18 for PDF support
  • gir1.2-gdkpixbuf-2.0 for images support
  • FFmpeg, optionally, for video support
  • libimage-exiftool-perl for everything else

Please note that MAT2 requires at least Python3.5, meaning that it doesn't run on Debian Jessie.

Running the test suite

$ python3 -m unittest discover -v

How to use MAT2

usage: mat2 [-h] [-v] [-l] [--check-dependencies] [-V]
            [--unknown-members policy] [-s | -L]
            [files [files ...]]

Metadata anonymisation toolkit 2

positional arguments:
  files                 the files to process

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit
  -l, --list            list all supported fileformats
  --check-dependencies  check if MAT2 has all the dependencies it needs
  -V, --verbose         show more verbose status information
  --unknown-members policy
                        how to handle unknown members of archive-style files
                        (policy should be one of: abort, omit, keep)
  -s, --show            list harmful metadata detectable by MAT2 without
                        removing them
  -L, --lightweight     remove SOME metadata

Note that MAT2 will not clean files in-place, but will produce, for example, with a file named "myfile.png" a cleaned version named "myfile.cleaned.png".

Notes about detecting metadata

While MAT2 is doing its very best to display metadata when the --show flag is passed, it doesn't mean that a file is clean from any metadata if MAT2 doesn't show any. There is no reliable way to detect every single possible metadata for complex file formats.

This is why you shouldn't rely on metadata's presence to decide if your file must be cleaned or not.

Related software

Contact

If possible, use the issues system or the mailing list Should a more private contact be needed (eg. for reporting security issues), you can email Julien (jvoisin) Voisin at julien.voisin+mat2@dustri.org, using the gpg key 9FCDEE9E1A381F311EA62A7404D041E8171901CC.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU Lesser General Public License along with this program. If not, see http://www.gnu.org/licenses/.

Copyright 2018 Julien (jvoisin) Voisin julien.voisin+mat2@dustri.org Copyright 2016 Marie Rose for MAT2's logo

Thanks

MAT2 wouldn't exist without:

Many thanks to them!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mat2-0.6.0.tar.gz (20.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mat2-0.6.0-py3-none-any.whl (26.2 kB view details)

Uploaded Python 3

File details

Details for the file mat2-0.6.0.tar.gz.

File metadata

  • Download URL: mat2-0.6.0.tar.gz
  • Upload date:
  • Size: 20.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for mat2-0.6.0.tar.gz
Algorithm Hash digest
SHA256 4042fd68b0f4ce2c5b9a41e5680d58976e1519e13472d484ef7d6b0ce119560e
MD5 bc6f052533818df5d4b861eb598abba4
BLAKE2b-256 4c4e901038bcb1f3a9fcab9b675cb25a8a4859716ece1e7fa3641c1c386ad41c

See more details on using hashes here.

File details

Details for the file mat2-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: mat2-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 26.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for mat2-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7d753db842027f7d8089ea3e6e03951ed174bd806bbfec646a73732a55216634
MD5 371bbc915aab67d322a0307907599712
BLAKE2b-256 0506b5357df17d677cb6f7f92c2728505d5b764067767383f8c8b79bf99c9bc0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page