Skip to main content

Ad removal tool for PDFs.

Project description

Gulag Cleaner

Twitter Instagram Ko-fi

Gulag Cleaner is a tool designed to remove advertisements from PDFs, making it easier to read and navigate documents without being disrupted by unwanted ads.

This tool does not just crop the ads out of the PDF, instead, we extract the original file without ads by manipulating the internal structure of the PDF, ensuring maximum quality.

In addition to removing advertisements, Gulag Cleaner is also capable of extracting metadata, such as the author, subject, university, and more, from the file.

Web Version

This tool can be used without installation directly from our website (in Spanish).

Gulag Cleaner webpage

Installation

To install Gulag Cleaner, please download and install Python and then run the following command in your terminal:

pip install gulagcleaner

Usage

Gulag Cleaner can be used through both a Command Line Interface (CLI) and in your code.

Command Line Interface

To use Gulag Cleaner through the CLI, simply run the following command, replacing <filename> with the name of one or more PDF files or folders containing PDF:

gulagcleaner [-r] [-s] [-n] [-h] [-v] <filename>...

Options

Gulag Cleaner provides several options for its usage:

  • '-r': Replace the original file with the cleaned version.
  • '-s': Do not show metadata about cleaned files.
  • '-n': Force the naive cleaning method.
  • '-h': Display the help message, providing information on how to use Gulag Cleaner.
  • '-v': Display the current version of Gulag Cleaner.

Code

To use Gulag Cleaner in your code, you can use the following code snippet:

from gulagcleaner.extract import clean_pdf

return_msg = clean_pdf_path("input.pdf","output.pdf")

License

Gulag Cleaner is distributed under the GPL-3 license, which means it's open-source and free to use.

Contributing

We're always looking for ways to improve Gulag Cleaner, and we welcome contributions from the community. If you have ideas for improvements or bug fixes, please feel free to submit a pull request.

TODO

If you want to help, these are the top priorities right now:

  • Write tests for the package.
  • Add README.md (With code examples) for the rust and JS distributions.
  • Add comments to a lot of the rust code.
  • Optimize the rust code for performance improvements.
  • Add a new "clean_pdf_bytes()" function in python that does not require a file path, just the bytes.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

gulagcleaner-0.16.4-cp313-cp313-win_amd64.whl (410.5 kB view details)

Uploaded CPython 3.13Windows x86-64

gulagcleaner-0.16.4-cp313-cp313-musllinux_1_2_x86_64.whl (671.8 kB view details)

Uploaded CPython 3.13musllinux: musl 1.2+ x86-64

gulagcleaner-0.16.4-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (606.9 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

gulagcleaner-0.16.4-cp313-cp313-macosx_11_0_arm64.whl (522.2 kB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

gulagcleaner-0.16.4-cp312-cp312-win_amd64.whl (410.7 kB view details)

Uploaded CPython 3.12Windows x86-64

gulagcleaner-0.16.4-cp312-cp312-musllinux_1_2_x86_64.whl (672.5 kB view details)

Uploaded CPython 3.12musllinux: musl 1.2+ x86-64

gulagcleaner-0.16.4-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (608.0 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

gulagcleaner-0.16.4-cp312-cp312-macosx_11_0_arm64.whl (522.4 kB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

gulagcleaner-0.16.4-cp311-cp311-win_amd64.whl (409.2 kB view details)

Uploaded CPython 3.11Windows x86-64

gulagcleaner-0.16.4-cp311-cp311-musllinux_1_2_x86_64.whl (670.7 kB view details)

Uploaded CPython 3.11musllinux: musl 1.2+ x86-64

gulagcleaner-0.16.4-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (605.8 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

gulagcleaner-0.16.4-cp311-cp311-macosx_11_0_arm64.whl (524.3 kB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

gulagcleaner-0.16.4-cp310-cp310-win_amd64.whl (409.1 kB view details)

Uploaded CPython 3.10Windows x86-64

gulagcleaner-0.16.4-cp310-cp310-musllinux_1_2_x86_64.whl (670.7 kB view details)

Uploaded CPython 3.10musllinux: musl 1.2+ x86-64

gulagcleaner-0.16.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (605.8 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

gulagcleaner-0.16.4-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl (633.1 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ i686

gulagcleaner-0.16.4-cp310-cp310-macosx_11_0_arm64.whl (524.6 kB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

gulagcleaner-0.16.4-cp39-cp39-win_amd64.whl (409.5 kB view details)

Uploaded CPython 3.9Windows x86-64

gulagcleaner-0.16.4-cp39-cp39-musllinux_1_2_x86_64.whl (671.1 kB view details)

Uploaded CPython 3.9musllinux: musl 1.2+ x86-64

gulagcleaner-0.16.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (606.2 kB view details)

Uploaded CPython 3.9manylinux: glibc 2.17+ x86-64

gulagcleaner-0.16.4-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl (633.4 kB view details)

Uploaded CPython 3.9manylinux: glibc 2.17+ i686

gulagcleaner-0.16.4-cp39-cp39-macosx_11_0_arm64.whl (525.0 kB view details)

Uploaded CPython 3.9macOS 11.0+ ARM64

File details

Details for the file gulagcleaner-0.16.4-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 8d9aa83f159e13d651cdbe48208b1f9b60b8dc53a59d14494dd678d1d95e7e0d
MD5 621725792e918c457000a66ad7890b8f
BLAKE2b-256 dda6a5ce6798209d9b8a2571604b297e21901c0078210b3f6e5f3658e695f882

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp313-cp313-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp313-cp313-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 b1d46e4db1c3637e647b2abacfd749f885ac99fd97ba24dc4cab29dd135a150a
MD5 7eb4a289c0f6f39ca6a4cfe4c6065602
BLAKE2b-256 78c2b71fd6f1da27dca1580789a3fd2cf7307e09c9df34cc8c7e4599271b6cb0

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 3737dd08142100149863f1ad41c3620d9df6153b65b288151675ae4d7cddaf6b
MD5 28f8efa56e9fdd301c369008d2977314
BLAKE2b-256 cb8bcfc1dc1e321e8fdf28076b5e8a892bddffabbabae25b88682268a805e279

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 7b52c7a80b34db9ea118b846751d0cb0a1f6b8e9159cda25f97924e41cd1f420
MD5 6b99b2f6d0ba9448c98c7944480f5c46
BLAKE2b-256 f61af6f865f2b647645e45f5ec0c8a5c3a1860724b18e7316caabb53d312da2e

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 6d46a913b5490c9670a2e2c9dd97e0f2819ea31ecaa592eca6b7a4e989d01e4b
MD5 f0a47c2219d58de97b5659c9b375679f
BLAKE2b-256 27883d718a17c4f8a4e969b90d1f136f29eef8af5b442c6658a8ea04f597c2a3

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp312-cp312-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp312-cp312-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 0e73227d0d6a62cdced9b64cf68897325f7f30dc79a7e6b31c4e205bd2fec1ad
MD5 712750ade72393fde3c1a8008b2a3078
BLAKE2b-256 fdab9f059807b247975d2ef16f55762d547987cb847e30c6256e2a9908f28a09

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 2ba68709ddd686a9f6eed114d418782acaae0f43d7c1cf5b87c78c3edb0820c6
MD5 2d37480366908dde45060116d30af11d
BLAKE2b-256 9396644eb0ea6b93c2bdf30c41e3f623c093f9fa281a3bd4e256dd2167598d84

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 aa2fbc27e1943d967b9a4e793db6eff4fa07b40f41e5a03fd7757a789dec1cfe
MD5 26bddb210e1048fd9e10c5852ca33f63
BLAKE2b-256 44cf3198b5e9ce6fa46c85dc8fe305d76a6572fceb22ede506fcfcb12eeb9cff

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 35df72be1e201da7907114b247599305a747af51eb281605e796a802f0aa203b
MD5 1db44052867ffa4ecbf09104c4beb4df
BLAKE2b-256 2d47b8615215476e99b791a4d15d9004acebeb8a1074fc5c54d346623d016251

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp311-cp311-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 679c23df4e3120c7dd2cf2084de5b20e723c7fddb76354f2660841b384578ff5
MD5 5804db7d96b7946e9039409ab526a6c7
BLAKE2b-256 86e653dc70c1a7fdf5481728faf72f58050bbf43f22dcabe30576be7242a8e23

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 e78f000f4fd15cc4beff5d4c9fea43f1486a82cc7a81e7b195be4689e49bfe2d
MD5 68f18f68f4f8b748023afda83b2a6641
BLAKE2b-256 2fa7fdcb1731bc497e6c93ed4b9ab2a5d50e56aeaa1ec545e5cd4a4d1ada9175

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 bb91fd654830cb0f321d965f093e79106235c567a40fbc880f65807ddd645715
MD5 d199e203be0fa34bd1935ab2009f1e04
BLAKE2b-256 12ebf8753a27a9f7d6b107ad674763b32889fad0334a6a89077c8162ff3167e1

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 648d741a0e55d0374fc2c2fcca0b4f758c0da324ec5fb8642fb943fb66225eeb
MD5 eab79dbf58f0a63a759ef9d9e0aaa989
BLAKE2b-256 25bc9ec0733a08bf9d573a0faf74dea685c0ed4fd9a47f0462046a9fceede728

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp310-cp310-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 0854e53ff37ea00f6b4ef23caded5f7a0c904d34b86d3808b08d54c27489acfc
MD5 7599555d89a5a8583512d1a50675ca06
BLAKE2b-256 a2b811be18212f114a86d5786e531e4f0427c2de0c4a7506138c591b55e97760

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 644082a696b3e10731cbb9b8ddf95f1099055da53ef1d5fa07d18497c529c833
MD5 80d596e59f3859fdaa64cad5310bfb37
BLAKE2b-256 5112065ff1dad4f66b95a2866a20743c9e16b68f0a1453335900ce1a75749bae

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 c666683f9ae0c6f848ba1bca0558cc930b60793d90f14e25dbb2ad748a00010b
MD5 6320b51b3f65da098d3db2ec0a8b315c
BLAKE2b-256 2220c2dba0f842ff19131018f33d27624746a86289788ba01a6a6a02307a2aef

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 f3f522a4f86d80365ea22c5caf52cb7e4220364bce67ad10ed142e70c78f8725
MD5 126d2f34655ff6c676ce5b38587e6f64
BLAKE2b-256 d105a29f3f4300fb53dfdee3cf52d958739365d084dcb1d592a2a17626e58c9a

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 bb30b052fb6cf61db777fc570344f315f3008551db72affb0fe8f126e81ae563
MD5 e4264ac28203b97d55856dde1982929c
BLAKE2b-256 e5ec37b1f5ee66494f32744d0def83c27b6ad4e130373d4e844f25545f2c0a1e

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp39-cp39-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp39-cp39-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 a68cf44b91886ff0f9d63f1cb3ad56e846aa4bba46d8ab8c73d1d9c740c4f3f7
MD5 3d2f7e1b346ee62f5b203369f8c7ee6c
BLAKE2b-256 5550b861401a87d4b55152c5c61d9e9ff510bdcce07af93def56e5f01dbba159

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 43c61da23ea7d5bd932c597c56a407f38c6caf6c2ebf5159fbfa1092659a1ca6
MD5 ba60a2c71eaa162d5177918da118fc4b
BLAKE2b-256 37960317ee0a74236ed3e60cb1069f1ded9b11943731e171ea6dec2e69c369b9

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 836e04321f787dc993a528ba511755ac2e344b97a09d01d5951446a4e7826d26
MD5 588953f54bb79616447aecdebd69c423
BLAKE2b-256 0bf7843e757b282b67c19bbb950e8e98d13bd114122ce9b54e3a4958f17b2efc

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.16.4-cp39-cp39-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.16.4-cp39-cp39-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 498c06c0ad03bc9fd37fbd2947132266dd1ff735c5a09e449f7854472fd8c58f
MD5 24489142262d1e1dac4da5140e9c244e
BLAKE2b-256 f9ea7e16b373158454f6e168e9c76217fb37c3891a12a5032eb70f574a02ec8b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page