Skip to main content

Ad removal tool for PDFs.

Project description

Gulag Cleaner

Twitter Instagram Ko-fi

Gulag Cleaner is a tool designed to remove advertisements from PDFs, making it easier to read and navigate documents without being disrupted by unwanted ads.

This tool does not just crop the ads out of the PDF, instead, we extract the original file without ads by manipulating the internal structure of the PDF, ensuring maximum quality.

In addition to removing advertisements, Gulag Cleaner is also capable of extracting metadata, such as the author, subject, university, and more, from the file.

Web Version

This tool can be used without installation directly from our website (in Spanish).

Gulag Cleaner webpage

Installation

To install Gulag Cleaner, please download and install Python and then run the following command in your terminal:

pip install gulagcleaner

Usage

Gulag Cleaner can be used through both a Command Line Interface (CLI) and in your code.

Command Line Interface

To use Gulag Cleaner through the CLI, simply run the following command, replacing <filename> with the name of one or more PDF files or folders containing PDF:

gulagcleaner [-r] [-s] [-n] [-h] [-v] <filename>...

Options

Gulag Cleaner provides several options for its usage:

  • '-r': Replace the original file with the cleaned version.
  • '-s': Do not show metadata about cleaned files.
  • '-n': Force the naive cleaning method.
  • '-h': Display the help message, providing information on how to use Gulag Cleaner.
  • '-v': Display the current version of Gulag Cleaner.

Code

To use Gulag Cleaner in your code, you can use the following code snippet:

from gulagcleaner.extract import clean_pdf

return_msg = clean_pdf_path("input.pdf","output.pdf")

License

Gulag Cleaner is distributed under the GPL-3 license, which means it's open-source and free to use.

Contributing

We're always looking for ways to improve Gulag Cleaner, and we welcome contributions from the community. If you have ideas for improvements or bug fixes, please feel free to submit a pull request.

TODO

If you want to help, these are the top priorities right now:

  • Write tests for the package.
  • Add README.md (With code examples) for the rust and JS distributions.
  • Add comments to a lot of the rust code.
  • Optimize the rust code for performance improvements.
  • Add a new "clean_pdf_bytes()" function in python that does not require a file path, just the bytes.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

gulagcleaner-4-cp312-cp312-win_amd64.whl (390.2 kB view details)

Uploaded CPython 3.12 Windows x86-64

gulagcleaner-4-cp312-cp312-win32.whl (373.2 kB view details)

Uploaded CPython 3.12 Windows x86

gulagcleaner-4-cp312-cp312-musllinux_1_2_x86_64.whl (626.5 kB view details)

Uploaded CPython 3.12 musllinux: musl 1.2+ x86-64

gulagcleaner-4-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (579.2 kB view details)

Uploaded CPython 3.12 manylinux: glibc 2.17+ x86-64

gulagcleaner-4-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl (596.2 kB view details)

Uploaded CPython 3.12 manylinux: glibc 2.17+ i686

gulagcleaner-4-cp311-cp311-win_amd64.whl (390.1 kB view details)

Uploaded CPython 3.11 Windows x86-64

gulagcleaner-4-cp311-cp311-win32.whl (373.3 kB view details)

Uploaded CPython 3.11 Windows x86

gulagcleaner-4-cp311-cp311-musllinux_1_2_x86_64.whl (626.5 kB view details)

Uploaded CPython 3.11 musllinux: musl 1.2+ x86-64

gulagcleaner-4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (579.1 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

gulagcleaner-4-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl (596.4 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ i686

gulagcleaner-4-cp310-cp310-win_amd64.whl (390.1 kB view details)

Uploaded CPython 3.10 Windows x86-64

gulagcleaner-4-cp310-cp310-win32.whl (373.3 kB view details)

Uploaded CPython 3.10 Windows x86

gulagcleaner-4-cp310-cp310-musllinux_1_2_x86_64.whl (626.4 kB view details)

Uploaded CPython 3.10 musllinux: musl 1.2+ x86-64

gulagcleaner-4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (579.1 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

gulagcleaner-4-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl (596.3 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ i686

gulagcleaner-4-cp39-cp39-win_amd64.whl (390.1 kB view details)

Uploaded CPython 3.9 Windows x86-64

gulagcleaner-4-cp39-cp39-win32.whl (373.2 kB view details)

Uploaded CPython 3.9 Windows x86

gulagcleaner-4-cp39-cp39-musllinux_1_2_x86_64.whl (626.3 kB view details)

Uploaded CPython 3.9 musllinux: musl 1.2+ x86-64

gulagcleaner-4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (579.0 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

gulagcleaner-4-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl (596.1 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ i686

File details

Details for the file gulagcleaner-4-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: gulagcleaner-4-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 390.2 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 8d3e180289b50cedffc169a63a97114ea8997ef9ac843351539b43e11b3c02d3
MD5 d4c655d34994b022f9b469f5df736222
BLAKE2b-256 5328bbaee15c5cc993fc63844a25d9e713f1477da377cbd508dbd3b3c78cbcdf

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp312-cp312-win32.whl.

File metadata

  • Download URL: gulagcleaner-4-cp312-cp312-win32.whl
  • Upload date:
  • Size: 373.2 kB
  • Tags: CPython 3.12, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp312-cp312-win32.whl
Algorithm Hash digest
SHA256 9ed58970be7184489bf0351fe6110d499de2a95fe3e20c39cc5f656dbbc13e06
MD5 902b8e01aa0eac1b40910e985c40ad36
BLAKE2b-256 7a507ca658cf974031ed74e27eebd9573211e390d63d67b1267faee334c78f46

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp312-cp312-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp312-cp312-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 9155422a2fa873fbf8f5c88e5926e8b92139b5c02ed3a217e634d8408c4b819d
MD5 809dfd790cc6dfee463cbcbd2f68351d
BLAKE2b-256 3f28366cb4560dc2cd06fcbf7505981e90d83672ebc1ec2a083fc5fad9699b8a

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 df1f4c50ad797f6562698600839accc28da4fe028aedea4b93b29da12cd1f196
MD5 bcedd86e74a5debb0cc0dcb23ccf9507
BLAKE2b-256 68bd25a7fb152fc8601a339d0a33330b356deea33f504349545ff443c3254855

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 1417f4753486b20cc3646224d7ca895e32b78e84bb538f4a81c86fcee80da9ab
MD5 a70e4b3c90bfb3f8e623befefd02b43a
BLAKE2b-256 d5adeae9b0463a90ecdb96f0026c8ec12a893ed5aebc8115ed27e83cd420ba23

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: gulagcleaner-4-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 390.1 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 5aed9b5a79dcd6e07bcd159f95e92b7264a08fd5af0f3435415e8bc1b2593ac8
MD5 7d6abb2e218ba83bcc8b2b786f0a5a69
BLAKE2b-256 a6f27e4bf38058b8effbe1e7fad148ebebe809c5035aafb804e5e5630fbe5308

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp311-cp311-win32.whl.

File metadata

  • Download URL: gulagcleaner-4-cp311-cp311-win32.whl
  • Upload date:
  • Size: 373.3 kB
  • Tags: CPython 3.11, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp311-cp311-win32.whl
Algorithm Hash digest
SHA256 0f47c4b8ad4f4c2f691b2a551b8bbc9ffe08d59e330047e4de56567cfea3064c
MD5 e1036b2080bcf2206a257449a33bc475
BLAKE2b-256 c3a0701a4ac320bfc63c464fceaf1fcac91792ac52cb5ca4e78de101c6cab5e3

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp311-cp311-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 feb745f84c50c8b81f2b83e6f93282ccf555f0caa6f2c8fc922f75272ada1871
MD5 39720b117dc50b46c442cecf233aabe2
BLAKE2b-256 57fd35452e2bc24b15948ef6ba2b0e35b2bc9a3675d6bc6de3bdd8895efdf3cf

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 b1a8110133ab64808e5b6b6009c5014935a6494b9c6858b345739923ad19b5fa
MD5 fc26c2051a148c2f3b1b839dcaff04c0
BLAKE2b-256 9b59f2cf288fca645307aa64b7e02ec5c08d1cdaf2b2e837acbcb62414e3af00

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 d63b625ca1ae4b7dbb9238de05b2941ea7dd358ec8b702c39322f50e32d2c7a9
MD5 1f79c517a823a9d9300315aedf0b6cb9
BLAKE2b-256 d58bfc8d21ec69edc05e097710fbe12159f5b5e23e72b99525f0fcf63d8aab1d

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp310-cp310-win_amd64.whl.

File metadata

  • Download URL: gulagcleaner-4-cp310-cp310-win_amd64.whl
  • Upload date:
  • Size: 390.1 kB
  • Tags: CPython 3.10, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 5c80f163d6604461c6816c35abff4ced6bdc0ed3083cba99e771247cd7577cfb
MD5 d017cf93b116ad5c84c3aa7fb4c24c45
BLAKE2b-256 a1d1649111f99b2c5a186964bba9cface79e1072d3d0868f90f464e6fd16a85e

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp310-cp310-win32.whl.

File metadata

  • Download URL: gulagcleaner-4-cp310-cp310-win32.whl
  • Upload date:
  • Size: 373.3 kB
  • Tags: CPython 3.10, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp310-cp310-win32.whl
Algorithm Hash digest
SHA256 81e65e6a255c11db8fda1e11b116d84b160f13f88236410d224123437fca3cc7
MD5 3b630d23c1ef6bfa37a07f2b48bace67
BLAKE2b-256 53fbb804a7424061afff11b5fd1fa74d59a2f79e81d3de08a810a3714301a048

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp310-cp310-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 ee341130a51132358cdbafa0c7d7c872fe13a5d74c68a337cea1b22e07d0d9ea
MD5 06d5f1ee92854d889ff358bcceacb4d9
BLAKE2b-256 0c3530de0986f6fa2937823fe9b4e7ab373e1516bf9f63a4389e43935f282ebb

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 3055b6ee3bdd9dabdd1654ee22f8a79abe417cdd69f8aeccdea1ad7300e07062
MD5 e8d1a6311332e70267b0fa7f55bf1628
BLAKE2b-256 0c53d6e2bd6870328d30edadf079afcaf79d400c5333d7381b4c27a005c506e1

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 4b60fece12685568b226d3573053cab97ad81fb932d586d00f4cb083a8ae7892
MD5 4e225760f968c9eca9f8f09b0e77c7c5
BLAKE2b-256 e0f6d77e60dc891a5e490af8877b305bfb240633055c3886858beb93f73d36bb

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: gulagcleaner-4-cp39-cp39-win_amd64.whl
  • Upload date:
  • Size: 390.1 kB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 993a8ead2a3e0b62af2652582394291cf9729042709990c52ab30409a87e5f73
MD5 e6ba6fd4d5cd365509900721d6a2070e
BLAKE2b-256 90bf58a488cdb7e91f3e82437493750cee1320c2198cbcba89eb22c271b4b6aa

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp39-cp39-win32.whl.

File metadata

  • Download URL: gulagcleaner-4-cp39-cp39-win32.whl
  • Upload date:
  • Size: 373.2 kB
  • Tags: CPython 3.9, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-4-cp39-cp39-win32.whl
Algorithm Hash digest
SHA256 df8dbcacb27ebddcad622987203f65f5c90ee790995e05255f58d0adff562c87
MD5 78ed514822a71fab7064c442814829b1
BLAKE2b-256 57b5504b21bc8fca0a33cb4857d349a3faad4d82f9a290877b86517a45d7e0e9

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp39-cp39-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp39-cp39-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 1cb743978dbcda7a26a4aeb832e9dc71b7e4615de75f7a72aa185cef5c420a7f
MD5 1febec2513c15d40625b9c8e4b0a5356
BLAKE2b-256 c4af2f89fcfbf4391dbee8eb5d349ccf6ec861f697d4fbdd6e670ba9a5e0fb58

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 d836eb4f6ac1bee43e94a34aed48bc1a4b0da6dfe981a98d9cc56aff08cae689
MD5 ad6476a1196a4b410b0e0c447f850c62
BLAKE2b-256 a3fc0dff5e904964053c1c041fce2c169c1e97208c70efe3f0936855f48b21f0

See more details on using hashes here.

File details

Details for the file gulagcleaner-4-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-4-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 2a5dd82579f9f054ba93d2e75e7171ad6241499f697f88fa6dbf1e8c368306d3
MD5 b92b5e08ed1cecf80d10a4fe26b19024
BLAKE2b-256 794eaab39837e6ddea386d8a55e4ad425c50260fabdfce80a149e8b4f26e77f1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page