Skip to main content

Ad removal tool for PDFs.

Project description

Gulag Cleaner

Twitter Instagram Ko-fi

Gulag Cleaner is a tool designed to remove advertisements from PDFs, making it easier to read and navigate documents without being disrupted by unwanted ads.

This tool does not just crop the ads out of the PDF, instead, we extract the original file without ads by manipulating the internal structure of the PDF, ensuring maximum quality.

In addition to removing advertisements, Gulag Cleaner is also capable of extracting metadata, such as the author, subject, university, and more, from the file.

Web Version

This tool can be used without installation directly from our website (in Spanish).

Gulag Cleaner webpage

Installation

To install Gulag Cleaner, please download and install Python and then run the following command in your terminal:

pip install gulagcleaner

Usage

Gulag Cleaner can be used through both a Command Line Interface (CLI) and in your code.

Command Line Interface

To use Gulag Cleaner through the CLI, simply run the following command, replacing <filename> with the name of one or more PDF files or folders containing PDF:

gulagcleaner [-r] [-s] [-n] [-h] [-v] <filename>...

Options

Gulag Cleaner provides several options for its usage:

  • '-r': Replace the original file with the cleaned version.
  • '-s': Do not show metadata about cleaned files.
  • '-n': Force the naive cleaning method.
  • '-h': Display the help message, providing information on how to use Gulag Cleaner.
  • '-v': Display the current version of Gulag Cleaner.

Code

To use Gulag Cleaner in your code, you can use the following code snippet:

from gulagcleaner.extract import clean_pdf

return_msg = clean_pdf_path("input.pdf","output.pdf")

License

Gulag Cleaner is distributed under the GPL-3 license, which means it's open-source and free to use.

Contributing

We're always looking for ways to improve Gulag Cleaner, and we welcome contributions from the community. If you have ideas for improvements or bug fixes, please feel free to submit a pull request.

TODO

If you want to help, these are the top priorities right now:

  • Write tests for the package.
  • Add README.md (With code examples) for the rust and JS distributions.
  • Add comments to a lot of the rust code.
  • Optimize the rust code for performance improvements.
  • Add a new "clean_pdf_bytes()" function in python that does not require a file path, just the bytes.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

gulagcleaner-0.14.2-cp312-cp312-win_amd64.whl (386.8 kB view details)

Uploaded CPython 3.12 Windows x86-64

gulagcleaner-0.14.2-cp312-cp312-win32.whl (368.4 kB view details)

Uploaded CPython 3.12 Windows x86

gulagcleaner-0.14.2-cp312-cp312-musllinux_1_2_x86_64.whl (622.9 kB view details)

Uploaded CPython 3.12 musllinux: musl 1.2+ x86-64

gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (575.9 kB view details)

Uploaded CPython 3.12 manylinux: glibc 2.17+ x86-64

gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl (592.4 kB view details)

Uploaded CPython 3.12 manylinux: glibc 2.17+ i686

gulagcleaner-0.14.2-cp311-cp311-win_amd64.whl (386.6 kB view details)

Uploaded CPython 3.11 Windows x86-64

gulagcleaner-0.14.2-cp311-cp311-win32.whl (368.9 kB view details)

Uploaded CPython 3.11 Windows x86

gulagcleaner-0.14.2-cp311-cp311-musllinux_1_2_x86_64.whl (622.8 kB view details)

Uploaded CPython 3.11 musllinux: musl 1.2+ x86-64

gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (575.4 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl (592.4 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ i686

gulagcleaner-0.14.2-cp310-cp310-win_amd64.whl (386.5 kB view details)

Uploaded CPython 3.10 Windows x86-64

gulagcleaner-0.14.2-cp310-cp310-win32.whl (368.9 kB view details)

Uploaded CPython 3.10 Windows x86

gulagcleaner-0.14.2-cp310-cp310-musllinux_1_2_x86_64.whl (622.8 kB view details)

Uploaded CPython 3.10 musllinux: musl 1.2+ x86-64

gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (575.4 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl (592.3 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ i686

gulagcleaner-0.14.2-cp39-cp39-win_amd64.whl (386.5 kB view details)

Uploaded CPython 3.9 Windows x86-64

gulagcleaner-0.14.2-cp39-cp39-win32.whl (368.9 kB view details)

Uploaded CPython 3.9 Windows x86

gulagcleaner-0.14.2-cp39-cp39-musllinux_1_2_x86_64.whl (622.7 kB view details)

Uploaded CPython 3.9 musllinux: musl 1.2+ x86-64

gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (575.4 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl (592.2 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ i686

File details

Details for the file gulagcleaner-0.14.2-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 74bbad1f72cbb66204bc23409e0c8cc280708b75937f3a4e5e36bc9729ad8849
MD5 d782bf4316f88427e42f768d78f5c7d0
BLAKE2b-256 3d5bc13cf4391301be241e7a57736d8482ec9d0ff94cea74eb9b6173a8201441

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp312-cp312-win32.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp312-cp312-win32.whl
Algorithm Hash digest
SHA256 f889ff52498a2875dd786a6188b90273296f177be559165ec7a77b5f604417e7
MD5 042e5280b8947bce2465adb1551e6177
BLAKE2b-256 172e4ade989e16f34d75bf3086ef3a424384fece781abd34f7b21abe097f4773

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp312-cp312-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp312-cp312-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 27e792aa04cf64e56439b033e1e208b2e1f16dbc6d177699afaec3ca4fbbe13e
MD5 5177cebe72295795b015ede0ea491b94
BLAKE2b-256 1d6249565626962a2d76035064b9ef40fa049533bfc08832316cebe205793653

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 118b1c36f2eec23435f89e7695edf92fa31ce35981e9754888d6649022c6aa1c
MD5 aef61c4ee1cb8191439e0095e20cdd55
BLAKE2b-256 b8c7556cbc54ccb3f8d6d43ccc11870b64a3b0339d343f6dea70e94b51cc0e7d

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 a7120c500b5f819cf9d60fc9e798d9dc1648e2e4cf21bf5ac957758ac471bfbe
MD5 2e718ceb885410d7f9518b08dc9eaafc
BLAKE2b-256 0c5474e758f48aa4672b09ea9b4a105d1642134045129b1d85a1ffb47535376f

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 87f097517409c36fc2ac8eee2eeeaf31c6842424c24b786cb52c1d35bd92f82b
MD5 7bcfa0a313f3cdc960a7c5d7e40383e4
BLAKE2b-256 3dc73eb4c305d35781bdd36638d49e1c0d4572b585f859d3d22b609c76160c91

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp311-cp311-win32.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp311-cp311-win32.whl
Algorithm Hash digest
SHA256 986196d9c4b2fd224667e7143204ad1d030cdb3f465876d3b61868b51df5390f
MD5 0c0c8427c9cf9d1f8f29f16471d9f815
BLAKE2b-256 478d2edfee4614c0e5acc0efedd329f40cd4782dc908facfe4377f98a9cf316d

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp311-cp311-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 0723e7709e3acd15f3e13e2fa96114722c4677aaa97ce609350b34143309867c
MD5 36de4d553fefb2e1471164f9f02a9aa5
BLAKE2b-256 55ced66d2577ac9472568126c17dedba6d5ccc284e063cb497566a60c7d1cb4e

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 8cb179a2e2aeaf3f02284cd678f5cf35bdd661e3e87d0d4b3a32713c3de5f54e
MD5 65a98d71742a1f4104e1c81b00beef64
BLAKE2b-256 bb1c0fd3bd8a5f1c5e428058cc7ddc5f164a909d7e14dd0eb3ca72c2fdc92b8e

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 e66becc99b44f78025e2505cb96e72bcb67d0f2425adb95fe40ffa53be45ee88
MD5 c25fd0bf2cef5677c7474b9852023d19
BLAKE2b-256 221e7bb889f6e076b17a7af56fa300bff73c707c297db94897b94e65a1c8dd07

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 2cac3820755cdf49dfef87760ebbe3c17d7901ea949920b5293a5466e258080a
MD5 f1000e528f0abdd2f1b468f033bdbeee
BLAKE2b-256 667aef0f5ea02d56f0c608b0f122a01db43717fea4c6435f777b5de2facd0e9b

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp310-cp310-win32.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp310-cp310-win32.whl
Algorithm Hash digest
SHA256 b3b99a9e129cc576b6c51e88f51b5f19a66fc9718dc014c7a5a8793c7227f861
MD5 8c06fb5e3d7f666b5e955a7b56b36fe4
BLAKE2b-256 8861d2faa0a155a8b727fecb3d1874a7fcde6b575cefefe3cbd6c66afbf652ef

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp310-cp310-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 3f7c7f7083209ee4e02de7499d5602a7f378b1a7ef5919df61771509557b44fd
MD5 448f3f01af182a11522abd01e19f7e33
BLAKE2b-256 047906ab3e5702be3e866c420279c9e094d403f5b1cca1b8b0f8372df7137d4f

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 8ea46ab41c9b28d172e6bf495e3493942b47317c8ca566626d5b696d28f539f5
MD5 ef9f9f7edfe98a90cdef57828c99b9c5
BLAKE2b-256 ea717dc16c56a4a8aba35de4774dacc63614cd87646ed0ed7f2f2858894dda56

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 4d3e468dbed90610cf5b62d24f57c4d197ca4b07c669d52d2e9db83c0987bafc
MD5 b85a42302f6dc7932dffc2e65f59915f
BLAKE2b-256 93c586e231640aefdbafc1afaa3962a83a1ec2c067ab0c389dd197b859fb6892

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 8c2ebb35161332e4d6c0edeab185f5633d4327733ea1d74a0ae3d1828e288fda
MD5 7ef276c2ec644b44b81c672a89d989e4
BLAKE2b-256 1828a3a38a08811836a4f1df8d766096e77db43ad0f9a7ca1f7bbc467a042b99

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp39-cp39-win32.whl.

File metadata

  • Download URL: gulagcleaner-0.14.2-cp39-cp39-win32.whl
  • Upload date:
  • Size: 368.9 kB
  • Tags: CPython 3.9, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for gulagcleaner-0.14.2-cp39-cp39-win32.whl
Algorithm Hash digest
SHA256 c5b9b53737484dc50ab02f8b1ee28301b0e3cc4d3ac90054d355a8599f447eb9
MD5 7796146f694c23913c74a980542a4b40
BLAKE2b-256 dd4854a88d27bc4d8e87a8fa99ce8985b595e5819f19faed08bd94e2c42f3749

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp39-cp39-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp39-cp39-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 7c0413cf6df06589b43096393de2e04670927dd2f52635104c1c65b143f244c0
MD5 4c1b0d753c335d06ea185cf6943da770
BLAKE2b-256 7dc31644c477e8e463966a69608ad8f8346e5a515d1c8264a397a5d4184eeb2e

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 49cde00c003df9d8c817599f9384834753a29c4794b961bfdb97fc76213c88a2
MD5 262db9e27947522c5d26e9740706497a
BLAKE2b-256 0465f506d7ca75b7d74c5edbd9a59ace68a04ede6e0aaafe3e05da2a944f6e1a

See more details on using hashes here.

File details

Details for the file gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 5677f5b867a112f52a7166ea1af21253fd961ffee24f63cad194d8bfbe62aa85
MD5 847d86c9378bf4ea57c9812ee19449d3
BLAKE2b-256 5eb7a1e148d06322ad06647df1e1ab94e3d7cc8f43fb5b13d6208b814c56d6e1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page