Ad removal tool for PDFs.

Gulag Cleaner

Gulag Cleaner is a tool designed to remove advertisements from PDFs, making it easier to read and navigate documents without being disrupted by unwanted ads.

This tool does not simply crop the ads out of the PDF. Instead, it recovers the original ad-free file by manipulating the internal structure of the PDF, which preserves the quality of the original document.

In addition to removing advertisements, Gulag Cleaner is also capable of extracting metadata, such as the author, subject, university, and more, from the file.

Web Version

This tool can be used without installation directly from our website (in Spanish).

Gulag Cleaner webpage

Installation

To install Gulag Cleaner, please download and install Python and then run the following command in your terminal:

pip install gulagcleaner

Usage

Gulag Cleaner can be used through both a Command Line Interface (CLI) and in your code.

Command Line Interface

To use Gulag Cleaner through the CLI, run the following command, replacing <filename> with the names of one or more PDF files or folders containing PDFs:

gulagcleaner [-r] [-s] [-n] [-h] [-v] <filename>...

Options

Gulag Cleaner provides several options for its usage:

  • '-r': Replace the original file with the cleaned version.
  • '-s': Do not show metadata about cleaned files.
  • '-n': Force the naive cleaning method.
  • '-h': Display the help message, providing information on how to use Gulag Cleaner.
  • '-v': Display the current version of Gulag Cleaner.
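
When the CLI has to be driven from a script, the flags listed above can be assembled programmatically. The following is a minimal sketch, assuming the gulagcleaner executable is on your PATH. The helper names build_command and run_cleaner are hypothetical and not part of the package:

```python
import subprocess
from typing import List, Sequence


def build_command(files: Sequence[str], replace: bool = False,
                  silent: bool = False, naive: bool = False) -> List[str]:
    """Assemble the argv list for the gulagcleaner CLI (hypothetical helper)."""
    if not files:
        raise ValueError("at least one PDF file or folder is required")
    cmd = ["gulagcleaner"]
    if replace:
        cmd.append("-r")  # replace the original file with the cleaned version
    if silent:
        cmd.append("-s")  # do not show metadata about cleaned files
    if naive:
        cmd.append("-n")  # force the naive cleaning method
    cmd.extend(files)
    return cmd


def run_cleaner(files: Sequence[str], **flags: bool) -> int:
    """Run the CLI and return its exit code; assumes gulagcleaner is on PATH."""
    return subprocess.run(build_command(files, **flags)).returncode
```

For example, build_command(["notes.pdf"], replace=True) yields the same arguments as typing gulagcleaner -r notes.pdf in a terminal.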

Code

To use Gulag Cleaner in your code, you can use the following code snippet:

from gulagcleaner.extract import clean_pdf_path

# Clean input.pdf and write the ad-free result to output.pdf
return_msg = clean_pdf_path("input.pdf", "output.pdf")
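
To clean a whole folder from code, the same two-argument call can be applied in a loop. This is a sketch under stated assumptions: clean_folder is a hypothetical helper, not part of the package, and the cleaning function is passed in as clean_fn so that the only thing assumed about it is the (input path, output path) call shape shown above.

```python
from pathlib import Path
from typing import Callable, List


def clean_folder(src_dir: str, dst_dir: str,
                 clean_fn: Callable[[str, str], object]) -> List[Path]:
    """Clean every *.pdf directly inside src_dir, writing results to dst_dir."""
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)  # create the output folder if needed
    written = []
    for pdf in sorted(src.glob("*.pdf")):  # non-recursive, sorted by name
        target = dst / pdf.name
        clean_fn(str(pdf), str(target))  # same (input, output) call shape as above
        written.append(target)
    return written
```

Passing clean_pdf_path as clean_fn, for instance clean_folder("pdfs", "cleaned", clean_pdf_path), would write one cleaned file per input PDF and leave the originals untouched.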

License

Gulag Cleaner is distributed under the GPL-3 license, which means it's open-source and free to use.

Contributing

We're always looking for ways to improve Gulag Cleaner, and we welcome contributions from the community. If you have ideas for improvements or bug fixes, please feel free to submit a pull request.

TODO

If you want to help, these are the top priorities right now:

  • Write tests for the package.
  • Add a README.md (with code examples) for the Rust and JS distributions.
  • Add comments to much of the Rust code.
  • Optimize the Rust code for better performance.
  • Add a new "clean_pdf_bytes()" function in Python that takes the PDF bytes directly instead of a file path.
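
Until a bytes-based function exists, one possible stopgap is to wrap a path-based cleaner with temporary files. This is only a sketch and not the planned implementation. The name clean_bytes_via_tempfiles is hypothetical, and clean_fn stands for any function that takes (input path, output path), such as the clean_pdf_path call shown in the Code section.

```python
import os
import tempfile
from typing import Callable


def clean_bytes_via_tempfiles(pdf_bytes: bytes,
                              clean_fn: Callable[[str, str], object]) -> bytes:
    """Hypothetical wrapper: run a path-based cleaner on in-memory bytes."""
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "input.pdf")
        dst = os.path.join(tmp, "output.pdf")
        with open(src, "wb") as fh:
            fh.write(pdf_bytes)  # materialize the input on disk
        clean_fn(src, dst)  # e.g. clean_pdf_path(src, dst)
        with open(dst, "rb") as fh:
            return fh.read()  # read the result before the directory is removed
```

The round trip through disk costs some I/O, which is one reason a native bytes function remains worth adding.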


