Ad removal tool for PDFs.
Project description
Gulag Cleaner
Gulag Cleaner is a tool designed to remove advertisements from PDFs, making it easier to read and navigate documents without being disrupted by unwanted ads.
This tool does not just crop the ads out of the PDF, instead, we extract the original file without ads by manipulating the internal structure of the PDF, ensuring maximum quality.
In addition to removing advertisements, Gulag Cleaner is also capable of extracting metadata, such as the author, subject, university, and more, from the file.
Web Version
This tool can be used without installation directly from our website (in Spanish).
Installation
To install Gulag Cleaner, please download and install Python and then run the following command in your terminal:
pip install gulagcleaner
Usage
Gulag Cleaner can be used through both a Command Line Interface (CLI) and in your code.
Command Line Interface
To use Gulag Cleaner through the CLI, simply run the following command, replacing <filename>
with the name of one or more PDF files or folders containing PDF:
gulagcleaner [-r] [-s] [-n] [-h] [-v] <filename>...
Options
Gulag Cleaner provides several options for its usage:
- '-r': Replace the original file with the cleaned version.
- '-s': Do not show metadata about cleaned files.
- '-n': Force the naive cleaning method.
- '-h': Display the help message, providing information on how to use Gulag Cleaner.
- '-v': Display the current version of Gulag Cleaner.
Code
To use Gulag Cleaner in your code, you can use the following code snippet:
from gulagcleaner.extract import clean_pdf
return_msg = clean_pdf_path("input.pdf","output.pdf")
License
Gulag Cleaner is distributed under the GPL-3 license, which means it's open-source and free to use.
Contributing
We're always looking for ways to improve Gulag Cleaner, and we welcome contributions from the community. If you have ideas for improvements or bug fixes, please feel free to submit a pull request.
TODO
If you want to help, these are the top priorities right now:
- Write tests for the package.
- Add README.md (With code examples) for the rust and JS distributions.
- Add comments to a lot of the rust code.
- Optimize the rust code for performance improvements.
- Add a new "clean_pdf_bytes()" function in python that does not require a file path, just the bytes.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file gulagcleaner-0.14.2-cp312-cp312-win_amd64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp312-cp312-win_amd64.whl
- Upload date:
- Size: 386.8 kB
- Tags: CPython 3.12, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 74bbad1f72cbb66204bc23409e0c8cc280708b75937f3a4e5e36bc9729ad8849 |
|
MD5 | d782bf4316f88427e42f768d78f5c7d0 |
|
BLAKE2b-256 | 3d5bc13cf4391301be241e7a57736d8482ec9d0ff94cea74eb9b6173a8201441 |
File details
Details for the file gulagcleaner-0.14.2-cp312-cp312-win32.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp312-cp312-win32.whl
- Upload date:
- Size: 368.4 kB
- Tags: CPython 3.12, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f889ff52498a2875dd786a6188b90273296f177be559165ec7a77b5f604417e7 |
|
MD5 | 042e5280b8947bce2465adb1551e6177 |
|
BLAKE2b-256 | 172e4ade989e16f34d75bf3086ef3a424384fece781abd34f7b21abe097f4773 |
File details
Details for the file gulagcleaner-0.14.2-cp312-cp312-musllinux_1_2_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp312-cp312-musllinux_1_2_x86_64.whl
- Upload date:
- Size: 622.9 kB
- Tags: CPython 3.12, musllinux: musl 1.2+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 27e792aa04cf64e56439b033e1e208b2e1f16dbc6d177699afaec3ca4fbbe13e |
|
MD5 | 5177cebe72295795b015ede0ea491b94 |
|
BLAKE2b-256 | 1d6249565626962a2d76035064b9ef40fa049533bfc08832316cebe205793653 |
File details
Details for the file gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 575.9 kB
- Tags: CPython 3.12, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 118b1c36f2eec23435f89e7695edf92fa31ce35981e9754888d6649022c6aa1c |
|
MD5 | aef61c4ee1cb8191439e0095e20cdd55 |
|
BLAKE2b-256 | b8c7556cbc54ccb3f8d6d43ccc11870b64a3b0339d343f6dea70e94b51cc0e7d |
File details
Details for the file gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl
- Upload date:
- Size: 592.4 kB
- Tags: CPython 3.12, manylinux: glibc 2.17+ i686
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7120c500b5f819cf9d60fc9e798d9dc1648e2e4cf21bf5ac957758ac471bfbe |
|
MD5 | 2e718ceb885410d7f9518b08dc9eaafc |
|
BLAKE2b-256 | 0c5474e758f48aa4672b09ea9b4a105d1642134045129b1d85a1ffb47535376f |
File details
Details for the file gulagcleaner-0.14.2-cp311-cp311-win_amd64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp311-cp311-win_amd64.whl
- Upload date:
- Size: 386.6 kB
- Tags: CPython 3.11, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 87f097517409c36fc2ac8eee2eeeaf31c6842424c24b786cb52c1d35bd92f82b |
|
MD5 | 7bcfa0a313f3cdc960a7c5d7e40383e4 |
|
BLAKE2b-256 | 3dc73eb4c305d35781bdd36638d49e1c0d4572b585f859d3d22b609c76160c91 |
File details
Details for the file gulagcleaner-0.14.2-cp311-cp311-win32.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp311-cp311-win32.whl
- Upload date:
- Size: 368.9 kB
- Tags: CPython 3.11, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 986196d9c4b2fd224667e7143204ad1d030cdb3f465876d3b61868b51df5390f |
|
MD5 | 0c0c8427c9cf9d1f8f29f16471d9f815 |
|
BLAKE2b-256 | 478d2edfee4614c0e5acc0efedd329f40cd4782dc908facfe4377f98a9cf316d |
File details
Details for the file gulagcleaner-0.14.2-cp311-cp311-musllinux_1_2_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp311-cp311-musllinux_1_2_x86_64.whl
- Upload date:
- Size: 622.8 kB
- Tags: CPython 3.11, musllinux: musl 1.2+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0723e7709e3acd15f3e13e2fa96114722c4677aaa97ce609350b34143309867c |
|
MD5 | 36de4d553fefb2e1471164f9f02a9aa5 |
|
BLAKE2b-256 | 55ced66d2577ac9472568126c17dedba6d5ccc284e063cb497566a60c7d1cb4e |
File details
Details for the file gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 575.4 kB
- Tags: CPython 3.11, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8cb179a2e2aeaf3f02284cd678f5cf35bdd661e3e87d0d4b3a32713c3de5f54e |
|
MD5 | 65a98d71742a1f4104e1c81b00beef64 |
|
BLAKE2b-256 | bb1c0fd3bd8a5f1c5e428058cc7ddc5f164a909d7e14dd0eb3ca72c2fdc92b8e |
File details
Details for the file gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp311-cp311-manylinux_2_17_i686.manylinux2014_i686.whl
- Upload date:
- Size: 592.4 kB
- Tags: CPython 3.11, manylinux: glibc 2.17+ i686
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e66becc99b44f78025e2505cb96e72bcb67d0f2425adb95fe40ffa53be45ee88 |
|
MD5 | c25fd0bf2cef5677c7474b9852023d19 |
|
BLAKE2b-256 | 221e7bb889f6e076b17a7af56fa300bff73c707c297db94897b94e65a1c8dd07 |
File details
Details for the file gulagcleaner-0.14.2-cp310-cp310-win_amd64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp310-cp310-win_amd64.whl
- Upload date:
- Size: 386.5 kB
- Tags: CPython 3.10, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2cac3820755cdf49dfef87760ebbe3c17d7901ea949920b5293a5466e258080a |
|
MD5 | f1000e528f0abdd2f1b468f033bdbeee |
|
BLAKE2b-256 | 667aef0f5ea02d56f0c608b0f122a01db43717fea4c6435f777b5de2facd0e9b |
File details
Details for the file gulagcleaner-0.14.2-cp310-cp310-win32.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp310-cp310-win32.whl
- Upload date:
- Size: 368.9 kB
- Tags: CPython 3.10, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b3b99a9e129cc576b6c51e88f51b5f19a66fc9718dc014c7a5a8793c7227f861 |
|
MD5 | 8c06fb5e3d7f666b5e955a7b56b36fe4 |
|
BLAKE2b-256 | 8861d2faa0a155a8b727fecb3d1874a7fcde6b575cefefe3cbd6c66afbf652ef |
File details
Details for the file gulagcleaner-0.14.2-cp310-cp310-musllinux_1_2_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp310-cp310-musllinux_1_2_x86_64.whl
- Upload date:
- Size: 622.8 kB
- Tags: CPython 3.10, musllinux: musl 1.2+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3f7c7f7083209ee4e02de7499d5602a7f378b1a7ef5919df61771509557b44fd |
|
MD5 | 448f3f01af182a11522abd01e19f7e33 |
|
BLAKE2b-256 | 047906ab3e5702be3e866c420279c9e094d403f5b1cca1b8b0f8372df7137d4f |
File details
Details for the file gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 575.4 kB
- Tags: CPython 3.10, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ea46ab41c9b28d172e6bf495e3493942b47317c8ca566626d5b696d28f539f5 |
|
MD5 | ef9f9f7edfe98a90cdef57828c99b9c5 |
|
BLAKE2b-256 | ea717dc16c56a4a8aba35de4774dacc63614cd87646ed0ed7f2f2858894dda56 |
File details
Details for the file gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp310-cp310-manylinux_2_17_i686.manylinux2014_i686.whl
- Upload date:
- Size: 592.3 kB
- Tags: CPython 3.10, manylinux: glibc 2.17+ i686
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4d3e468dbed90610cf5b62d24f57c4d197ca4b07c669d52d2e9db83c0987bafc |
|
MD5 | b85a42302f6dc7932dffc2e65f59915f |
|
BLAKE2b-256 | 93c586e231640aefdbafc1afaa3962a83a1ec2c067ab0c389dd197b859fb6892 |
File details
Details for the file gulagcleaner-0.14.2-cp39-cp39-win_amd64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp39-cp39-win_amd64.whl
- Upload date:
- Size: 386.5 kB
- Tags: CPython 3.9, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8c2ebb35161332e4d6c0edeab185f5633d4327733ea1d74a0ae3d1828e288fda |
|
MD5 | 7ef276c2ec644b44b81c672a89d989e4 |
|
BLAKE2b-256 | 1828a3a38a08811836a4f1df8d766096e77db43ad0f9a7ca1f7bbc467a042b99 |
File details
Details for the file gulagcleaner-0.14.2-cp39-cp39-win32.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp39-cp39-win32.whl
- Upload date:
- Size: 368.9 kB
- Tags: CPython 3.9, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c5b9b53737484dc50ab02f8b1ee28301b0e3cc4d3ac90054d355a8599f447eb9 |
|
MD5 | 7796146f694c23913c74a980542a4b40 |
|
BLAKE2b-256 | dd4854a88d27bc4d8e87a8fa99ce8985b595e5819f19faed08bd94e2c42f3749 |
File details
Details for the file gulagcleaner-0.14.2-cp39-cp39-musllinux_1_2_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp39-cp39-musllinux_1_2_x86_64.whl
- Upload date:
- Size: 622.7 kB
- Tags: CPython 3.9, musllinux: musl 1.2+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7c0413cf6df06589b43096393de2e04670927dd2f52635104c1c65b143f244c0 |
|
MD5 | 4c1b0d753c335d06ea185cf6943da770 |
|
BLAKE2b-256 | 7dc31644c477e8e463966a69608ad8f8346e5a515d1c8264a397a5d4184eeb2e |
File details
Details for the file gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 575.4 kB
- Tags: CPython 3.9, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 49cde00c003df9d8c817599f9384834753a29c4794b961bfdb97fc76213c88a2 |
|
MD5 | 262db9e27947522c5d26e9740706497a |
|
BLAKE2b-256 | 0465f506d7ca75b7d74c5edbd9a59ace68a04ede6e0aaafe3e05da2a944f6e1a |
File details
Details for the file gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl
.
File metadata
- Download URL: gulagcleaner-0.14.2-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl
- Upload date:
- Size: 592.2 kB
- Tags: CPython 3.9, manylinux: glibc 2.17+ i686
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5677f5b867a112f52a7166ea1af21253fd961ffee24f63cad194d8bfbe62aa85 |
|
MD5 | 847d86c9378bf4ea57c9812ee19449d3 |
|
BLAKE2b-256 | 5eb7a1e148d06322ad06647df1e1ab94e3d7cc8f43fb5b13d6208b814c56d6e1 |