Project description

LAION made scale possible, and we made CPU scale possible. scale2pdf is a library for scaling PDF parsing on CPU.

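from scale2pdf import scalablepdf
from scale2pdf import extractimages

# Parse the PDF and write the result to a JSON file.
scalablepdf("/content/2408.06257v3.pdf", "example-pdf.json")
# Extract the PDF's images into an output folder, using the name imported above.
extractimages("2408.06257v3.pdf", "/path/to/output/folder")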

On a low-end CPU (no GPU), parsing the PDF and saving the result to JSON took 3 min 42 s.
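To reproduce a timing like the one above on your own machine, a small wall-clock helper is enough. The sketch below is not part of scale2pdf: `timed` and `format_duration` are hypothetical helper names, and the commented usage assumes `scalablepdf` takes an input path and an output path, as in the snippet above.

```python
import time
from typing import Any, Callable, Tuple


def timed(fn: Callable[..., Any], *args: Any, **kwargs: Any) -> Tuple[Any, float]:
    """Call fn(*args, **kwargs) and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


def format_duration(seconds: float) -> str:
    """Render a duration as 'M min S s', rounded to the nearest second."""
    minutes, secs = divmod(int(round(seconds)), 60)
    return f"{minutes} min {secs} s"


# Assumed usage with the library (requires scale2pdf to be installed):
# from scale2pdf import scalablepdf
# _, elapsed = timed(scalablepdf, "/content/2408.06257v3.pdf", "example-pdf.json")
# print(format_duration(elapsed))
```

Wall-clock time depends heavily on the CPU, the disk, and the PDF itself, so a figure measured this way is only comparable across runs on the same machine and input.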

Download files

Source Distribution

scale2pdf-0.0.1.tar.gz (2.3 kB view details)

Uploaded Source

Built Distribution

scale2pdf-0.0.1-py3-none-any.whl (2.8 kB view details)

Uploaded Python 3

File details

Details for the file scale2pdf-0.0.1.tar.gz.

File metadata

  • Download URL: scale2pdf-0.0.1.tar.gz
  • Upload date:
  • Size: 2.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.8

File hashes

Hashes for scale2pdf-0.0.1.tar.gz
Algorithm Hash digest
SHA256 309fba0ae4588a94d6a91b194a438508a307a4a8a9e949a325713b473a515008
MD5 05c7cb30bde2dd724c1d3bc7ccc2d2a7
BLAKE2b-256 a0976a1aafbcf2608a99659c2708d4280c9784d94a462a5ed4b6de7bede44985

File details

Details for the file scale2pdf-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: scale2pdf-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 2.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.8

File hashes

Hashes for scale2pdf-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f40f2fb8d1f64c2cd6fd6f3bd6f2e75a34f52403f560c45a7e25c0710dbabcdb
MD5 f6f588070c043e143fa81b428e480a8f
BLAKE2b-256 2f6e45ba78ec47b37dfb3781455c536d4c54d3bdf15d542a863327492af90cad
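The published digests can be checked against a downloaded file before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_hash` are illustrative, and the expected values are the SHA256 digests listed above.

```python
import hashlib
from pathlib import Path

# SHA256 digests as listed for the two release files.
EXPECTED_SHA256 = {
    "scale2pdf-0.0.1.tar.gz":
        "309fba0ae4588a94d6a91b194a438508a307a4a8a9e949a325713b473a515008",
    "scale2pdf-0.0.1-py3-none-any.whl":
        "f40f2fb8d1f64c2cd6fd6f3bd6f2e75a34f52403f560c45a7e25c0710dbabcdb",
}


def sha256_of_file(path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path) -> bool:
    """True only if the file name is known and its digest matches the listed one."""
    expected = EXPECTED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of_file(path) == expected


# Assumed usage, with the file downloaded into the current directory:
# print(matches_published_hash("scale2pdf-0.0.1.tar.gz"))
```

The lookup is keyed on the file name, so a renamed download is reported as not matching rather than being compared against the wrong digest.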
