PDF Ingester
PDF Ingest
Language (X) -> English translation (reference: the fairseq translation examples):
https://github.com/facebookresearch/fairseq/blob/main/examples/translation/README.md
Use
pdf-ingest X:\yourfiles
Misc
- How to use GPU in paddleocr
- https://github.com/PaddlePaddle/PaddleOCR/issues/10429
Extensions TODO:
- [ ] fb2
- [ ] epub
Instructions from Mike Adams
MA: I was wondering though if the filename could have a pre-extension based on language like *-EN.txt
MA: Or *-RUS.txt, etc.
MA: Like, if it's easy for your program to realize what language it is
ME: that's trivial
ME: but how is your AI going to make sense out of different languages?
MA: We are just gonna archive non-English for now, and only process English
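The naming scheme discussed above could look roughly like the sketch below. It is not taken from the pdf-ingest source: the `SUFFIXES` table and the `tagged_name` and `should_process` helpers are hypothetical names, and the sketch assumes the language has already been detected and is available as a two-letter code.

```python
from pathlib import Path

# Hypothetical mapping from detected language codes to the
# pre-extension tags mentioned in the chat (-EN, -RUS).
SUFFIXES = {"en": "EN", "ru": "RUS"}


def tagged_name(source: Path, lang_code: str) -> Path:
    """Return the .txt output path with a language pre-extension."""
    # Unknown codes fall back to the upper-cased code itself (an assumption).
    suffix = SUFFIXES.get(lang_code.lower(), lang_code.upper())
    return source.with_name(f"{source.stem}-{suffix}.txt")


def should_process(lang_code: str) -> bool:
    """Only English is processed for now; other languages are just archived."""
    return lang_code.lower() == "en"
```

With these helpers, `tagged_name(Path("report.pdf"), "en")` gives `report-EN.txt`, and `should_process("ru")` returns `False`, so a Russian file would be written as `*-RUS.txt` and left for archiving.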
File details
Details for the file pdf_ingest-1.0.11-py3-none-any.whl.
File metadata
- Download URL: pdf_ingest-1.0.11-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.5
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 5b9b932985ac9073d90ce148580493bd1451636b9595990b8a77ee245d3c5d3b |
| MD5 | 41072f9e7123148dab2ae23ec17bad65 |
| BLAKE2b-256 | 65d6fe3b488205e051e6d754f4296e7258bbc06733fcd8e4b86ca59890a7daeb |