A python library for extracting parts from sheetmusic pdfs

Project description

sheatless - A python library for extracting parts from sheetmusic pdfs

Sheatless is a tool that helps The Beatless go sheetless. It is written and maintained by the web committee of the student orchestra The Beatless, and will soon be integrated into taktlaus.no.

API

Currently, the entire library exposes a single entry-point function

def processUploadedPdf(pdfPath, imagesDirPath, instruments_file=None, instruments=None, use_lstm=False, tessdata_dir=None):
    ...
    return parts, instrumentsDefaultParts

which will be available with

from sheatless import processUploadedPdf

Arguments:

| Argument | Optional | Description |
| --- | --- | --- |
| pdfPath | | Full path to the PDF file. |
| imagesDirPath | | Full path to the directory for output images. |
| instruments_file | (optional) | Full path to the instruments file. Accepted formats: YAML (.yaml, .yml), JSON (.json). |
| instruments | (optional) | Dictionary of instruments. Overrides any provided instruments file. If neither instruments_file nor instruments is provided, a default instruments file is used. |
| use_lstm | (optional) | Use the LSTM engine mode instead of the legacy engine mode. |
| tessdata_dir | (optional) | Full path to the tessdata directory. If not provided, the value of the environment variable TESSDATA_DIR is used. |
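A call using the arguments listed above might look like the following minimal sketch. The wrapper name `run_sheatless` is hypothetical and not part of sheatless. Creating the output directory beforehand is also an assumption, since the description does not say whether the library creates it. The import is placed inside the function so that the module still loads on systems where sheatless is not installed.

```python
from pathlib import Path


def run_sheatless(pdf_path, images_dir, **options):
    """Validate the paths, then call sheatless.processUploadedPdf.

    `options` may hold the optional keyword arguments from the signature
    (instruments_file, instruments, use_lstm, tessdata_dir).
    This wrapper is a hypothetical helper, not part of sheatless.
    """
    pdf = Path(pdf_path).resolve()
    if not pdf.is_file():
        raise FileNotFoundError(f"PDF not found: {pdf}")

    # Assumption: creating the output directory up front is harmless,
    # whether or not the library would create it itself.
    out_dir = Path(images_dir).resolve()
    out_dir.mkdir(parents=True, exist_ok=True)

    # Imported lazily so this module loads even where sheatless is absent.
    from sheatless import processUploadedPdf

    # Full paths are passed as strings, matching the argument description.
    return processUploadedPdf(str(pdf), str(out_dir), **options)
```

With sheatless installed, `parts, defaults = run_sheatless("score.pdf", "out/images", use_lstm=True)` would forward `use_lstm=True` to the entry point unchanged.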

Return values:

| Return | Description |
| --- | --- |
| parts | A list of dictionaries { "name": "[name]", "fromPage": i, "toPage": j }, one for each part. |
| instrumentsDefaultParts | A dictionary { ..., "instrument_i": j, ... }, where j is the index in the parts list of the default part for instrument_i. |
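To illustrate how the two return values relate, the sketch below looks up each instrument's default part in the parts list. The helper name `default_part_ranges` and the sample data are hypothetical. The data only mimics the documented shape and is not real library output. No assumption is made about whether page numbers are zero-based or one-based.

```python
def default_part_ranges(parts, instruments_default_parts):
    """Map each instrument to (name, fromPage, toPage) of its default part."""
    result = {}
    for instrument, index in instruments_default_parts.items():
        part = parts[index]  # the value is an index into the parts list
        result[instrument] = (part["name"], part["fromPage"], part["toPage"])
    return result


# Hypothetical sample data in the documented shape (not real library output).
parts = [
    {"name": "Trumpet 1", "fromPage": 1, "toPage": 2},
    {"name": "Trumpet 2", "fromPage": 3, "toPage": 4},
    {"name": "Tuba", "fromPage": 5, "toPage": 5},
]
instruments_default_parts = {"trumpet": 0, "tuba": 2}

print(default_part_ranges(parts, instruments_default_parts))
```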

Example docker setup

Sheatless requires Tesseract and Poppler to be installed on the system. An example Docker setup, as well as an example integration of the library, can be found in sheatless-splitter.
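Before calling the library, it can help to check that both system dependencies are present. The sketch below is one way to do that, assuming the `tesseract` executable and `pdftoppm` (a utility shipped with Poppler) being on PATH is a reasonable proxy for a working installation. How sheatless itself locates these tools is not stated here, and the helper name `missing_tools` is hypothetical.

```python
import shutil


def missing_tools(tools=("tesseract", "pdftoppm")):
    """Return the names in `tools` that are not found on PATH.

    Assumption: the default executable names stand in for the Tesseract
    and Poppler installations; adjust them for your system if needed.
    """
    return [name for name in tools if shutil.which(name) is None]


missing = missing_tools()
if missing:
    print("Missing system dependencies:", ", ".join(missing))
```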

Download files

Source Distribution

sheatless-0.0.11.tar.gz (18.7 kB)

Uploaded Source

Built Distribution

sheatless-0.0.11-py3-none-any.whl (18.6 kB)

Uploaded Python 3

File details

Details for the file sheatless-0.0.11.tar.gz.

File metadata

  • Download URL: sheatless-0.0.11.tar.gz
  • Upload date:
  • Size: 18.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.4 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.10

File hashes

Hashes for sheatless-0.0.11.tar.gz
Algorithm Hash digest
SHA256 0cee506ce3ae1c48c6ca96f478eb7dc6a84b558dd12cc55c03f8ed8f2675d54c
MD5 bb60eac7da1654ba8155b28b0e47b5fd
BLAKE2b-256 907c3dcd7fca3c1d81d7d7ce08ebf77aa1a4260d313087244bde4e3d8eea417c

File details

Details for the file sheatless-0.0.11-py3-none-any.whl.

File metadata

  • Download URL: sheatless-0.0.11-py3-none-any.whl
  • Upload date:
  • Size: 18.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.4 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.10

File hashes

Hashes for sheatless-0.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 939734392aef2116341c24735871a80b32b17a75aaa140405dfa8dfe814eaeb7
MD5 d05bc2a5b71a6606b262b3116ba58d37
BLAKE2b-256 04038d2a7486c62ceae38a8a17f2d39e3dde03047bbc7648488518c2ed708618
