Python Humdrum **kern and **mens utilities

License
- OSI Approved :: GNU Affero General Public License v3
Operating System
- OS Independent
Programming Language
- Python :: 3

Documentation: https://kernpy.pages.dev/

Code examples

Basic Usage

Load a **kern/**mens file into a kp.Document.

import kernpy as kp

# Read a **kern file
document, errors = kp.load("path/to/file.krn")

Load **kern/**mens content from a string into a kp.Document.

import kernpy as kp

document, errors = kp.loads("**kern\n*clefC3\n*k[b-e-a-]\n*M3/4\n4e-\n4g\n4c\n=1\n4r\n2cc;\n==\n*-")
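
The escaped string above is easier to read when it is assembled from one entry per line. The sketch below does only that, in plain Python. The `lines` and `kern_text` names are illustrative, and the call to `kp.loads` is left as a comment so the snippet runs without kernpy installed.

```python
# Each entry is one line (record) of the single-spine **kern score shown above.
lines = [
    "**kern",       # exclusive interpretation: opens the spine
    "*clefC3",      # clef
    "*k[b-e-a-]",   # key signature: three flats
    "*M3/4",        # time signature
    "4e-",
    "4g",
    "4c",
    "=1",           # barline
    "4r",           # quarter rest
    "2cc;",         # half note with fermata
    "==",           # final barline
    "*-",           # spine terminator
]

kern_text = "\n".join(lines)

# With kernpy installed, this string could be passed to the loader:
# document, errors = kp.loads(kern_text)
```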

Create a new standardized file from a kp.Document.

import kernpy as kp

kp.dump(document, "newfile.krn")

Save a kp.Document to a string.

import kernpy as kp

content = kp.dumps(document)

Exploring different options when creating new files

Only use the specified spines in spine_types.

import kernpy as kp

kp.dump(document, "newfile_core.krn",
        spine_types=['**kern'])
kp.dump(document, "newfile_lyrics.krn",
        spine_types=['**text'])
kp.dump(document, "newfile_core_and_lyrics.krn",
        spine_types=['**kern', '**text'])

Use include for selecting the **kern semantic categories to use. The output only contains what is passed. By default, all the categories are included.

import kernpy as kp

kp.dump(document, "newfile_only_clefs.krn",
        include={kp.TokenCategory.CLEF})
kp.dump(document, "newfile_only_durations_and_bounding_boxes.krn",
        include={kp.TokenCategory.DURATION, kp.TokenCategory.BOUNDING_BOXES})

Use exclude to select the **kern semantic categories to leave out. The output contains everything except what is passed. By default, no category is excluded.

import kernpy as kp

kp.dump(document, "newfile_without_pitches.krn",
        exclude={kp.TokenCategory.PITCH})
kp.dump(document, "newfile_without_barlines_or_rests.krn",
        exclude={kp.TokenCategory.BARLINES, kp.TokenCategory.REST})

Use include and exclude together to select the **kern semantic categories to use. The output combines both.

import kernpy as kp

kp.dump(document, "newfile_custom.krn",
        include=kp.BEKERN_CATEGORIES,  # Preloaded set of simple categories
        exclude={kp.TokenCategory.PITCH})
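
One way to picture how `include` and `exclude` combine is as a set difference: start from the included categories, then drop the excluded ones. The sketch below uses plain strings as stand-ins for `kp.TokenCategory` members and assumes that difference semantics; it is not taken from kernpy's implementation, and `effective_categories` is a hypothetical helper.

```python
from typing import Iterable, Optional, Set


def effective_categories(all_categories: Iterable[str],
                         include: Optional[Iterable[str]] = None,
                         exclude: Optional[Iterable[str]] = None) -> Set[str]:
    """Return the categories that survive both filters (assumed semantics)."""
    # No `include` means "everything"; no `exclude` means "drop nothing".
    selected = set(all_categories) if include is None else set(include)
    dropped = set() if exclude is None else set(exclude)
    return selected - dropped


# Stand-in names; the real values are members of kp.TokenCategory.
ALL = {"CLEF", "PITCH", "DURATION", "REST", "BARLINES"}

print(sorted(effective_categories(ALL,
                                  include={"PITCH", "DURATION", "CLEF"},
                                  exclude={"PITCH"})))
# ['CLEF', 'DURATION']
```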

Use tokenizer to select how the categories are split. By default, the normalizedKern tokenizer is used.

import kernpy as kp

kp.dump(document, "newfile_normalized.krn",
        tokenizer=kp.Encoding.normalizedKern)  # Default tokenizer

Select the proper Humdrum **kern tokenizer:

kernpy provides different tokenizers to export each symbol's content in different formats.

Encoding	Tokenized	Description
kern	2.bb-_L	Traditional Humdrum **kern encoding
ekern	2@.@bb@-·_·L	Extended Humdrum **kern encoding

Use the Encoding enum class to select the tokenizer:

import kernpy as kp

doc, _ = kp.load('resource_dir/legacy/chor048.krn')

kern_content = kp.dumps(doc, tokenizer=kp.Encoding.normalizedKern)
ekern_content = kp.dumps(doc, tokenizer=kp.Encoding.eKern)
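
Judging from the table above, the ekern form appears to mark the boundaries between the parts of a symbol with `@` and `·`, and removing those marks gives back the kern form. The sketch below checks that on the table's own example using only the standard library. The separator set is inferred from that single row and may be incomplete; `split_ekern` and `ekern_to_kern` are hypothetical helpers.

```python
import re

# Separators inferred from the table row above (an assumption, not a spec).
EKERN_SEPARATORS = re.compile(r"[@·]")


def split_ekern(token: str) -> list:
    """Split an ekern token into its parts at the separator characters."""
    return EKERN_SEPARATORS.split(token)


def ekern_to_kern(token: str) -> str:
    """Drop the separators to recover the plain kern spelling."""
    return "".join(split_ekern(token))


print(split_ekern("2@.@bb@-·_·L"))    # ['2', '.', 'bb', '-', '_', 'L']
print(ekern_to_kern("2@.@bb@-·_·L"))  # 2.bb-_L
```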

Use from_measure and to_measure to select the measures to export. By default, all the measures are exported.

import kernpy as kp

kp.dump(document, "newfile_1_to_10.krn",
        from_measure=1,  # First measure exported
        to_measure=10)   # Last measure exported

Use spine_ids to select the spines to export. By default, all the spines are exported.

import kernpy as kp

kp.dump(document, "newfile_1_and_2.krn",
        spine_ids=[0, 1])  # Export only the first and the second spine
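
In a Humdrum file the spines are tab-separated columns, so `spine_ids=[0, 1]` can be pictured as keeping the first two columns. The sketch below does that on raw text with the standard library. It assumes a score without spine splits or joins, and `select_columns` is a hypothetical helper, not part of kernpy.

```python
def select_columns(kern_text: str, spine_ids: list) -> str:
    """Keep only the tab-separated columns whose index is in `spine_ids`."""
    kept_lines = []
    for line in kern_text.split("\n"):
        # Global comments and reference records ("!!...") span the whole line.
        if line.startswith("!!") or line == "":
            kept_lines.append(line)
            continue
        fields = line.split("\t")
        kept_lines.append("\t".join(fields[i] for i in spine_ids))
    return "\n".join(kept_lines)


score = (
    "!!!COM: Example\n"
    "**kern\t**kern\t**text\n"
    "4c\t4e\tla\n"
    "=1\t=1\t=1\n"
    "*-\t*-\t*-"
)

print(select_columns(score, [0, 1]))
```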

Use show_measure_numbers to select if the measure numbers are shown. By default, the measure numbers are shown.

import kernpy as kp

kp.dump(document, "newfile_no_measure_numbers.krn",
        show_measure_numbers=False)  # Do not show measure numbers

Use all the options at the same time.

import kernpy as kp

kp.dump(document, "newfile.krn",
        spine_types=['**kern'],  # Export only the **kern spines
        include=kp.BEKERN_CATEGORIES,  # Token categories to include
        exclude={kp.TokenCategory.PITCH},  # Token categories to exclude
        tokenizer=kp.Encoding.eKern,  # Kern encoding
        from_measure=1,  # First measure exported
        to_measure=10,  # Last measure exported
        spine_ids=[0, 1],  # Export only the first and the second spine
        show_measure_numbers=False,  # Do not show measure numbers
        )

Exploring `kernpy` utilities.

Spines analysis

Retrieve all the spine types of the document.

import kernpy as kp

kp.spine_types(document)
# ['**kern', '**kern', '**kern', '**kern', '**root', '**harm']

kp.spine_types(document, spine_types=None)
# ['**kern', '**kern', '**kern', '**kern', '**root', '**harm']

kp.spine_types(document, spine_types=['**kern'])
# ['**kern', '**kern', '**kern', '**kern']

Get specific **kern spines.

import kernpy as kp

def how_many_instrumental_spines(document):
    print(kp.spine_types(document, ['**kern']))
    return len(kp.spine_types(document, ['**kern']))
# ['**kern', '**kern', '**kern', '**kern']
# 4

def has_voice(document):
    return len(kp.spine_types(document, ['**text'])) > 0
# True
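
The same information can be read straight from the text, because the first line whose tokens start with `**` names the type of each spine. The sketch below is a plain-Python approximation of that lookup, for illustration only; `header_spine_types` is a hypothetical helper and does not replace `kp.spine_types`.

```python
from typing import List, Optional


def header_spine_types(kern_text: str,
                       spine_types: Optional[List[str]] = None) -> List[str]:
    """Return the exclusive interpretations (`**...`) of the first header line."""
    for line in kern_text.splitlines():
        if line.startswith("**"):
            found = line.split("\t")
            if spine_types is None:
                return found
            return [name for name in found if name in spine_types]
    return []  # no header line found


score = "**kern\t**kern\t**text\t**harm\n4c\t4e\tla\tC\n*-\t*-\t*-\t*-"

print(header_spine_types(score))              # ['**kern', '**kern', '**text', '**harm']
print(header_spine_types(score, ["**kern"]))  # ['**kern', '**kern']
print(len(header_spine_types(score, ["**text"])) > 0)  # True
```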

How many measures are there in the document? Which measures do you want to export?

After reading the score into a Document object, you can get some useful data:

first_measure: int = document.get_first_measure()
last_measure: int = document.measures_count()

Iterate over all the measures of the document.

import kernpy as kp

doc, _ = kp.load('resource_dir/legacy/chor048.krn')  # 10 measures score
for i in range(doc.get_first_measure(), doc.measures_count() + 1):  # measures 1 to 10 (range end is exclusive)
    # Export only the i-th measure (one-measure excerpts)
    content_ith_measure = kp.dumps(doc, from_measure=i, to_measure=i)
    
    # Export the i-th measure and the next 4 measures (five-measure excerpts)
    if i + 4 <= doc.measures_count():
        content_longer = kp.dumps(doc, from_measure=i, to_measure=i + 4)
    ...
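
The bounds in the loop above are easy to get wrong, so it can help to compute the `(from_measure, to_measure)` pairs separately and test them. The sketch below assumes both bounds are inclusive, as the "Last measure exported" comment on `to_measure` suggests; `measure_windows` is a hypothetical helper.

```python
from typing import Iterator, Tuple


def measure_windows(first: int, last: int, size: int) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (from_measure, to_measure) pairs of `size` measures each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    # The last window starts at `last - size + 1` so that it ends exactly at `last`.
    for start in range(first, last - size + 2):
        yield start, start + size - 1


print(list(measure_windows(1, 10, 5)))
# [(1, 5), (2, 6), (3, 7), (4, 8), (5, 9), (6, 10)]
```

Each pair could then be passed on as `kp.dumps(doc, from_measure=start, to_measure=end)`.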

It is easier to iterate over all the measures using the for measure in doc: loop (which uses the __iter__ method):

import kernpy as kp

for measure in doc:
    content = kp.dumps(doc, from_measure=measure, to_measure=measure)
    ...

Exploring the page bounding boxes.

import kernpy as kp

# Iterate over the pages using the bounding boxes
doc, _ = kp.load('kern_having_bounding_boxes.krn')

# Inspect the bounding boxes
print(doc.page_bounding_boxes)


def are_there_bounding_boxes(doc):
   return len(doc.get_all_tokens(filter_by_categories=[kp.TokenCategory.BOUNDING_BOXES])) > 0


# True

# Iterate over the pages
for page_label, bounding_box_measure in doc.page_bounding_boxes.items():
   print(f"Page: {page_label}, "
         f"Bounding box: {bounding_box_measure}, "
         f"from_measure: {bounding_box_measure.from_measure}, "
         f"to_measure+1: {bounding_box_measure.to_measure}")  # TODO: Check bounds
   kp.dump(doc, f"foo_{page_label}.ekrn",
           spine_types=['**kern'],
           include=kp.BEKERN_CATEGORIES,
           tokenizer=kp.Encoding.eKern,
           from_measure=bounding_box_measure.from_measure,
           to_measure=bounding_box_measure.to_measure - 1  # TODO: Check bounds            
           )

Merge different full kern scores

import kernpy as kp
# NOT AVAILABLE YET!!!
# Pay attention to `kp.merge` too.

# Concat two valid documents
score_a = '**kern\n*clefG2\n=1\n4c\n4d\n4e\n4f\n*-\n'
score_b = '**kern\n*clefG2\n=1\n4a\n4c\n4d\n4c\n*-\n'
concatenated = kp.merge([score_a, score_b])

Concatenate sorted fragments of the same score

import kernpy as kp

fragment_a = '**kern\n*clefG2\n=1\n4c\n4d\n4e\n4f\n*-\n'
fragment_b = '=2\n4a\n4c\n4d\n4c\n*-\n=3\n4a\n4c\n4d\n4c\n*-\n'
fragment_c = '=4\n4a\n4c\n4d\n4c\n*-\n=5\n4a\n4c\n4d\n4c\n*-\n'
fragment_d = '=6\n4a\n4c\n4d\n4c\n*-\n=7\n4a\n4c\n4d\n4c\n*-\n==\n*-'
fragments = [fragment_a, fragment_b, fragment_c, fragment_d]

doc_merged, indexes = kp.concat(fragments)
for index_pair in indexes:
    from_measure, to_measure = index_pair
    print(f'From measure: {from_measure}, To measure: {to_measure}')
    print(kp.dumps(doc_merged, from_measure=from_measure, to_measure=to_measure))

# Sometimes it is useful to have a different separator between the fragments instead of the default one (newline).
doc_merged, indexes = kp.concat(fragments, separator='')
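
To sanity-check the measure range of each fragment before concatenating, the barline tokens can be read directly from the text. The sketch below collects the numbered barlines (`=1`, `=2`, ...) per fragment with a regular expression. It is an illustration only: whether `kp.concat` reports the same pairs in `indexes` is not verified here, the sample fragments are simplified, and `fragment_measure_range` is a hypothetical helper.

```python
import re
from typing import Optional, Tuple

# A numbered barline at the start of a line, e.g. "=1" or "=12" (but not "==").
BARLINE = re.compile(r"^=(\d+)", re.MULTILINE)


def fragment_measure_range(fragment: str) -> Optional[Tuple[int, int]]:
    """Return (first, last) barline numbers found in a fragment, or None."""
    numbers = [int(n) for n in BARLINE.findall(fragment)]
    if not numbers:
        return None
    return min(numbers), max(numbers)


# Simplified sample fragments, written for this sketch.
sample_fragments = [
    "**kern\n*clefG2\n=1\n4c\n4d\n4e\n4f\n",
    "=2\n4a\n4c\n4d\n4c\n=3\n4a\n4c\n4d\n4c\n",
    "=4\n4a\n4c\n4d\n4c\n==\n*-\n",
]

for fragment in sample_fragments:
    print(fragment_measure_range(fragment))
```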

Inspect the `Document` class functions

import kernpy as kp
doc, _ = kp.load('resource_dir/legacy/chor048.krn')  # 10 measures score

frequencies = doc.frequencies()  # All the token categories
filtered_frequencies = doc.frequencies(filter_by_categories=[kp.TokenCategory.SIGNATURES])
frequencies['*k[f#c#]']
# {
#   'occurrences': 4,
#   'category': SIGNATURES,
# }

# Get all the tokens in the document
all_tokens: list[kp.Token] = doc.get_all_tokens()
all_tokens_encodings: list[str] = doc.get_all_tokens_encodings()

# Get the unique tokens in the document (vocabulary)
unique_tokens: list[kp.Token] = doc.get_unique_tokens()
unique_token_encodings: list[str] = doc.get_unique_token_encodings()

# Get the metacomments in the document
doc.get_metacomments()
# ['!!!COM: Coltrane', '!!!voices: 1', '!!!OPR: Blue Train']
doc.get_metacomments(KeyComment='COM')
# ['!!!COM: Coltrane']
doc.get_metacomments(KeyComment='COM', clear=True)
# ['Coltrane']
doc.get_metacomments(KeyComment='non_existing_key')
# []
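
The metacomments shown above follow the Humdrum reference-record shape `!!!KEY: value`. As an illustration of what filtering by key and clearing the prefix amount to, the sketch below parses such lines with the standard library. `parse_reference_record` and `filter_records` are hypothetical helpers, not kernpy functions.

```python
from typing import List, Optional, Tuple


def parse_reference_record(line: str) -> Optional[Tuple[str, str]]:
    """Split '!!!KEY: value' into (key, value); return None for other lines."""
    if not line.startswith("!!!") or ":" not in line:
        return None
    key, _, value = line[3:].partition(":")
    return key.strip(), value.strip()


def filter_records(lines: List[str], key: str, clear: bool = False) -> List[str]:
    """Keep the records whose key matches; with `clear`, return only the values."""
    result = []
    for line in lines:
        parsed = parse_reference_record(line)
        if parsed is not None and parsed[0] == key:
            result.append(parsed[1] if clear else line)
    return result


records = ["!!!COM: Coltrane", "!!!voices: 1", "!!!OPR: Blue Train"]

print(filter_records(records, "COM"))               # ['!!!COM: Coltrane']
print(filter_records(records, "COM", clear=True))   # ['Coltrane']
print(filter_records(records, "non_existing_key"))  # []
```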

Transpose

Inspect what intervals are available for transposing.

import kernpy as kp

print(kp.AVAILABLE_INTERVALS)

Transpose the document to a specific interval.

import kernpy as kp

doc, err = kp.load('resource_dir/legacy/chor048.krn')  # 10 measures score
higher_octave_doc = doc.to_transposed('octave', 'up')

kp.dump(higher_octave_doc, 'higher_octave.krn')
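
In **kern, the octave of a pitch is encoded by letter case and repetition: `c` is middle C, `cc` is one octave above it, `C` is one octave below, and `CC` is two octaves below. The sketch below shifts a bare pitch name up one octave under that convention, to illustrate what an octave transposition does to the text. It handles only the pitch letters (no durations or accidentals) and is not how kernpy implements `to_transposed`; `octave_up` is a hypothetical helper.

```python
def octave_up(pitch: str) -> str:
    """Raise a bare **kern pitch name (e.g. 'C', 'cc', 'GG') by one octave."""
    if not pitch or len(set(pitch)) != 1 or pitch[0].lower() not in "abcdefg":
        raise ValueError(f"not a bare kern pitch name: {pitch!r}")
    letter = pitch[0]
    if letter.islower():
        # Middle octave and above: one more repetition per octave.
        return pitch + letter
    if len(pitch) == 1:
        # 'C' (octave below middle C) becomes 'c' (middle octave).
        return letter.lower()
    # Lower octaves: one fewer repetition per octave, e.g. 'CC' -> 'C'.
    return pitch[:-1]


for name in ["CC", "C", "c", "cc"]:
    print(name, "->", octave_up(name))
```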

On your own

Handle the document if needed.

import kernpy as kp

# Access the document tree
print(document.tree)
# <kernpy.core.document.DocumentTree object at 0x7f8b3b3b3d30>

# View the tree-based Document structure for debugging.
kp.graph(document, '/tmp/graph.dot')
# Render the graph 
# - using Graphviz extension in your IDE
# - in the browser here: https://dreampuf.github.io/GraphvizOnline/

Installation

Production version:

Install the latest version of kernpy using pip:

pip3 uninstall kernpy     # Uninstall the previous version before installing the new one
pip3 install git+https://github.com/OMR-PRAIG-UA-ES/kernpy.git

[!NOTE] On Linux, this module is downloaded to the /tmp directory by default, so the download is removed when the machine shuts down.
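
After installing, one way to confirm which version is active in the current environment is to query the package metadata. The sketch below uses only the standard library (`importlib.metadata`, available from Python 3.8); `installed_version` is a hypothetical helper.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("kernpy")
print("kernpy is not installed" if version is None else f"kernpy {version}")
```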

Development version:

[!IMPORTANT]

Add the development dependencies to the requirements.txt file.

Add the production dependencies to the pyproject.toml file.

After every change in the grammar, the next steps are mandatory:

Run the antlr4.sh script (JAVA required).

Commit & push the changes to the repository.

Generate antlr4 grammar:
For generating the Python code required for parsing the **kern files, the shell script antlr4.sh inside the kernpy package must be run.

./antlr4.sh

Install all the dependencies using the requirements.txt file:

pip install -r requirements.txt

Otherwise, install the required packages manually:

It requires the antlr4 package to be installed using:

pip install antlr4-python3-runtime

For visualizing the bounding boxes, the Pillow library is required:

pip install Pillow

To parse a IIIF (International Image Interoperability Framework) manifest in Python, we use the requests library to fetch the manifest file:

pip install requests

If fetching data over HTTPS fails, install the following version of urllib3:

pip install urllib3==1.26.6

kernpy has been tested with version 4.13.1 of the antlr4-python3-runtime package.

Documentation

Documentation available at https://kernpy.pages.dev/

kernpy can also be executed as a module. Find out the available commands:

python -m kernpy --help
python -m kernpy <command> <options>

Run tests:

cd tests && python -m pytest

Contributing

We welcome contributions from the community! If you'd like to contribute to the project, please follow these steps:

Fork the Repository from GitHub.
Clone your own fork repository.
```
git clone ...
cd ...
```
Create a Branch:
Create a new branch for your feature or bug fix:
```
git checkout -b feature/your-feature-name
```
Commit Your Changes: Commit your changes with a descriptive message:
```
git commit -m "feat: add your feature or fix"
```
Push to Your Branch: Push your changes to your forked repository:
```
git push origin feature/your-feature-name
```
Create a Pull Request: Open a pull request to the main repository, describing your changes.

Citation:

@inproceedings{kernpy_mec_2025,
  title={{kernpy: a Humdrum **Kern Oriented Python Package for Optical Music Recognition Tasks}},
  author={Cerveto-Serrano, Joan and Rizo, David and Calvo-Zaragoza, Jorge},
  booktitle={{Proceedings of the Music Encoding Conference (MEC2025)}},
  address={London, United Kingdom},
  year={2025}
}


Release history

- 1.7.0 (Mar 30, 2026)
- 1.6.2 (Mar 27, 2026)
- 1.6.1 (Mar 27, 2026)
- 1.3.0 (Nov 3, 2025)
- 1.2.0 (Oct 23, 2025)
- 1.1.0 (Jul 4, 2025)
- 1.0.3 (Jun 10, 2025)
- 1.0.2 (Jun 4, 2025)
- 1.0.1 (Jun 3, 2025)
- 1.0.0 (May 30, 2025), this version

Download files

Source Distribution

kernpy-1.0.0.tar.gz (6.4 MB), uploaded May 30, 2025, Source

Built Distribution

kernpy-1.0.0-py3-none-any.whl (152.2 kB), uploaded May 30, 2025, Python 3

File details

Details for the file kernpy-1.0.0.tar.gz.

File metadata

Download URL: kernpy-1.0.0.tar.gz
Upload date: May 30, 2025
Size: 6.4 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for kernpy-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`937260f6e8f6f52f06dd949e29237f1362fd7f33fdbd2dbf078e16cb808f788e`
MD5	`8746f9fdd2a775062f226e1575771f95`
BLAKE2b-256	`d77c4fea909c22666169e1fe767ebd363d5fef0974121f4085bbbf88b00c8ad9`
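
To compare a downloaded file against the SHA256 digest listed above, the digest can be recomputed locally. The sketch below uses `hashlib` from the standard library and reads the file in chunks. `sha256_of` is a hypothetical helper, and the demonstration hashes a temporary file so that it runs without the actual download.

```python
import hashlib
import os
import tempfile


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Demonstration on a throwaway file; for the real check, pass the path of the
# downloaded kernpy-1.0.0.tar.gz and compare with the digest listed above.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"**kern\n*-\n")
    tmp_path = tmp.name

try:
    demo_digest = sha256_of(tmp_path)
    print(demo_digest == hashlib.sha256(b"**kern\n*-\n").hexdigest())  # True
finally:
    os.remove(tmp_path)
```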


File details

Details for the file kernpy-1.0.0-py3-none-any.whl.

File metadata

Download URL: kernpy-1.0.0-py3-none-any.whl
Upload date: May 30, 2025
Size: 152.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for kernpy-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1b3ed7e26286d6230674234974f1587bd08dd46b2b9d85c99750330678017724`
MD5	`7ff0cab5b7592f0bcde67d8fb078c252`
BLAKE2b-256	`db84e17e99344686802d360288f39c17659df006d66f323e9d260752b2bdcd2a`

