Pythonic HTML5 generation without templating

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: MIT License
- Public Domain
Operating System
- OS Independent
Programming Language
Topic
- Text Processing :: Markup :: HTML

Project description

Pythonic HTML5 generation

The module is named html5tagger because it makes heavy use of the simplified HTML5 syntax where many opening and closing tags are optional. Tags are written with no consideration of DOM tree structure, which the browsers determine automatically based on the content that follows. No pretty printing is added to the HTML code because such extra whitespace would create unnecessary DOM nodes, often affecting output formatting as well.

pip install html5tagger

Since the module is a single file with no dependencies, you may also just copy html5tagger.py directly into your project.

Intro

You can create HTML snippets by starting with E (for an empty builder) and adding elements with dot notation:

from html5tagger import Document, E

snippet = E.table(E.tr.th("First").th("Second").th("Third").tr.td(1).td(2).td(3))

print(snippet)  # Print snippet's code

<table><tr><th>First<th>Second<th>Third<tr><td>1<td>2<td>3</table>

The Builder object converts to HTML string when printed or by str(snippet). Jupyter Notebook and others support automatic display in HTML by _repr_html_ and __html__ conversions.

In contrast to E which creates snippets, Document creates a new document (i.e. it begins with a DOCTYPE declaration). A minimal head structure is created using any provided title and/or urls. html attributes may be defined by keyword arguments.

Document("Test page", lang="en")

<!DOCTYPE html><html lang=en><meta charset="utf-8"><title>Test page</title>

This is a valid document by itself. </head><body> and </body></html> are not needed in HTML5, and thus any content may simply be appended to this, without ever closing the document.

You can also add your js, css, favicon and manifest files:

Document(_urls=("style.css", "logo.png", "jquery.js"))

<!DOCTYPE html>
<link rel=stylesheet href="style.css">
<link rel=icon href="logo.png", type="image/png">
<script src="jquery.js"></script>

Nesting

Explicit nesting needs to be used for elements such as table and ul where contents may be provided as sub-snippet parameters, or by with blocks:

doc = Document("Test page", lang="en")
with doc.ul:  # Nest using the with statement
    doc.li("Write documents in Python").li("Simple syntax")
    with doc.ul:
        doc.li("No brackets or closing tags").li("Integrates with other code")
        doc.ul(E.li("Easy").li("Efficient"))  # Nest using (...)
    doc.li("Avoids whitespace problems common in templating")

Output formatted for readability:

<!DOCTYPE html>
<html lang=en>
  <meta charset="utf-8">
  <title>Test page</title>
  <ul>
    <li>Write documents in Python
    <li>Simple syntax
      <ul>
        <li>No brackets or closing tags
        <li>Integrates with other code
          <ul>
            <li>Easy
            <li>Efficient
          </ul>
      </ul>
    <li>Avoids whitespace problems common in templating
  </ul>

Escaping

All content and attributes are automatically escaped. For instance, we can put the entire document into an iframe's srcdoc attribute where only the minimal but necessary escaping is applied:

E.iframe(srcdoc=doc)

<iframe srcdoc="<!DOCTYPE html><html lang=en><meta charset=&quot;utf-8&quot;><title>Test page</title><ul><li>Write documents in Python<li>Simple syntax<ul><li>No brackets or closing tags<li>Integrates with other code<ul><li>Easy<li>Efficient</ul></ul><li>Avoids whitespace problems common in templating</ul>"></iframe>

Name mangling and boolean attributes

Underscore at the end of name is ignored so that Python's reserved names such as for can be specified. Other underscores convert into hyphens.

Boolean values convert into short attributes.

E.input(type="checkbox", id="somebox", checked=True).label(for_="somebox", aria_role="img")("🥳")

<input type=checkbox id=somebox checked><label for=somebox aria-role=img>🥳</label>

Performance

%timeit str(Document("benchmarking", lang="en", _urls=("foo.js", "bar.js")))

35.7 µs ± 1.11 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

Unless you are creating very large documents, this should be quite fast enough.

Traditional web frameworks like Django and Flask are probably much slower. Sanic users might need to optimise some more to stay above 20000 req/s or so.

Further development

There have been no changes to the tagging API since 2018 when this module was brought to production use, and thus the interface is considered stable.

If there is need, a future version of this module may support templating where a document is baked into a list of string snippets, where dynamic content may be injected much faster than what Jinja2 and other regex-based templating engines can do. Other than that, no further development other than maintenance is planned.

Pull requests are still welcome.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: MIT License
- Public Domain
Operating System
- OS Independent
Programming Language
Topic
- Text Processing :: Markup :: HTML

Release history Release notifications | RSS feed

1.3.0

Mar 28, 2023

1.2.1

Feb 5, 2023

1.2.0

Feb 5, 2023

1.1.0

Mar 19, 2020

1.0.2

Mar 19, 2020

1.0.1

Mar 19, 2020

This version

1.0.0

Mar 13, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

html5tagger-1.0.0.tar.gz (4.1 kB view hashes)

Uploaded Mar 13, 2020 Source

Built Distribution

html5tagger-1.0.0-py3-none-any.whl (3.6 kB view hashes)

Uploaded Mar 13, 2020 Python 3

Hashes for html5tagger-1.0.0.tar.gz

Hashes for html5tagger-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`01fa1220535e711af3893ebe921a571a6a5be32cff2b4d1aff5047f602a46524`
MD5	`d2b69b94ce1af3b3975c837a87293fec`
BLAKE2b-256	`617bd5d60c8515bafb7eb6202eab6382672752fad0e37364dd1dfe444f2475c9`

Hashes for html5tagger-1.0.0-py3-none-any.whl

Hashes for html5tagger-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7172694591f4255b1c3da3429784e6884fbfd6c671083814b5b8ad7ff25cdfd6`
MD5	`18d811006d004a4d25e0c4deeb7e4234`
BLAKE2b-256	`5cfa0641aa1f89d7fd60d904bf96578d6468db07eba7da3115ea7b4d7d3baf2f`