Skip to main content

A Python package for creating HTML documents from rich content: dataframes, plots, images, markdown etc. It provides a high-level, easy-to-use API with reasonable defaults, as well as low-level access for better control.

Project description

rico: rich content to HTML as easy as Doc(df, plot)

rico is a Python package for creating HTML documents from rich content: dataframes, plots, images, markdown etc. It provides a high-level, easy-to-use API with reasonable defaults, as well as low-level access for better control.

Use rico if you want to create an HTML document from objects created in a Python script.

With rico you can avoid:

  • Writing data to intermediate files or a database from a script.
  • Loading data into a Jupyter notebook from intermediate files or a database.
  • Using nbconvert or similar tools for creating HTML files.

More on the topic:

CI Coverage License Version Package Status PyPI Python Versions

Installation

Install the core functionality:

pip install rico

The core functionality has no dependencies other than the standard Python packages. Optional additional dependencies are required to support the following content types:

  • Plots. Altair, Matplotlib Pyplot and Seaborn are currently supported.
  • Markdown.

Install one or several extras to use plots or Markdown in HTML documents.

Altair:

pip install rico[altair]

Markdown:

pip install rico[markdown]

Matplotlib Pyplot:

pip install rico[pyplot]

Seaborn:

pip install rico[seaborn]

Install rico[seaborn] extra only to use the seaborn.objects interface. Otherwise install rico[pyplot] as the old Seaborn plotting functions return Matplotlib Pyplot Axes objects.

All extras:

pip install rico[complete]

User guide

Basic usage

rico provides both declarative and imperative style interfaces.

Declarative style:

import pandas as pd
import rico

df = pd.DataFrame({
    "a": list("CCCDDDEEE"),
    "b": [2, 7, 4, 1, 2, 6, 8, 4, 7],
})
plot = df.plot.scatter(x="a", y="b")

doc = rico.Doc("Hello world!", df, plot, title="My doc")

Imperative style:

doc = rico.Doc(title="My doc")
doc.append("Hello world!", df, plot)

Also imperative style:

doc = rico.Doc(title="My doc")
doc.append("Hello world!")
doc.append(df)
doc.append(plot)

Mix-and-match:

doc = rico.Doc("Hello world!", df, title="My doc")
doc.append(plot)

Serialization

Serialize the document to HTML using str(doc):

with open("doc.html", "w") as f:
    f.write(str(doc))

Implicit serialization:

with open("doc.html", "w") as f:
    print(doc, file=f)

Internally, str(doc) calls doc.serialize() with default parameter values. Call doc.serialize(indent=True) to indent the HTML element tree visually:

with open("doc.html", "w") as f:
    f.write(doc.serialize(indent=True))

Set custom whitespace for indentation using the space parameter:

with open("doc.html", "w") as f:
    f.write(doc.serialize(indent=True, space="    "))

Remove unnecessary whitespace by setting strip to True:

with open("doc.html", "w") as f:
    f.write(doc.serialize(strip=True))

Control the default behavior of str(doc) and doc.serialize() using the global options indent_html, indent_space, and strip_html:

with open("doc.html", "w") as f, rico.config_context(indent_html=True):
    f.write(str(doc))

The default option values are:

  • indent_html = False,
  • indent_space = " ",
  • strip_html = False.

Rich content types

rico automatically recognizes the following content types:

  • rico content classes (subclasses of rico.ContentBase).
  • Plots (Altair, Matplotlib Pyplot, Seaborn).
  • Dataframes and other types with _repr_html_ method.
  • Text.

Use specific classes for plots and texts to change the default behavior:

doc = rico.Doc(
    rico.Text("Hello world!", mono=True),  # The default value is False.
    df,
    rico.Plot(plot, format="png"),  # The default value is "svg".
    title="My doc",
)

The following code gives the same result as the code above:

doc = rico.Doc(title="My doc")
doc.append_text("Hello world!", mono=True)
doc.append(df)
doc.append_plot(plot, format="png")

Some options can be set in the global configuration:

with rico.config_context(text_mono=True, image_format="png"):
    doc = rico.Doc("Hello world!", df, plot, title="My doc")

Use specific classes and methods for other content types:

  • Images: Image or Doc.append_image.
  • Code: Code or Doc.append_code.
  • Markdown: Markdown or Doc.append_markdown.
  • HTML tag: Tag or Doc.append_tag.
  • Raw HTML: HTML or Doc.append_html.

For example:

doc = rico.Doc(
    rico.Markdown("## Dataframe"),
    df,
    rico.Tag("h2", "Plot"),  # An alternative way to add a header.
    plot,
    rico.HTML("<h2>Code</h2>"),  # Another way to add a header.
    rico.Code("print('Hello world!')"),
    title="My doc",
)

The following code gives the same result as the code above:

doc = rico.Doc(title="My doc")
doc.append_markdown("## Dataframe")
doc.append(df)
doc.append_tag("h2", "Plot")
doc.append(plot)
doc.append_html("<h2>Code</h2>")
doc.append_code("print('Hello world!')")

Check the docstrings for details.

Serialize content to HTML using str(object) or object.serialize():

obj = rico.Tag("p", "Hello world!")

print(obj)
# <div><p>Hello world!</p></div>

print(obj.serialize(indent=True, space="    "))
# <div>
#     <p>Hello world!</p>
# </div>

Bootstrap, HTML classes and document layout

By default, Bootstrap styles are included in the document. Change the default behavior using the bootstrap parameter:

doc = rico.Doc("Hello world!", bootstrap="full")
  • Set bootstrap to "css" (default) to include only CSS.
  • Set bootstrap to "full" to include both the CSS and JS.
  • Set bootstrap to "none" to not include Bootstrap*.

*Keep in mind that rico relies on Bootstrap classes and styles. For example:

  • The mono and wrap parameters of the Text class use Bootstrap's font-monospace and font-monospace classes.
  • rico's dataframe style definition uses Bootstrap variables.

Each content element is wrapped in a <div> container. Specify the element's container class using the class_ parameter:

print(rico.Tag("p", "Hello world!", class_="col"))
# <div class="col"><p>Hello world!</p></div>

All elements' containers in the document are also wrapped in a <div> container. Specify the document's container class using the class_ parameter:

doc = rico.Doc("Hello world!", class_="container-fluid")

Define the document layout using Bootstrap and Div class:

doc = rico.Doc(rico.Div(
    rico.Obj(df, class_="col"),
    rico.Obj(plot, class_="col"),
    class_="row row-cols-auto",
))

The code above creates a document with two columns, one with a dataframe and another with a plot. The Obj is a magic class which automatically determines the content type in the same way that Doc and Doc.append do.

Another example:

doc = rico.Doc(
    rico.Tag("h1", "My doc"),
    rico.Tag("h2", "Description"),
    "This is an example of custom document layout using Bootstrap classes.",
    rico.Tag("h2", "Data"),
    rico.Div(
        rico.Obj("Dataframe", df, class_="col"),
        rico.Obj("Plot", plot, class_="col"),
        class_="row row-cols-auto",
    ),
    title="My doc",
)

The following code gives the same result as the code above:

doc = rico.Doc(title="My doc")
doc.append_tag("h1", "My doc")
doc.append_tag("h2", "Description")
doc.append("This is an example of custom document layout using Bootstrap classes.")
doc.append_tag("h2", "Data")
div = rico.Div(class_="row row-cols-auto")
doc.append(div)
div.append("Dataframe", df, class_="col")
div.append("Plot", plot, class_="col")

Keep in mind that obj.append(x, y) works differently than

obj.append(x)
obj.append(y)

The first one wraps both elements in a single <div> container. The second one creates a separate <div> container for each element.

Obj(x, y, class_="z") wraps both x and y elements in a single <div> container with class attribute set to "z".

More on Bootstrap layout and grid system:

Styles and scripts

By default, rico includes the following styles in the document:

  • Bootstrap CSS. Change the default behavior using the bootstrap parameter of the Doc class.
  • Dataframe style. Change it by setting the dataframe_style global option.

Exclude dataframe style from the document by setting dataframe_style to "":

with rico.config_context(dataframe_style=""):
    doc = rico.Doc(df)

Include custom styles and scripts using the Style and Script classes:

dark_theme = "https://cdn.jsdelivr.net/npm/bootswatch@5/dist/darkly/bootstrap.min.css"
jquery = "https://cdn.jsdelivr.net/npm/jquery@3/dist/jquery.min.js"

doc = rico.Doc(
    rico.Text("Click me", class_="click"),
    extra_styles=(
        rico.Style(src=dark_theme),
        rico.Style(".click {color: red;}"),
    ),
    extra_scripts=(
        rico.Script(src=jquery),
        rico.Script(
            "$('p').on('click', function() {alert('Hello world!');})",
            defer=True,
        ),
    ),
)

The defer parameter adds the defer attribute to the <script> tag if the src parameter is used. Otherwise, if the text parameter is used, the script is placed in the footer of the document.

By default, external styles and scripts are included as file links. This means that these files must be available when someone opens the document. Include the contents of these files in the document using the inline parameter:

doc = rico.Doc(
    rico.Text("Click me", class_="click"),
    extra_styles=(
        rico.Style(src=dark_theme, inline=True),
        rico.Style(".click {color: red;}"),
    ),
    extra_scripts=(
        rico.Script(src=jquery, inline=True),
        rico.Script(
            "$('p').on('click', function() {alert('Hello world!');})",
            defer=True,
        ),
    ),
)

In the example above, the Bootstrap styles are still included as a link. Use the global options inline_styles and inline_scripts to include the contents of the style and script files in the document:

with rico.config_context(inline_styles=True, inline_scripts=True):
    doc = rico.Doc(
        rico.Text("Click me", class_="click"),
        extra_styles=(
            rico.Style(src=dark_theme),
            rico.Style(".click {color: red;}"),
        ),
        extra_scripts=(
            rico.Script(src=jquery),
            rico.Script(
                "$('p').on('click', function() {alert('Hello world!');})",
                defer=True,
            ),
        ),
    )

Global configuration

Use global configuration to:

  • Get or set default parameter values.
  • Get or set document properties.

The following global options define the default parameter values:

Global option Parameter Classes, methods, functions
indent_html indent obj.serialize, serialize_html
indent_space space obj.serialize, serialize_html
strip_html strip obj.serialize, serialize_html
text_mono mono Text, obj.append_text
text_wrap wrap Text, obj.append_text
image_format format Plot, obj.append_plot
inline_styles inline Style
inline_scripts inline Script

The following global options define document properties:

  • meta_charset defines a document charset metadata. Set it to "" to disable.
  • meta_viewport defines a document viewport metadata. Set it to "" to disable.
  • bootstrap_css defines a link to the Bootstrap CSS file.
  • bootstrap_js defines a link to the Bootstrap JS file.
  • dataframe_style defines a dataframe style. Set it to "" to disable.

Get a dictionary with global options using get_config without parameters:

global_config = rico.get_config()
print(global_config["indent_html"])
# False

Get a global option value using get_config with the option name as a parameter:

print(rico.get_config("indent_html"))
# False

Set a global option value using set_config:

rico.set_config(indent_html=True)
print(rico.get_config("indent_html"))
# True
rico.set_config(indent_html=False)

Set a global option value within a context using config_context:

with rico.config_context(indent_html=True):
    print(rico.get_config("indent_html"))
    # True

print(rico.get_config("indent_html"))
# False

Low-level control

Internally, rico uses the standard xml.etree.ElementTree module:

  • Every content object (Tag, Text, Div etc.) has a container attribute of type xml.etree.ElementTree.Element. The value is a <div> container element.
  • Doc objects has additional attributes html, head, and body of type xml.etree.ElementTree.Element. They represent the <html>, <head>, and <body> elements, respectively.

Access these attributes and use xml.etree.ElementTree API to gain low-level control over the document and its elements.

Also, rico provides the following functions for working with xml.etree.ElementTree.Element objects:

  • parse_html parses HTML from a string.
  • indent_html indents an HTML element tree visually.
  • strip_html removes unnecessary whitespace.
  • serialize_html serializes xml.etree.ElementTree.Element object.

Check the docstrings for details.

Alternatives

More on the topic:

Roadmap

  • Support most of IPython rich representation methods.
  • Create docs with MkDocs and Material for MkDocs.
  • Create short quick start.
  • Create examples with resulting HTML files.
  • Support math with KaTeX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rico-0.3.1.tar.gz (25.6 kB view hashes)

Uploaded Source

Built Distribution

rico-0.3.1-py3-none-any.whl (16.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page