Skip to main content

Extract recipes from web pages that use JSON-LD structured data and output Markdown, LaTeX, and PDF.

Project description

Recipy

Recipy extracts recipes from web pages using JSON-LD and converts them into Python objects. It also supports generating Markdown, LaTeX, and PDFs.

from recipy.json_ld import recipe_from_url

recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")

if recipe:
    print(recipe.model_dump_json(indent=2))

Installation

Install via pip

pip install python-recipy

Install texlive for PDF Generation

Debian/Ubuntu

sudo apt install texlive

macOS

brew install texlive

Examples

Load Recipe from JSON-LD

from recipy.json_ld import recipe_from_json

json_data = '''
{
    "name": "Tomato Basil Salad",
    "description": "A simple and fresh tomato basil salad.",
    "recipeIngredient": ["2 ripe tomatoes, sliced", "1/4 cup fresh basil leaves, torn"],
    "recipeInstructions": [
        {
            "@type": "HowToSection",
            "name": "Making the Salad",
            "itemListElement": [
                {"@type": "HowToStep", "text": "Arrange the tomato slices on a plate."},
                {"@type": "HowToStep", "text": "Scatter the torn basil leaves over the tomatoes."}
            ]
        },
        {
            "@type": "HowToSection",
            "name": "Preparing the Dressing",
            "itemListElement": [
                {"@type": "HowToStep", "text": "In a small bowl, whisk together the olive oil and balsamic vinegar."},
                {"@type": "HowToStep", "text": "Drizzle the dressing over the tomatoes and basil before serving."}
            ]
        }
    ],
    "comment": "Serve immediately for the best flavor."
}
'''

recipe = recipe_from_json(json_data)

if recipe:
    print(recipe.model_dump_json(indent=2))

See:

Parse Recipe from Markdown

from recipy.markdown import recipe_from_markdown

markdown_content = """
# Tomato Basil Salad

A simple and fresh tomato basil salad.

![Tomato Basil Salad](https://example.com/tomato_basil_salad.jpg)

## Ingredients

### For the Salad

* 2 ripe tomatoes, sliced
* 1/4 cup fresh basil leaves, torn

### For the Dressing

* 2 tablespoons olive oil
* 1 tablespoon balsamic vinegar

## Instructions

### Making the Salad

1. Arrange the tomato slices on a plate.
2. Scatter the torn basil leaves over the tomatoes.

### Preparing the Dressing

1. In a small bowl, whisk together the olive oil and balsamic vinegar.
2. Drizzle the dressing over the tomatoes and basil before serving.

## Notes

Serve immediately for the best flavor.
"""

recipe = recipe_from_markdown(markdown_content)

if recipe:
    print(recipe.model_dump_json(indent=2))

Markdown Structure

  • The recipe title must be an H1 (# Title).
  • An optional description can be included after the title. It must be a single paragraph.
  • An optional image can be included after the description.
  • Ingredients must be under an H2 heading ## Ingredients, with optional H3 subheadings for ingredient groups.
  • Instructions must be under an H2 heading ## Instructions, with optional H3 subheadings for instruction groups.
  • Notes can be included under an H2 heading ## Notes.

Convert Recipe to JSON-LD

from recipy.json_ld import recipe_from_url, recipe_to_json

recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")

if recipe:
    json_data = recipe_to_json(recipe)
    print(json_data)

Convert Recipe to LaTeX

from recipy.json_ld import recipe_from_url
from recipy.latex import recipe_to_latex

recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")

if recipe:
    latex_content = recipe_to_latex(recipe)
    print(latex_content)

Advanced LaTeX Options

from recipy.json_ld import recipe_from_url
from recipy.latex import LatexOptions, recipe_to_latex

recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")

if recipe:
    latex_options = LatexOptions(
        main_font="TeX Gyre Termes",
        heading_font="TeX Gyre Heros"
    )
    latex_content = recipe_to_latex(recipe, latex_options)
    print(latex_content)

Convert Recipe to PDF

from recipy.json_ld import recipe_from_url
from recipy.pdf import recipe_to_pdf

recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")

if recipe:
    pdf_content = recipe_to_pdf(recipe)
    with open("recipe.pdf", "wb") as f:
        f.write(pdf_content)

Advanced PDF Options

from recipy.json_ld import recipe_from_url
from recipy.latex import LatexOptions
from recipy.pdf import recipe_to_pdf, PdfOptions

recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")

if recipe:
    latex_options = LatexOptions(
        main_font="Source Serif Pro",
        heading_font="Source Sans Pro"
    )
    pdf_options = PdfOptions(
        reproducible=True,
        source_date_epoch="0" 
    )
    pdf_content = recipe_to_pdf(recipe, latex_options, pdf_options)
    with open("recipe.pdf", "wb") as f:
        f.write(pdf_content)

Recipe Model

from recipy.models import Recipe, IngredientGroup, InstructionGroup, Review, Meta, Rating

recipe = Recipe(
    title="Tomato Basil Salad",
    description="A simple, fresh salad perfect for summer.",
    ingredient_groups=[
        IngredientGroup(
            title="For the Salad",
            ingredients=[
                "2 ripe tomatoes, sliced",
                "1/4 cup fresh basil leaves, torn"
            ]
        ),
        IngredientGroup(
            title="For the Dressing",
            ingredients=[
                "2 tablespoons olive oil",
                "1 tablespoon balsamic vinegar"
            ]
        )
    ],
    instruction_groups=[
        InstructionGroup(
            title="Making the Salad",
            instructions=[
                "Arrange the tomato slices on a plate.",
                "Scatter the torn basil leaves over the tomatoes."
            ]
        ),
        InstructionGroup(
            title="Preparing the Dressing",
            instructions=[
                "In a small bowl, whisk together the olive oil and balsamic vinegar.",
                "Drizzle the dressing over the tomatoes and basil before serving."
            ]
        )
    ],
    notes="Serve immediately for the best flavor.",
    reviews=[
        Review(
            author="Jane Doe",
            body="This salad is so fresh and delicious!",
            rating=4.5
        ),
        Review(
            author="John Smith",
            body="Simple yet tasty. I added some mozzarella for extra flavor.",
            rating=4.0
        )
    ],
    image_urls=[
        "https://example.com/tomato_basil_salad_small.jpg",
        "https://example.com/tomato_basil_salad_medium.jpg",
        "https://example.com/tomato_basil_salad_large.jpg"
    ],
    rating=Rating(value=4.3, count=28),
    meta=Meta(
        prep_time_minutes=10,
        cook_time_minutes=0,
        total_time_minutes=10,
        recipe_yield="2 servings"
    )
)

print(recipe.model_dump_json(indent=2))

Supported Features by Format

JSON-LD Markdown LaTeX
Feature Input Output Input Output Output
Title
Description
Ingredient Groups
Ingredients
Instruction Groups
Instructions
Images
Rating
Reviews
Metadata
Notes

License

Permission to use, copy, modify, and/or distribute this software for
any purpose with or without fee is hereby granted.

THE SOFTWARE IS PROVIDED “AS IS” AND THE AUTHOR DISCLAIMS ALL
WARRANTIES WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES
OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE
FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY
DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN
AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT
OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python_recipy-0.1.10.tar.gz (320.5 kB view details)

Uploaded Source

Built Distribution

python_recipy-0.1.10-py3-none-any.whl (14.7 kB view details)

Uploaded Python 3

File details

Details for the file python_recipy-0.1.10.tar.gz.

File metadata

  • Download URL: python_recipy-0.1.10.tar.gz
  • Upload date:
  • Size: 320.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for python_recipy-0.1.10.tar.gz
Algorithm Hash digest
SHA256 0626920c1e4feeecc1ee9af4592d865cf4856ed945efb6bf3d3b66f72f8acde7
MD5 56e10f1a47021f46effd69fb3077b2cc
BLAKE2b-256 8791a588c0994caa7a5b396e57ab80fba6812eaf757b3784b52097a45ed37470

See more details on using hashes here.

File details

Details for the file python_recipy-0.1.10-py3-none-any.whl.

File metadata

File hashes

Hashes for python_recipy-0.1.10-py3-none-any.whl
Algorithm Hash digest
SHA256 8484394587e8b2f2a2ac5e60c6e382edb6653a10b9194ca0bc1d65b16cdf8bff
MD5 20734c97b5b1b90ba3e3d0897f93a77b
BLAKE2b-256 9ba8b96e0a2fd6542c2ba6025d9b0432e379fc05cebbf54aa6bf2b54762910e0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page