Extract recipes from web pages that use JSON-LD structured data and output Markdown, LaTeX, and PDF.
Project description
Recipy
Recipy extracts recipes from web pages using JSON-LD and converts them into Python objects. It also supports generating Markdown, LaTeX, and PDFs.
from recipy.json_ld import recipe_from_url
recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")
if recipe:
print(recipe.model_dump_json(indent=2))
Installation
Install via pip
pip install python-recipy
Install texlive
for PDF Generation
Debian/Ubuntu
sudo apt install texlive
macOS
brew install texlive
Examples
Load Recipe from JSON-LD
from recipy.json_ld import recipe_from_json
json_data = '''
{
"name": "Tomato Basil Salad",
"description": "A simple and fresh tomato basil salad.",
"recipeIngredient": [
"2 ripe tomatoes, sliced",
"1/4 cup fresh basil leaves, torn"
],
"recipeInstructions": [
{
"@type": "HowToSection",
"name": "Making the Salad",
"itemListElement": [
{
"@type": "HowToStep",
"text": "Arrange the tomato slices on a plate."
},
{
"@type": "HowToStep",
"text": "Scatter the torn basil leaves over the tomatoes."
}
]
},
{
"@type": "HowToSection",
"name": "Preparing the Dressing",
"itemListElement": [
{
"@type": "HowToStep",
"text": "In a small bowl, whisk together the olive oil and balsamic vinegar."
},
{
"@type": "HowToStep",
"text": "Drizzle the dressing over the tomatoes and basil before serving."
}
]
}
],
"comment": "Serve immediately for the best flavor."
}
'''
recipe = recipe_from_json(json_data)
if recipe:
print(recipe.model_dump_json(indent=2))
See:
Parse Recipe from Markdown
from recipy.markdown import recipe_from_markdown
markdown_content = """
# Tomato Basil Salad
A simple and fresh tomato basil salad.
## Ingredients
### For the Salad
* 2 ripe tomatoes, sliced
* 1/4 cup fresh basil leaves, torn
### For the Dressing
* 2 tablespoons olive oil
* 1 tablespoon balsamic vinegar
## Instructions
### Making the Salad
1. Arrange the tomato slices on a plate.
2. Scatter the torn basil leaves over the tomatoes.
### Preparing the Dressing
1. In a small bowl, whisk together the olive oil and balsamic vinegar.
2. Drizzle the dressing over the tomatoes and basil before serving.
## Notes
Serve immediately for the best flavor.
"""
recipe = recipe_from_markdown(markdown_content)
if recipe:
print(recipe.model_dump_json(indent=2))
Markdown Structure
- The recipe title must be an H1 (
# Title
). - Ingredients must be under an H2 heading
## Ingredients
, with optional H3 subheadings for ingredient groups. - Instructions must be under an H2 heading
## Instructions
, with optional H3 subheadings for instruction groups. - Notes can be included under an H2 heading
## Notes
.
Convert Recipe to JSON-LD
from recipy.json_ld import recipe_from_url, recipe_to_json
recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")
if recipe:
json_data = recipe_to_json(recipe)
print(json_data)
Convert Recipe to PDF
from recipy.json_ld import recipe_from_url
from recipy.pdf import recipe_to_pdf, PdfOptions
recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")
if recipe:
pdf_options = PdfOptions(reproducible=True)
pdf_content = recipe_to_pdf(recipe, pdf_options=pdf_options)
with open("recipe.pdf", "wb") as f:
f.write(pdf_content)
Convert Recipe to LaTeX
from recipy.json_ld import recipe_from_url
from recipy.latex import recipe_to_latex, LatexOptions
recipe = recipe_from_url("https://www.allrecipes.com/recipe/14231/guacamole/")
if recipe:
latex_options = LatexOptions(main_font="Liberation Serif", heading_font="Liberation Sans")
latex_content = recipe_to_latex(recipe, options=latex_options)
print(latex_content)
Recipe Model
from recipy.models import Recipe, IngredientGroup, InstructionGroup, Review, Meta, Rating
recipe = Recipe(
title="Tomato Basil Salad",
description="A simple, fresh salad perfect for summer.",
ingredient_groups=[
IngredientGroup(
title="For the Salad",
ingredients=[
"2 ripe tomatoes, sliced",
"1/4 cup fresh basil leaves, torn"
]
),
IngredientGroup(
title="For the Dressing",
ingredients=[
"2 tablespoons olive oil",
"1 tablespoon balsamic vinegar"
]
)
],
instruction_groups=[
InstructionGroup(
title="Making the Salad",
instructions=[
"Arrange the tomato slices on a plate.",
"Scatter the torn basil leaves over the tomatoes."
]
),
InstructionGroup(
title="Preparing the Dressing",
instructions=[
"In a small bowl, whisk together the olive oil and balsamic vinegar.",
"Drizzle the dressing over the tomatoes and basil before serving."
]
)
],
notes="Serve immediately for the best flavor.",
reviews=[
Review(
author="Jane Doe",
body="This salad is so fresh and delicious!",
rating=4.5
),
Review(
author="John Smith",
body="Simple yet tasty. I added some mozzarella for extra flavor.",
rating=4.0
)
],
image_urls=[
"https://example.com/tomato_basil_salad_small.jpg",
"https://example.com/tomato_basil_salad_medium.jpg",
"https://example.com/tomato_basil_salad_large.jpg"
],
rating=Rating(value=4.3, count=28),
meta=Meta(
prep_time_minutes=10,
cook_time_minutes=0,
total_time_minutes=10,
recipe_yield="2 servings"
)
)
print(recipe.model_dump_json(indent=2))
Supported Features by Format
JSON-LD | Markdown | LaTeX | |||
---|---|---|---|---|---|
Feature | Input | Output | Input | Output | Output |
Title | ✅ | ✅ | ✅ | ✅ | ✅ |
Description | ✅ | ✅ | ✅ | ✅ | ✅ |
Ingredient Groups | ❌ | ❌ | ✅ | ✅ | ✅ |
Ingredients | ✅ | ✅ | ✅ | ✅ | ✅ |
Instruction Groups | ✅ | ✅ | ✅ | ✅ | ✅ |
Instructions | ✅ | ✅ | ✅ | ✅ | ✅ |
Images | ✅ | ✅ | ✅ | ✅ | ❌ |
Rating | ✅ | ✅ | ❌ | ❌ | ❌ |
Reviews | ✅ | ✅ | ❌ | ❌ | ❌ |
Metadata | ✅ | ✅ | ❌ | ❌ | ❌ |
Notes | ✅ | ✅ | ✅ | ✅ | ✅ |
License
Permission to use, copy, modify, and/or distribute this software for
any purpose with or without fee is hereby granted.
THE SOFTWARE IS PROVIDED “AS IS” AND THE AUTHOR DISCLAIMS ALL
WARRANTIES WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES
OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE
FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY
DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN
AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT
OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
python_recipy-0.1.14.tar.gz
(319.3 kB
view details)
Built Distribution
File details
Details for the file python_recipy-0.1.14.tar.gz
.
File metadata
- Download URL: python_recipy-0.1.14.tar.gz
- Upload date:
- Size: 319.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 92b97d889cc6a77c070542049f6a1f22cb3d9e47271fb2989a04c71bfbbbc938 |
|
MD5 | b6d215f4f3a5c248839d3438084604ec |
|
BLAKE2b-256 | cf1ecd538c716998f6fc39100643e871eb1a7c624d01c4a0534b157906b8d4b9 |
File details
Details for the file python_recipy-0.1.14-py3-none-any.whl
.
File metadata
- Download URL: python_recipy-0.1.14-py3-none-any.whl
- Upload date:
- Size: 14.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cbfb16d1b5b8dc93133af9e231e38ebcc69eb6d9768c7fc4d8b16506c610b2f0 |
|
MD5 | ce30458e71165552ec9b70b8f8f12dd5 |
|
BLAKE2b-256 | da4e1eb98f67f3408e88c0a98005cd300a4d29221ca36e73818e8a19fd6deb82 |