A Python package to parse structured information from recipe ingredient sentences

Development Status
- 5 - Production/Stable
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Text Processing :: Linguistic

Project description

Ingredient Parser

The Ingredient Parser package is a Python package for parsing structured information out of recipe ingredient sentences.

Documentation

Documentation on using the package and training the model can be found at https://ingredient-parser.readthedocs.io/.

Quick Start

Install the package using pip

$ python -m pip install ingredient-parser-nlp

Import the parse_ingredient function and pass it an ingredient sentence.

>>> from ingredient_parser import parse_ingredient
>>> parse_ingredient("3 pounds pork shoulder, cut into 2-inch chunks")
ParsedIngredient(
    name=IngredientText(text='pork shoulder', confidence=0.999193),
    size=None,
    amount=[IngredientAmount(quantity='3',
                             unit=<Unit('pound')>,
                             text='3 pounds',
                             confidence=0.999906,
                             APPROXIMATE=False,
                             SINGULAR=False)],
    preparation=IngredientText(text='cut into 2 inch chunks', confidence=0.999193),
    comment=None,
    purpose=None,
    sentence='3 pounds pork shoulder, cut into 2-inch chunks'
)

Refer to the documentation for the optional parameters that can be used with parse_ingredient.
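The fields in the output above can be read as ordinary attributes. The following is a minimal sketch, not part of the package: summarize is a hypothetical helper, and the stand-in object built with SimpleNamespace only mirrors the field names shown in the example output (name, amount, preparation, comment), so the snippet runs without the package installed. With the package installed, passing the result of parse_ingredient to summarize should behave the same way, assuming the field layout shown above.

```python
from types import SimpleNamespace


def summarize(parsed):
    """Flatten a ParsedIngredient-like object into a plain dict (hypothetical helper)."""

    def text_of(field):
        # Fields such as comment are None when nothing was found.
        return field.text if field is not None else None

    return {
        "name": text_of(parsed.name),
        # str() is used so the unit is readable whether it is a string or a unit object.
        "amounts": [(a.quantity, str(a.unit)) for a in parsed.amount],
        "preparation": text_of(parsed.preparation),
        "comment": text_of(parsed.comment),
    }


# Stand-in that copies the values from the example output above.
example = SimpleNamespace(
    name=SimpleNamespace(text="pork shoulder", confidence=0.999193),
    amount=[
        SimpleNamespace(quantity="3", unit="pound", text="3 pounds", confidence=0.999906)
    ],
    preparation=SimpleNamespace(text="cut into 2 inch chunks", confidence=0.999193),
    comment=None,
)

summary = summarize(example)
print(summary)
```

Checking each optional field for None before reading its text avoids an AttributeError on sentences that have no preparation or comment.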

Model

The core of the library is a sequence labelling model that labels each token in the sentence with the part of the sentence it belongs to. A data set of 75,000 example sentences is used to train and evaluate the model. See the Model Guide in the documentation for more details.
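To illustrate what token-level labelling means, the sketch below groups consecutive tokens that share a label into spans. The tokens and labels are assigned by hand for illustration, the label names (QTY, UNIT, NAME, PUNC, PREP) are assumptions, and group_by_label is a hypothetical helper. It does not reproduce the package's tokenizer, model, or post-processing.

```python
from itertools import groupby

# Hand-labelled example; a trained model would predict the labels.
tokens = ["3", "pounds", "pork", "shoulder", ",", "cut", "into", "2-inch", "chunks"]
labels = ["QTY", "UNIT", "NAME", "NAME", "PUNC", "PREP", "PREP", "PREP", "PREP"]


def group_by_label(tokens, labels):
    """Merge runs of consecutive tokens with the same label into (label, text) spans."""
    pairs = zip(labels, tokens)
    return [
        (label, " ".join(token for _, token in run))
        for label, run in groupby(pairs, key=lambda pair: pair[0])
    ]


spans = group_by_label(tokens, labels)
for label, text in spans:
    print(f"{label:5} {text}")
```

Each span corresponds to one structured field: the NAME span becomes the ingredient name and the PREP span becomes the preparation text.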

On a held-out test set made up of 20% of the total data, the model achieves the following accuracy:

Sentence-level results:
	Accuracy: 95.86%

Word-level results:
	Accuracy: 98.41%
	Precision (micro): 98.41%
	Recall (micro): 98.41%
	F1 score (micro): 98.41%
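The four word-level figures are identical because, when every token receives exactly one label, each wrong prediction counts as one false positive (for the predicted label) and one false negative (for the true label), so micro-averaged precision, recall, and F1 all reduce to plain accuracy. The sketch below shows this on toy data. The evaluate function and the label names are illustrative assumptions, and sentence-level accuracy is assumed here to mean the fraction of sentences in which every token is labelled correctly.

```python
def evaluate(true_sentences, pred_sentences):
    """Compute sentence-level accuracy and word-level micro-averaged metrics."""
    correct_sentences = 0
    true_pos = false_pos = false_neg = total_tokens = 0

    for truth, pred in zip(true_sentences, pred_sentences):
        if truth == pred:
            correct_sentences += 1
        for t, p in zip(truth, pred):
            total_tokens += 1
            if t == p:
                true_pos += 1
            else:
                false_pos += 1  # counted against the predicted label
                false_neg += 1  # counted against the true label

    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return {
        "sentence_accuracy": correct_sentences / len(true_sentences),
        "word_accuracy": true_pos / total_tokens,
        "precision_micro": precision,
        "recall_micro": recall,
        "f1_micro": 2 * precision * recall / (precision + recall),
    }


# Toy data: the second sentence has one mislabelled token.
truth = [["QTY", "UNIT", "NAME", "NAME"], ["QTY", "NAME", "COMMENT"]]
predicted = [["QTY", "UNIT", "NAME", "NAME"], ["QTY", "NAME", "PREP"]]

metrics = evaluate(truth, predicted)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

One wrong token out of seven lowers word-level accuracy only slightly but makes the whole sentence count as wrong, which is why the sentence-level figure is lower than the word-level one.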

Development

The development dependencies are in the requirements-dev.txt file. Details on the training process can be found in the Model Guide documentation.

Before committing anything, install pre-commit and run

$ pre-commit install

to install the pre-commit hooks.

There is a simple web app for testing the parser with ingredient sentences and showing the parsed output. To run the web app, run the command

$ flask --app webapp run

This requires the development dependencies to be installed.

The dependencies for building the documentation are in the requirements-doc.txt file.

Release history

- 1.1.2: Aug 23, 2024 (this version)
- 1.1.1: Aug 16, 2024
- 1.1.0: Aug 15, 2024
- 1.0.1: Aug 10, 2024
- 1.0.0: Jun 17, 2024
- 0.1.0b11 (pre-release): May 27, 2024
- 0.1.0b10 (pre-release): Apr 12, 2024
- 0.1.0b9 (pre-release): Apr 6, 2024
- 0.1.0b8 (pre-release): Jan 27, 2024
- 0.1.0b7 (pre-release): Nov 21, 2023
- 0.1.0b6 (pre-release): Oct 24, 2023
- 0.1.0b5 (pre-release): Sep 16, 2023
- 0.1.0b4 (pre-release): Aug 16, 2023
- 0.1.0b3 (pre-release): Jul 18, 2023
- 0.1.0b2 (pre-release, yanked): Jul 18, 2023. Reason this release was yanked: incorrectly marked as compatible with Python 3.9
- 0.1.0b1 (pre-release): Apr 8, 2023
- 0.1.0a5 (pre-release): Feb 25, 2023
- 0.1.0a4 (pre-release): Dec 22, 2022
- 0.1.0a3 (pre-release): Oct 2, 2022
- 0.1.0a2 (pre-release): Sep 12, 2022
- 0.1.0a1 (pre-release): Sep 4, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ingredient_parser_nlp-1.1.2.tar.gz (588.6 kB)

Uploaded Aug 23, 2024 Source

Built Distribution

ingredient_parser_nlp-1.1.2-py3-none-any.whl (587.6 kB)

Uploaded Aug 23, 2024 Python 3

Hashes for ingredient_parser_nlp-1.1.2.tar.gz
Algorithm	Hash digest
SHA256	`67502adabec3a946229276e89bce3dc1f8e782005fad61063e61f28bd74a7501`
MD5	`f7bcda23b1a485b1f1bf3ec7e354ccc5`
BLAKE2b-256	`2c2e7e800d03c65d284220d7548502145d24b52df418b15ce783d54f8b61db5c`

Hashes for ingredient_parser_nlp-1.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6124b08c366d73378c0bdc96e067e2b589212c68ad748afac04d55e90df57076`
MD5	`3bb5272c9e920c7e4173996c2e0da565`
BLAKE2b-256	`b80a22b5ee16ae3f8ad91042bfcd80efbf9b3e66136d37a283f271a27d0c2de7`