luqum

A Lucene query parser generating ElasticSearch queries and more !

These details have not been verified by PyPI

Project links

Homepage

Project description

luqum - A lucene query parser in Python, using PLY

logo

“luqum” (as in LUcene QUery Manipolator) is a tool to parse queries written in the Lucene Query DSL and build an abstract syntax tree to inspect, analyze or otherwise manipulate search queries.

It enables enriching the Lucene Query DSL meanings (for example to support nested object searches or have particular treatments on some fields), and transform lucene DSL queries to native ElasticSearch JSON DSL

Thanks to luqum, your users may continue to write queries like: author.last_name:Smith OR author:(age:[25 TO 34] AND first_name:John) and you will be able to leverage ElasticSearch query DSL, and control the precise meaning of each search terms.

Luqum is dual licensed under Apache2.0 and LGPLv3.

Compatible with Python 3.10+

Installation

pip install luqum

Dependencies

PLY >= 3.11

Full documentation

http://luqum.readthedocs.org/en/latest/

Changelog for luqum

The format is based on Keep a Changelog and this project tries to adhere to Semantic Versioning.

1.0.0 - 2025-02-18

Fixed

parsetab.py file
PyPI distribution procedure for devs

Added

Requirements for build and twine for devs

0.14.0 - 2025-02-17

Added

Support for Python 3.11, 3.12 and 3.13 (#97 and #111, thanks to @cclauss and @alexgarel)
Support for negative values in range (#101, thanks to @mesemus)
Docs on OpenRangeTransformer (#110, thanks to @alexgarel)

Removed

Support for Python 3.6, 3.7, 3.8 and 3.9 (#111, thanks to @alexgarel)

Fixed

Improved global state management for lexer (#109, thanks to @Morikko)
Recognize E_NESTED override (#103, thanks to @maksle)

0.13.0 - 2023-03-24

Added

Add support for unbounded ranges

Support is added for open ranges, i.e. inequality operators in front of a term. In tree form, the < is named To, and > is named From.

Additionally, a TreeTransformer is also added, to convert these open ranges to more traditional Range objects.

To properly support escaping, some adjustments were made to how escaping sequences work. After careful evaluation of how Apache Lucene handles escape sequences, it appears that random characters can be escaped, even if they result in unknown escape sequences: the escaped character is always yielded. This makes support for operations such as <=foo a lot less complicated.

There is no support in the ElasticsearchQueryBuilder.

0.12.1 - 2023-02-08

Fixed

Precedence for unknown operation and boost (#89, thanks to @JSCU-CNI)

0.12.0 - 2022-10-13

Changed

Boost can be implicit ; by default, the boost factor is 1

Added

Add support for Lucene and Elasticsearch Boolean operations (#71, thanks to @linefeedse):
- Introduce the BooleanOperation
- add its resolution in ElasticSearch transformer
- add it as a possible resolver for the unknown operation (no explicit operator in query)
Set E element as ElasticsearchQueryBuilder’s attributes (#75, thanks to @qcoumes):

This allows to override elements such as EMust, EWord, …, without the need of overriding ElasticsearchQueryBuilder’s methods.
Explicit support for Python 3.9 and Python 3.10 (#76)
Add a thread safe parse function (#82)

Fixed

Cast TokenValue.__str__ return value to string (#74, thanks to @delkopiso)
Isolated comma should be parsed as a Word (#80)
Better handling of escaped wildcards

Docs

Add boolean operation to doc
Fix quick start documentation
Updated readthedocs instructions

CI

Run tests with github actions
Update all libraries for dev:
- switch from nose to pytest as nose is not python3.10 compatible
- remove old travis tests

0.11.0 - 2021-01-06

Changed

completely modified the naming module and auto_name function, as it was not practical as is.

Added

added tools to build visual explanations about why a request matches a results (leveraging elasticsearch named queries.
added a visitor and transformer that tracks path to element while visiting the tree.

Fixed

fixed the handling of names when transforming luqum tree to elasticsearch queries and added integration tests.

0.10.0 - 2020-09-22

Added

support for parsing Regular expressions like /foo/ (no transformation to Elasticsearch DSL yet)
basic support for head and tail of expressions (the separators) and for their position (pos and size) in original text
added auto_head_tail util (use it if you build your tree programatically and want a printable representation)
tree item now support a clone_item method and a setter for children. This should help with making transformation pattern easier.
New visitor.TreeVisitor and visitor.TreeTransformer classes to help in processing trees utils.LuceneTreeVisitor, utils.LuceneTreeVisitorV2 and utils.LuceneTreeTransformer are warned as deprecated (but still works).

Changed

support for python 3.8 added, support for python 3.4 and 3.5 dropped
better printing of Proximity and Fuzzy items (preserve implicit nature of degree)
raise IllegalCharacterError on illegal character found instead of printing and skipping
renamed ParseError to ParseSyntaxError, and kept ParseError as a parent exception

Fixed

Range item were not checking for bounds type on equality
Boost item were not checking for force on equality
Reorganize tests

0.9.0 - 2020-07-29

Added

support for elasticsearch 7

0.8.1 - 2019-11-01

Added

added Apache 2 license, while maintaining LGPLv3+

0.8.0 - 2019-08-02

Added

support for multi_match query in ElasticsearchQueryBuilder.

Fixed

SchemaAnalyzer, should count non text fields as not_analyzed
ElasticsearchQueryBuilder’s field_options parameter can accept match_type instead of type to change request type. This is now the prefered way over type which may more easily conflict with request parameters.

0.7.5 - 2018-10-29

Added

handling sub fields (aka multi-fields)

Fixed

fixed bug on equality, having more children in one tree than in the other, was not triggering inequality if first nodes were the same !

0.7.4 - 2018-08-28

Added

handling special characters escaping
added iter_wildcards and split_wildcards to have a finer grained search of wildcard in terms

Fixed

fixed bug in luqum.utils.LuceneTreeTransformer when removing node
fixed bug in handling approx operator on multiple words in luqum.elasticsearch.visitor.ElasticsearchQueryBuilder
test coverage now check branch

0.7.3 - 2018-06-08

Fixed

On ElasticSearch query transformation, Luqum was interpreting wildcards in Phrases where as it should not

0.7.2 - 2018-05-14

Fixed

adding the zero_terms_query to match_phrase was a mistake (introduced in 0.7.0).

Added

0.7.0 introduced the match query for queries with single words on analyzed fields, whereas before it was using match_phrase. Although this is more coherent, this may cause difficulties on edge cases as this may leads to results different from previous release.

This behaviour might be disabled using a new match_word_as_phrase parameter to luqum.elasticsearch.visitor.ElasticsearchQueryBuilder. Note that this parameter maybe removed in future release. (the field_options might be used instead on a per field basis).

0.7.1 - 2018-03-20

Fixed

version introduced because of a bad upload on pypi (Restructured description error)

0.7.0 - 2018-03-20

Added

Support for named queries (see elastic named queries)
Helper to automatically create ElasticSearch query builder options from the index configuration, see: luqum.elasticsearch.schema
a new arg field_options on luqum.elasticsearch.visitor.ElasticsearchQueryBuilder allows to add parameters to field queries. It also permits to control the type of query for match queries.
now for a query with a single word, if the field is analyzed, the transformation to elastic search query will use a “match” query instead of a “match_phrase”. This is more conform in behaviour to what the expression of “query_string” would produce.

Fixed

small fix in utils.TreeTransformerV2, which was not removing elements from lists or tuple as stated
single word matches, are now match, and not match_phrase
match_phrase has the zero_terms_query field, as for match

Changed

dropped official Python 3.3 support

0.6.0 - 2017-12-12

Added

Manage object fields in elasicsearch transformation

Fixed

minor fix, getting better error message when parsing error is at the end of content

Changed

better handling of nested fields may lead to shorter requests

0.5.3 - 2017-08-21

Added

A class to transform smartly replace implicit operations with explicit one (OR or AND)

Fixed

handling of fields names with numbers followed by a number (better handling of time in expressions)

Changed

now using ply 3.10

0.5.2 - 2017-05-29

Changed

better recursion in the tree transformer util (API Change)

Fixed

handling of empty phrases for elasticsearch query builder

0.5.1 - 2017-04-10

a minor release

Changed

Better handling of the implicit operator on printing

0.5.0 - 2017-04-04

Changed

Operations are now supporting multiple operands (instead of only two). This mitigate the construction of very deep trees.

Fixed

fixes and improvement of documentation

0.4.0 - 2016-12-05

Changed

The Lucene query checker now checks nested fields before transformation to prevent bad usage

0.3.1 - 2016-11-23

Added

Support for nested fields in Elastic Search queries

Changed

improved performances by adding a cache to the tree visitor utility

0.3 - 2016-11-21

(Note that 0.2 version was skipped)

Added

Transforming Lucene queries to Elastic Search queries
Added a new tree visitor TreeVisitorV2 more easy to use

Fixed

Improved first tree visitor utility and its tests (API Change)

0.1 - 2016-05-17

This was the initial release of Luqum.

Added

the parser and the tree structure
the visitor and transformer utils
the Lucene query consistency checker
the prettify for pretty printing

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.0.0

Feb 18, 2025

0.14.0

Feb 17, 2025

0.13.0

Mar 24, 2023

0.12.1

Feb 8, 2023

0.12.0

Oct 14, 2022

0.11.0

Jan 6, 2021

0.10.0

Sep 22, 2020

0.9.0

Jul 29, 2020

0.8.1

Nov 1, 2019

0.8.0

Aug 2, 2019

0.7.5

Oct 29, 2018

0.7.4

Aug 28, 2018

0.7.3

Jun 8, 2018

0.7.2

May 14, 2018

0.7.1

Mar 20, 2018

0.6.1

Jan 3, 2018

0.6.0

Dec 12, 2017

0.5.3

Aug 21, 2017

0.5.2

May 29, 2017

0.5.1

Apr 10, 2017

0.5.0

Apr 4, 2017

0.4.0

Dec 5, 2016

0.3.1

Nov 23, 2016

0.3.0

Nov 21, 2016

0.1.0

Jan 15, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

luqum-1.0.0.tar.gz (76.0 kB view details)

Uploaded Feb 18, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

luqum-1.0.0-py3-none-any.whl (55.3 kB view details)

Uploaded Feb 18, 2025 Python 3

File details

Details for the file luqum-1.0.0.tar.gz.

File metadata

Download URL: luqum-1.0.0.tar.gz
Upload date: Feb 18, 2025
Size: 76.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for luqum-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`3e7bf3a94eaf8dc936c63de3019b306ee96e63575bc19372dad56114b194f8e0`
MD5	`f3e349b5066b71f1018a1f36072b1ccc`
BLAKE2b-256	`47eb740c176edbfcc8a207f9b4af0476e9ad07b96481e7dc37b03e47669dc208`

See more details on using hashes here.

File details

Details for the file luqum-1.0.0-py3-none-any.whl.

File metadata

Download URL: luqum-1.0.0-py3-none-any.whl
Upload date: Feb 18, 2025
Size: 55.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for luqum-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`25e8723aa7b4a522f296eaf8553f7c887b75a29cc92479293cb44333a3b0bc2b`
MD5	`66a963ec269e7e64bdaa9727e538a054`
BLAKE2b-256	`9225cfcd4c64369b24b5bf28ccada686f228440897bf001bc165c0dd4f8aebe3`

See more details on using hashes here.

luqum 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

luqum - A lucene query parser in Python, using PLY

Installation

Dependencies

Full documentation

Changelog for luqum

1.0.0 - 2025-02-18

Fixed

Added

0.14.0 - 2025-02-17

Added

Removed

Fixed

0.13.0 - 2023-03-24

0.12.1 - 2023-02-08

0.12.0 - 2022-10-13

Changed

Added

Fixed

Docs

CI

0.11.0 - 2021-01-06

Changed

Added

Fixed

0.10.0 - 2020-09-22

Added

Changed

Fixed

0.9.0 - 2020-07-29

0.8.1 - 2019-11-01

0.8.0 - 2019-08-02

Added

Fixed

0.7.5 - 2018-10-29

Added

Fixed

0.7.4 - 2018-08-28

Added

Fixed

0.7.3 - 2018-06-08

0.7.2 - 2018-05-14

Fixed

Added

0.7.1 - 2018-03-20

0.7.0 - 2018-03-20

Added

Fixed

Changed

0.6.0 - 2017-12-12

Added

Fixed

Changed

0.5.3 - 2017-08-21

Added

Fixed

Changed

0.5.2 - 2017-05-29

Changed

Fixed

0.5.1 - 2017-04-10

Changed

0.5.0 - 2017-04-04

Changed

Fixed

0.4.0 - 2016-12-05

0.3.1 - 2016-11-23

Added

Changed

0.3 - 2016-11-21

Added

Fixed