marqo

The Python client for the S2Search API.

These details have not been verified by PyPI

Project description

Marqo

Neural search for humans

A deep-learning powered, open-source search engine which seamlessly integrates with your applications, websites, and workflow.

Applications built with Marqo enjoy the following features out-of-the-box:

⚡ Performance

Intuitive design pattern for high-performance microservices.
Run Marqo at scale - horizontal scalability by design.
Duplex streaming between client and server.
Async and non-blocking data processing over dynamic flows.

🍳 Ease of use

Set up in three lines.
Plug and play functionality with machine learning models and parsers.

☁️ Cloud-native

Run Marqo in high availability with an OpenSearch backend.
Serverless deployment with Marqo cloud (coming soon!).

Get started

Marqo requires Docker. To install Docker, go to https://docs.docker.com/get-docker/
Use Docker to run OpenSearch:

docker run -p 9200:9200 -p 9600:9600 -e "discovery.type=single-node" opensearchproject/opensearch:2.0.0

Start indexing and searching! Try the example below.

Simple example

Let's look at a simple example below:

import marqo

mq = marqo.Client(url='https://localhost:9200', main_user="admin", main_password="admin")

mq.index("my-first-index").add_documents([
    {
        "Title": "The Travels of Marco Polo",
        "Description": "A 13th-century travelogue describing Polo's travels"
    }, 
    {
        "Title": "Extravehicular Mobility Unit (EMU)",
        "Description": "The EMU is a spacesuit that provides environmental protection, "
                       "mobility, life support, and communications for astronauts",
        "_id": "article_591"
    }]
)

results = mq.index("my-first-index").search(
    q="What is the best outfit to wear on the moon?"
)

mq is the client that wraps the marqo API.
add_documents() takes a list of documents, represented as Python dicts, for indexing.
add_documents() creates an index with default settings if one does not already exist.
You can optionally set a document's ID with the special _id field. Otherwise, marqo will generate one.
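
The two add_documents() behaviours described above (creating the index on first use, and generating an ID when _id is missing) can be pictured with a small in-memory model. This is only a sketch for illustration: ToyIndexStore is a hypothetical class, not part of the marqo client, and the use of UUID strings for generated IDs is an assumption based on the shape of the ID in the sample output.

```python
import uuid


class ToyIndexStore:
    """Hypothetical in-memory stand-in that mimics the documented behaviour."""

    def __init__(self):
        # Maps index name -> {document id -> document body}
        self.indexes = {}

    def add_documents(self, index_name, documents):
        # Create the index on first use if it does not already exist.
        index = self.indexes.setdefault(index_name, {})
        assigned_ids = []
        for doc in documents:
            doc = dict(doc)  # copy so the caller's dict is not mutated
            # Use the caller's _id if present, otherwise generate one
            # (assumption: a random UUID string).
            doc_id = doc.pop("_id", None) or str(uuid.uuid4())
            index[doc_id] = doc
            assigned_ids.append(doc_id)
        return assigned_ids


store = ToyIndexStore()
ids = store.add_documents("my-first-index", [
    {"Title": "The Travels of Marco Polo"},
    {"Title": "Extravehicular Mobility Unit (EMU)", "_id": "article_591"},
])
print(ids[1])  # article_591
```

The first document gets a generated ID, while the second keeps the ID supplied by the caller.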

Pretty printing results outputs a dict like this:

{
    'hits': [
        {   
            'Title': 'Extravehicular Mobility Unit (EMU)',
            'Description': 'The EMU is a spacesuit that provides environmental protection, mobility, life support, and '
                           'communications for astronauts',
            '_highlights': {
                'Description': 'The EMU is a spacesuit that provides environmental protection, '
                               'mobility, life support, and communications for astronauts'
            },
            '_id': 'article_591',
            '_score': 1.2387788
        }, 
        {   
            'Title': 'The Travels of Marco Polo',
            'Description': "A 13th-century travelogue describing Polo's travels",
            '_highlights': {'Title': 'The Travels of Marco Polo'},
            '_id': 'e00d1a8d-894c-41a1-8e3b-d8b2a8fce12a',
            '_score': 1.2047464
        }
    ],
    'limit': 10,
    'processingTimeMs': 49,
    'query': 'What is the best outfit to wear on the moon?'
}

Each hit corresponds to a document that matched the search query.
Hits are ordered from best to worst match.
limit is the maximum number of hits to be returned. This can be set as a parameter during search.
Each hit has a _highlights field. This is the part of the document that best matched the query.
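
As a minimal sketch of how the fields described above might be read, the snippet below walks a results dict shaped like the sample output. summarize_hits is a hypothetical helper written for this example, not a marqo function, and the dict is a trimmed copy of the sample rather than a live response.

```python
def summarize_hits(results):
    """Return (id, score, highlighted field names) for each hit, in ranked order."""
    summary = []
    for hit in results["hits"]:
        # _highlights maps a field name to the best-matching text in that field.
        highlighted_fields = sorted(hit.get("_highlights", {}))
        summary.append((hit["_id"], hit["_score"], highlighted_fields))
    return summary


# Trimmed copy of the sample output shown above.
results = {
    "hits": [
        {
            "Title": "Extravehicular Mobility Unit (EMU)",
            "_highlights": {
                "Description": "The EMU is a spacesuit that provides environmental protection, "
                               "mobility, life support, and communications for astronauts"
            },
            "_id": "article_591",
            "_score": 1.2387788,
        },
        {
            "Title": "The Travels of Marco Polo",
            "_highlights": {"Title": "The Travels of Marco Polo"},
            "_id": "e00d1a8d-894c-41a1-8e3b-d8b2a8fce12a",
            "_score": 1.2047464,
        },
    ],
    "limit": 10,
}

for hit_id, score, fields in summarize_hits(results):
    print(hit_id, score, fields)
```

Because the hits arrive already ranked, the helper keeps their order and does not sort by _score itself.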

Warning

Do not run other applications on the OpenSearch cluster, as Marqo automatically changes and adapts the settings on the cluster.

Contributors

Marqo is a community project with the goal of making neural search accessible to the wider developer community. We are glad that you are interested in helping out! Please read this to get started

Dev set up

Create a virtual env
Activate the virtual environment
Install requirements from the requirements file: pip install -r requirements.txt
Run tests with tox: cd into this dir and then run "tox"
If you update dependencies, make sure to delete the .tox dir and rerun

Merge instructions:

Run the full test suite (by using the command tox in this dir)
Merge to main
If you think that the change is significant, run the large data test. The large data test will build Marqo from the main branch and fill indices with data. Go through and test queries against this data. https://github.com/S2Search/NeuralSearchLargeDataTest

Support

Join our Slack community and chat with other community members about ideas.
Join our Engineering all-hands/community meet-up to discuss your use case and learn about Marqo's new features.
- When? [TODO]
- Where? Zoom (see our public events calendar [INSERT OUR ZOOM]/.ical) and live stream on YouTube
Subscribe to the latest video tutorials on our YouTube channel

Join Us

marqo is backed by S2Search and licensed under MIT.
We are actively hiring [INSERT HIRING LINK] AI engineers and solution engineers to build the next neural search ecosystem in open source.

Project details


Release history

3.9.1

Oct 31, 2024

3.9.0

Oct 29, 2024

3.8.1

Sep 17, 2024

3.8.0

Sep 13, 2024

3.7.0

Jul 10, 2024

3.6.0

Jul 1, 2024

3.5.1

May 27, 2024

3.5.0

May 19, 2024

3.4.0

May 15, 2024

3.3.0

Apr 24, 2024

3.2.1

Apr 5, 2024

3.2.0

Mar 12, 2024

3.1.0

Feb 8, 2024

3.0.1

Jan 30, 2024

3.0.0

Jan 10, 2024

2.1.0

Dec 5, 2023

2.0.0

Oct 27, 2023

1.3.1

Sep 7, 2023

1.3.0

Sep 5, 2023

1.2.4

Aug 15, 2023

1.2.3

Aug 14, 2023

1.2.2

Aug 11, 2023

1.2.1

Aug 11, 2023

1.2.0

Aug 9, 2023

1.1.1

Jul 28, 2023

1.1.0

Jul 24, 2023

1.0.0

Jul 21, 2023

0.11.0

Jun 30, 2023

0.10.0

Jun 26, 2023

0.9.6

May 31, 2023

0.9.5

May 11, 2023

0.9.4

Apr 24, 2023

0.9.3

Apr 16, 2023

0.9.2

Apr 14, 2023

0.9.1

Apr 5, 2023

0.9.0

Apr 3, 2023

0.8.0

Mar 16, 2023

0.7.4

Mar 9, 2023

0.7.3

Mar 8, 2023

0.7.2

Feb 28, 2023

0.7.1

Feb 23, 2023

0.7.0

Feb 21, 2023

0.6.0

Feb 13, 2023

0.5.15

Jan 27, 2023

0.5.14

Jan 10, 2023

0.5.13

Jan 9, 2023

0.5.12

Dec 22, 2022

0.5.11

Dec 22, 2022

0.5.10

Dec 15, 2022

0.5.9

Dec 14, 2022

0.5.8

Nov 28, 2022

0.5.7

Nov 22, 2022

0.5.6

Nov 17, 2022

0.5.5

Nov 15, 2022

0.5.4

Nov 9, 2022

0.5.3

Nov 2, 2022

0.5.2

Nov 2, 2022

0.5.1

Oct 31, 2022

0.5.0

Oct 28, 2022

0.4.0

Oct 19, 2022

0.3.1

Oct 13, 2022

0.3.0

Oct 12, 2022

0.1.16

Oct 5, 2022

0.1.15

Sep 15, 2022

0.1.14

Sep 2, 2022

0.1.13

Aug 31, 2022

0.1.12

Aug 31, 2022

0.1.11

Aug 31, 2022

0.1.10

Aug 3, 2022

0.1.9

Aug 2, 2022

0.1.8

Aug 2, 2022

0.1.7

Aug 1, 2022

0.1.6

Aug 1, 2022

0.1.5

Aug 1, 2022

0.1.4

Aug 1, 2022

0.1.3

Aug 1, 2022

0.1.2

Aug 1, 2022

This version

0.1.1

Aug 1, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

marqo-0.1.1.tar.gz (46.3 kB)

Uploaded Aug 1, 2022 Source

File details

Details for the file marqo-0.1.1.tar.gz.

File metadata

Download URL: marqo-0.1.1.tar.gz
Upload date: Aug 1, 2022
Size: 46.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.8.3 requests/2.27.1 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/2.7.17

File hashes

Hashes for marqo-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`f71a7a92707bf0229defee1d133c1beda96c6f5bc09296d4c80481207f9efb8d`
MD5	`773d8de539b96dfd2672d1cbe69fbef3`
BLAKE2b-256	`9196e1e9c6efacf9cd1e5c2f032b3726395e73407b03d484007d8882fc4daee4`