sycamore-ai

Sycamore is an LLM-powered semantic data preparation system for building search applications.

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

License

Sycamore is a conversational search and analytics platform for complex unstructured data, such as documents, presentations, transcripts, embedded tables, and internal knowledge repositories. It retrieves and synthesizes high-quality answers through bringing AI to data preparation, indexing, and retrieval. Sycamore makes it easy to prepare unstructured data for search and analytics, providing a toolkit for data cleaning, information extraction, enrichment, summarization, and generation of vector embeddings that encapsulate the semantics of data. Sycamore uses your choice of generative AI models to make these operations simple and effective, and it enables quick experimentation and iteration. Additionally, Sycamore uses OpenSearch for indexing, enabling hybrid (vector + keyword) search, retrieval-augmented generation (RAG) pipelining, filtering, analytical functions, conversational memory, and other features to improve information retrieval.

Untitled

Features

Natural language, conversational interface to ask complex questions on unstructured data. Includes citations to source passages and conversational memory.
Includes a variety of query operations over unstructured data, including hybrid search, retrieval augmented generation (RAG), and analytical functions.
Prepares and enriches complex unstructured data for search and analytics through advanced data segmentation, LLM-powered UDFs for data enrichment, performant data manipulation with Python, and vector embeddings using a variety of AI models.
Helpful features like automatic data crawlers (Amazon S3 and HTTP) and Jupyter notebook support to create and iterate on data preparation scripts.
Scalable, secure, and customizable OpenSearch backend for indexing and data retrieval.

Demo

Hosted on Loom

Get Started

You can easily deploy Sycamore locally or on a virtual machine using Docker.

With Docker installed:

Clone the Sycamore repo:

git clone https://github.com/aryn-ai/sycamore

Set OpenAI Key:

export OPENAI_API_KEY=YOUR-KEY

Go to:

./sycamore

Launch Sycamore. Containers will be pulled from DockerHub:

docker compose up --pull=always

The Sycamore demo query UI will be at localhost:3000

You can next choose to run a demo that prepares and ingests data from the Sort Benchmark website, crawl data from a public website, or write your own data preparation script.

For more info about Sycamore’s data ingestion and preparation feature set, visit the Sycamore documentation.

Resources

Documentation: https://sycamore.readthedocs.io
Slack: https://join.slack.com/t/sycamore-ulj8912/shared_invite/zt-23sv0yhgy-MywV5dkVQ~F98Aoejo48Jg
Data preparation libraries (PyPi): https://pypi.org/project/sycamore-ai/
Contact us: info@aryn.ai

Contributing

Check out our Contributing Guide for more information about how to contribute to Sycamore and set up your environment for development.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.1.16

May 7, 2024

0.1.15

Apr 12, 2024

0.1.14

Apr 2, 2024

0.1.13

Mar 15, 2024

0.1.12

Feb 9, 2024

0.1.11

Jan 3, 2024

0.1.10

Dec 21, 2023

0.1.9

Dec 8, 2023

0.1.8

Nov 18, 2023

0.1.7

Nov 3, 2023

0.1.6

Oct 20, 2023

0.1.5

Oct 12, 2023

0.1.4

Oct 6, 2023

0.1.3

Sep 28, 2023

0.1.2

Sep 28, 2023

0.1.1

Sep 28, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sycamore_ai-0.1.16.tar.gz (11.4 MB view hashes)

Uploaded May 7, 2024 Source

Built Distribution

sycamore_ai-0.1.16-py3-none-any.whl (11.4 MB view hashes)

Uploaded May 7, 2024 Python 3

Hashes for sycamore_ai-0.1.16.tar.gz

Hashes for sycamore_ai-0.1.16.tar.gz
Algorithm	Hash digest
SHA256	`5719a1a68dbc386f79443347c42e417115a7e51a15b2e875ceeab034f784f8a8`
MD5	`279986414d5378942c73683af1c3187f`
BLAKE2b-256	`31075090bf77ee4ce217b867e9c6842ae9044ac2ff38640c042e65214f023566`

Hashes for sycamore_ai-0.1.16-py3-none-any.whl

Hashes for sycamore_ai-0.1.16-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c282dd8aada8e728ffefa5c1f2f94c67b17dc53bd972e9b7fa85b6f14e11c87e`
MD5	`df42ff75d6c416a1807332d2551cbbd3`
BLAKE2b-256	`f13dc8f005a8a0c77223e566f79c14cd97535470819dfcd97c4652c0f39eac9a`