No project description provided
Project description
Unstructured Haystack
Unstructured Connectors for Haystack
This is an example Haystack 2.0 integration. It's an integration for Unstructured.io connectors. Please contribute 🚀
The current version has 2 available Unstructured connectors:
- Discord:
UnstructuredDiscordConnector
- GitHub:
UnstructuredGitHubConnector
- Google Drive:
UnstructuredGoogleDriveConnector
How to use in a Haystack 2.0 Pipeline
For example, you can write documents fetched from Discord using the UnstructuredDiscordConnector
:
from haystack.preview import Pipeline
from haystack.preview.components.writers import DocumentWriter
from unstructured_haystack import UnstructuredDiscordConnector
from chroma_haystack import ChromaDocumentStore
# Chroma is used in-memory so we use the same instances in the two pipelines below
document_store = ChromaDocumentStore()
connector = UnstructuredDiscordConnector(api_key="UNSTRUCTURED_API_KEY", discord_token="DISCORD_TOKEN")
indexing = Pipeline()
indexing.add_component("connector", connector)
indexing.add_component("writer", DocumentWriter(document_store))
indexing.connect("connector.documents", "writer.documents")
indexing.run({"connector": {"channels" : "993539071815200889", "period": 3, "output_dir" : "discord-example"}})
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for unstructured_haystack-0.0.5.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | e5bb93635fd89503412fcc7fd035e27cad54f39614f31f4b95c876f765e3bb2e |
|
MD5 | bfc10376e85549508f1e9e2ce9a824d6 |
|
BLAKE2b-256 | 57b37e58f19ae34b6f64260433c67b8357582584eb00fa00d8ba3d7970dbb2b9 |
Close
Hashes for unstructured_haystack-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d09804df50e2b5c18e47153bd1776e37f754fd24c6bb3f0e8c9cc5212f766e34 |
|
MD5 | c3d61c89fc5d054e579ed22857e4c84b |
|
BLAKE2b-256 | 95f7bc2687852afa563bffc4dc043e3d3ce41b75a83ad2a0d46feef7498ff2a8 |