No project description provided
Project description
Unstructured Haystack
Unstructured Connectors for Haystack
This is an example Haystack 2.0 integration. It's an integration for Unstructured.io connectors. Please contribute 🚀
The current version has 2 available Unstructured connectors:
- Discord:
UnstructuredDiscordConnector
- GitHub:
UnstructuredGitHubConnector
- Google Drive:
UnstructuredGoogleDriveConnector
How to use in a Haystack 2.0 Pipeline
For example, you can write documents fetched from Discord using the UnstructuredDiscordConnector
:
from haystack.preview import Pipeline
from haystack.preview.components.writers import DocumentWriter
from unstructured_haystack import UnstructuredDiscordConnector
from chroma_haystack import ChromaDocumentStore
# Chroma is used in-memory so we use the same instances in the two pipelines below
document_store = ChromaDocumentStore()
connector = UnstructuredDiscordConnector(api_key="UNSTRUCTURED_API_KEY", discord_token="DISCORD_TOKEN")
indexing = Pipeline()
indexing.add_component("connector", connector)
indexing.add_component("writer", DocumentWriter(document_store))
indexing.connect("connector.documents", "writer.documents")
indexing.run({"connector": {"channels" : "993539071815200889", "period": 3, "output_dir" : "discord-example"}})
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for unstructured_haystack-0.0.6.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 508009df3365e5e6fdace3eb512135991d4bb19af023d7af891430c3387f9acb |
|
MD5 | 62dff3fad3488be78e561234096e2fa2 |
|
BLAKE2b-256 | 5a4adcf2a32a81969ea5d8de46b14d9c71463cc39a97fac12f3952b3eb7dd46b |
Close
Hashes for unstructured_haystack-0.0.6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 938892d54a23ff2442bf70de95e5750bc6a90633dc87299bdf911a79505f91e5 |
|
MD5 | 3876a376cf12874886d52ff5e624a9c5 |
|
BLAKE2b-256 | f36de96e63165479d56dee26351a247f4f9d68f27275e92c8e68f4bee756d857 |