Apify-haystack integration
Project description
Apify-Haystack integration
The Apify-Haystack integration allows easy interaction between the Apify platform and Haystack.
Apify is a platform for web scraping, data extraction, and web automation tasks. It provides serverless applications called Actors for different tasks, like crawling websites, and scraping Facebook, Instagram, and Google results, etc.
Haystack offers an ecosystem of tools for building, managing, and deploying search engines and LLM applications.
Installation
Apify-haystack is available as the apify-haystack
PyPI package.
pip install apify-haystack
Examples
See the examples directory for more examples, here is a list of few of them
- Load a dataset from Apify and convert it to Haystack Documents
- Call Apify Actor and load a dataset to convert it to Haystack Documents
- Crawl website, scrape text content, and store it in the InMemoryDocumentStore
- Retrieval-Augmented Generation (RAG): Extracting text from a website & question answering
Support
If you find any bug or issue, please submit an issue on GitHub. For questions, you can ask on Stack Overflow, in GitHub Discussions or you can join our Discord server.
Contributing
Your code contributions are welcome. If you have any ideas for improvements, either submit an issue or create a pull request. For contribution guidelines and the code of conduct, see CONTRIBUTING.md.
License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for apify_haystack-0.0.1a6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 568b8603bc37600b253df325ef4917776231e0b0f60a510668d4fffe2163a68a |
|
MD5 | 8b10fc4c844085902238e7a6f35e2159 |
|
BLAKE2b-256 | 1e666bb94a114c576343f7674deb79ffde28c3b2293ac27fc498a71281932c64 |