CrewAI tools for web scraping and research using Olostep API
Project description
CrewAI Olostep Integration
Give your CrewAI agents the power to scrape, search, and research the web.
This package provides ready-to-use CrewAI tools that integrate with the Olostep API for powerful web data access capabilities.
Features
- 🌐 Web Scraping - Extract content from any URL in markdown, HTML, or text format
- 📦 Batch Processing - Scrape up to 10,000 URLs in parallel
- 🕷️ Website Crawling - Automatically discover and scrape entire websites
- 🗺️ Sitemap Extraction - Get all URLs from a website for site structure analysis
- 🔍 AI-Powered Search - Get answers to questions with web sources and structured output
Installation
pip install crewai-olostep
Quick Start
export OLOSTEP_API_KEY="your_api_key_here"
from crewai import Agent, Task, Crew
from crewai_olostep import olostep_scrape_tool, olostep_answer_tool
# Create an agent with Olostep tools
researcher = Agent(
role="Web Researcher",
goal="Find accurate information from the web",
backstory="Expert researcher with web scraping skills.",
tools=[olostep_scrape_tool, olostep_answer_tool],
verbose=True
)
# Create a task
task = Task(
description="Research the pricing of Stripe's payment processing",
expected_output="A summary of Stripe's pricing tiers and fees",
agent=researcher
)
# Run the crew
crew = Crew(agents=[researcher], tasks=[task])
result = crew.kickoff()
print(result)
Available Tools
| Tool | Description | Key Parameters |
|---|---|---|
olostep_scrape_tool |
Scrape single pages | url, format |
olostep_batch_tool |
Process 10K+ URLs in parallel | urls (comma-separated), format |
olostep_crawl_tool |
Crawl entire websites | start_url, max_pages, include_pattern, exclude_pattern |
olostep_sitemap_tool |
Extract all URLs from a site | url, search_query, max_urls |
olostep_answer_tool |
AI-powered web search | question, output_schema |
Get All Tools
from crewai_olostep import get_all_tools
all_tools = get_all_tools() # Returns all 5 tools
Example: Multi-Agent Research Crew
from crewai import Agent, Task, Crew, Process
from crewai_olostep import (
olostep_scrape_tool,
olostep_sitemap_tool,
olostep_answer_tool,
)
# Specialized agents
explorer = Agent(
role="Site Explorer",
goal="Discover website structures",
tools=[olostep_sitemap_tool],
)
scraper = Agent(
role="Content Extractor",
goal="Extract web content",
tools=[olostep_scrape_tool],
)
analyst = Agent(
role="Research Analyst",
goal="Analyze and synthesize",
tools=[olostep_answer_tool],
)
# Chained tasks
task1 = Task(description="Find all product pages on https://example.com", agent=explorer)
task2 = Task(description="Scrape the top 3 pages", agent=scraper, context=[task1])
task3 = Task(description="Summarize findings", agent=analyst, context=[task2])
crew = Crew(
agents=[explorer, scraper, analyst],
tasks=[task1, task2, task3],
process=Process.sequential,
)
result = crew.kickoff()
Documentation
Full documentation at docs.olostep.com/integrations/crewai
Support
- PyPI: pypi.org/project/crewai-olostep
- Docs: docs.olostep.com
- Email: info@olostep.com
License
MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file crewai_olostep-0.1.1.tar.gz.
File metadata
- Download URL: crewai_olostep-0.1.1.tar.gz
- Upload date:
- Size: 7.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5c60f3d6608d82b39ea73afd791561962b16c8f3e12e94961e14678b7488ed79
|
|
| MD5 |
75f1439896e913b732a1e5a2a6b44243
|
|
| BLAKE2b-256 |
ecc2f716c1142579a1d5f84e69d10004825e16d2e48f6dabe5b21b4e75a088e3
|
File details
Details for the file crewai_olostep-0.1.1-py3-none-any.whl.
File metadata
- Download URL: crewai_olostep-0.1.1-py3-none-any.whl
- Upload date:
- Size: 7.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b663dd4adfb1795a1419249229eb78a93d7c8880a7739e1cfadc67e661a1a43e
|
|
| MD5 |
6169bf598898cadf75b607c59acdfc74
|
|
| BLAKE2b-256 |
bc4290e8570c89cf081e140284e03a167dd509014139ada84334cc6c443dadd3
|