Skip to main content

CrewAI tools for web scraping and research using Olostep API

Project description

CrewAI Olostep Integration

PyPI version License: MIT

Give your CrewAI agents the power to scrape, search, and research the web.

This package provides ready-to-use CrewAI tools that integrate with the Olostep API for powerful web data access capabilities.

Features

  • 🌐 Web Scraping - Extract content from any URL in markdown, HTML, or text format
  • 📦 Batch Processing - Scrape up to 10,000 URLs in parallel
  • 🕷️ Website Crawling - Automatically discover and scrape entire websites
  • 🗺️ Sitemap Extraction - Get all URLs from a website for site structure analysis
  • 🔍 AI-Powered Search - Get answers to questions with web sources and structured output

Installation

pip install crewai-olostep

Quick Start

export OLOSTEP_API_KEY="your_api_key_here"
from crewai import Agent, Task, Crew
from crewai_olostep import olostep_scrape_tool, olostep_answer_tool

# Create an agent with Olostep tools
researcher = Agent(
    role="Web Researcher",
    goal="Find accurate information from the web",
    backstory="Expert researcher with web scraping skills.",
    tools=[olostep_scrape_tool, olostep_answer_tool],
    verbose=True
)

# Create a task
task = Task(
    description="Research the pricing of Stripe's payment processing",
    expected_output="A summary of Stripe's pricing tiers and fees",
    agent=researcher
)

# Run the crew
crew = Crew(agents=[researcher], tasks=[task])
result = crew.kickoff()
print(result)

Available Tools

Tool Description Key Parameters
olostep_scrape_tool Scrape single pages url, format
olostep_batch_tool Process 10K+ URLs in parallel urls (comma-separated), format
olostep_crawl_tool Crawl entire websites start_url, max_pages, include_pattern, exclude_pattern
olostep_sitemap_tool Extract all URLs from a site url, search_query, max_urls
olostep_answer_tool AI-powered web search question, output_schema

Get All Tools

from crewai_olostep import get_all_tools

all_tools = get_all_tools()  # Returns all 5 tools

Example: Multi-Agent Research Crew

from crewai import Agent, Task, Crew, Process
from crewai_olostep import (
    olostep_scrape_tool,
    olostep_sitemap_tool,
    olostep_answer_tool,
)

# Specialized agents
explorer = Agent(
    role="Site Explorer",
    goal="Discover website structures",
    tools=[olostep_sitemap_tool],
)

scraper = Agent(
    role="Content Extractor", 
    goal="Extract web content",
    tools=[olostep_scrape_tool],
)

analyst = Agent(
    role="Research Analyst",
    goal="Analyze and synthesize",
    tools=[olostep_answer_tool],
)

# Chained tasks
task1 = Task(description="Find all product pages on https://example.com", agent=explorer)
task2 = Task(description="Scrape the top 3 pages", agent=scraper, context=[task1])
task3 = Task(description="Summarize findings", agent=analyst, context=[task2])

crew = Crew(
    agents=[explorer, scraper, analyst],
    tasks=[task1, task2, task3],
    process=Process.sequential,
)

result = crew.kickoff()

Documentation

Full documentation at docs.olostep.com/integrations/crewai

Support

License

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crewai_olostep-0.1.1.tar.gz (7.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

crewai_olostep-0.1.1-py3-none-any.whl (7.2 kB view details)

Uploaded Python 3

File details

Details for the file crewai_olostep-0.1.1.tar.gz.

File metadata

  • Download URL: crewai_olostep-0.1.1.tar.gz
  • Upload date:
  • Size: 7.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for crewai_olostep-0.1.1.tar.gz
Algorithm Hash digest
SHA256 5c60f3d6608d82b39ea73afd791561962b16c8f3e12e94961e14678b7488ed79
MD5 75f1439896e913b732a1e5a2a6b44243
BLAKE2b-256 ecc2f716c1142579a1d5f84e69d10004825e16d2e48f6dabe5b21b4e75a088e3

See more details on using hashes here.

File details

Details for the file crewai_olostep-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: crewai_olostep-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 7.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for crewai_olostep-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b663dd4adfb1795a1419249229eb78a93d7c8880a7739e1cfadc67e661a1a43e
MD5 6169bf598898cadf75b607c59acdfc74
BLAKE2b-256 bc4290e8570c89cf081e140284e03a167dd509014139ada84334cc6c443dadd3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page