An integration package connecting ScraperAPI and LangChain

LangChain – ScraperAPI

Give your AI agent the ability to browse websites and search Google and Amazon in just two lines of code.

The langchain-scraperapi package adds three ready-to-use LangChain tools backed by the ScraperAPI service:

Tool class                    Use it to
ScraperAPITool                Grab the HTML/text/markdown of any web page
ScraperAPIGoogleSearchTool    Get structured Google Search SERP data
ScraperAPIAmazonSearchTool    Get structured Amazon product-search data

Installation

pip install -U langchain-scraperapi

Setup

Create an account at https://www.scraperapi.com/ and get an API key, then set it as an environment variable:

import os
os.environ["SCRAPERAPI_API_KEY"] = "your-api-key"
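
If the key is exported in the shell instead of being set in code, a small check at startup can surface a missing key early. The following is a minimal sketch; the helper name require_scraperapi_key is hypothetical and not part of the package, and only the SCRAPERAPI_API_KEY variable name comes from the setup step above.

```python
import os


def require_scraperapi_key(env=os.environ):
    """Return the ScraperAPI key from the environment, or fail with a clear message.

    Hypothetical helper for illustration; not provided by langchain-scraperapi.
    """
    key = env.get("SCRAPERAPI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "SCRAPERAPI_API_KEY is not set; export it or assign it via os.environ "
            "before creating any ScraperAPI tool."
        )
    return key
```

Calling require_scraperapi_key() once before constructing the tools keeps the key out of source files while still failing fast when it is absent.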

Quick Start

ScraperAPITool — Browse any website

Scrape HTML, text, or markdown from any webpage:

from langchain_scraperapi.tools import ScraperAPITool

tool = ScraperAPITool()

# Get text content
result = tool.invoke({
    "url": "https://example.com",
    "output_format": "text",
    "render": True
})
print(result)

Parameters:

  • url (required) – target page URL
  • output_format – "text" | "markdown" (default returns HTML)
  • country_code – e.g. "us", "de"
  • device_type – "desktop" | "mobile"
  • premium – use premium proxies
  • render – run JavaScript before returning content
  • keep_headers – include response headers
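
To combine several of the parameters above, one option is to build the input dictionary with a small helper that rejects misspelled option names and drops unset ones, so the tool's defaults apply. This is a sketch only: build_scrape_input is a hypothetical name, and treating premium and keep_headers as booleans is an assumption based on how render is used in the Quick Start.

```python
# Option names taken from the parameter list above.
SCRAPE_OPTIONS = {
    "output_format",
    "country_code",
    "device_type",
    "premium",
    "render",
    "keep_headers",
}


def build_scrape_input(url, **options):
    """Build the dict passed to ScraperAPITool.invoke (hypothetical helper)."""
    unknown = set(options) - SCRAPE_OPTIONS
    if unknown:
        raise ValueError(f"Unsupported options: {sorted(unknown)}")
    payload = {"url": url}
    # Skip options left as None so the tool falls back to its own defaults.
    payload.update({name: value for name, value in options.items() if value is not None})
    return payload


# Usage with the real tool (needs the package and an API key):
# from langchain_scraperapi.tools import ScraperAPITool
# page = ScraperAPITool().invoke(
#     build_scrape_input("https://example.com", output_format="markdown", country_code="de")
# )
```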

ScraperAPIGoogleSearchTool — Structured Google Search

Get structured Google Search results:

from langchain_scraperapi.tools import ScraperAPIGoogleSearchTool

google_search = ScraperAPIGoogleSearchTool()

results = google_search.invoke({
    "query": "what is langchain",
    "num": 20,
    "output_format": "json"
})
print(results)

Parameters:

  • query (required) – search terms
  • output_format – "json" (default) or "csv"
  • country_code, tld, num, hl, gl – optional search modifiers
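
This page does not say whether invoke returns a raw string or an already parsed object, so downstream code may want to normalize the result. The sketch below assumes the tool may hand back JSON or CSV text; parse_search_output is a hypothetical helper, and the field names in the usage comment are made-up sample data rather than the real response schema.

```python
import csv
import io
import json


def parse_search_output(raw, output_format="json"):
    """Normalize a search tool result into Python objects (hypothetical helper).

    Non-string results are returned unchanged, on the assumption that they
    are already parsed.
    """
    if not isinstance(raw, str):
        return raw
    if output_format == "csv":
        # One dict per CSV row, keyed by the header line.
        return list(csv.DictReader(io.StringIO(raw)))
    return json.loads(raw)


# Example with invented sample text:
# rows = parse_search_output("title,link\nLangChain,https://example.com\n", "csv")
```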

ScraperAPIAmazonSearchTool — Structured Amazon Search

Get structured Amazon product search results:

from langchain_scraperapi.tools import ScraperAPIAmazonSearchTool

amazon_search = ScraperAPIAmazonSearchTool()

products = amazon_search.invoke({
    "query": "noise cancelling headphones",
    "tld": "co.uk",
    "page": 2
})
print(products)

Parameters:

  • query (required) – product search terms
  • output_format – "json" (default) or "csv"
  • country_code, tld, page – optional search modifiers
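
Because page selects a single page of results, collecting several pages means one call per page. A minimal sketch follows, assuming page numbering starts at 1 (the page does not state this); collect_pages is a hypothetical helper that accepts any callable, so it can be tried with a stub before spending real requests.

```python
def collect_pages(search, query, pages, **options):
    """Call `search` once per page and return the results in page order.

    `search` is any callable that takes the tool input dict, for example
    ScraperAPIAmazonSearchTool().invoke. Hypothetical helper for illustration.
    """
    results = []
    for page in range(1, pages + 1):  # assumes 1-based page numbers
        results.append(search({"query": query, "page": page, **options}))
    return results


# Usage with the real tool (needs the package and an API key):
# from langchain_scraperapi.tools import ScraperAPIAmazonSearchTool
# pages = collect_pages(
#     ScraperAPIAmazonSearchTool().invoke, "noise cancelling headphones", 2, tld="co.uk"
# )
```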

Example: AI Agent that can browse the web

from langchain_openai import ChatOpenAI
from langchain.agents import AgentExecutor, create_tool_calling_agent
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_scraperapi.tools import ScraperAPITool

# Set up tools and LLM
tools = [ScraperAPITool()]
llm = ChatOpenAI(model_name="gpt-4o", temperature=0)

# Create prompt
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a helpful assistant that can browse websites. Use ScraperAPITool to access web content."),
    ("human", "{input}"),
    MessagesPlaceholder(variable_name="agent_scratchpad"),
])

# Create and run agent
agent = create_tool_calling_agent(llm, tools, prompt)
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)

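response = agent_executor.invoke({
    "input": "Browse hackernews and summarize the top story"
})

# AgentExecutor returns a dict; the agent's final answer is under "output"
print(response["output"])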
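
The same agent setup can be given all three tools from this package so the model can choose between browsing, Google search, and Amazon search. The sketch below is one way to arrange that; build_scraperapi_tools and SYSTEM_PROMPT are hypothetical names, and the prompt wording is only a suggestion.

```python
def build_scraperapi_tools():
    """Instantiate the three tools listed at the top of this page.

    Expects SCRAPERAPI_API_KEY to be set as described in Setup.
    """
    # Imported lazily so this module loads even where the package is missing.
    from langchain_scraperapi.tools import (
        ScraperAPIAmazonSearchTool,
        ScraperAPIGoogleSearchTool,
        ScraperAPITool,
    )

    return [
        ScraperAPITool(),
        ScraperAPIGoogleSearchTool(),
        ScraperAPIAmazonSearchTool(),
    ]


# Suggested system message for an agent that has all three tools available.
SYSTEM_PROMPT = (
    "You are a helpful assistant that can browse websites, search Google, "
    "and search Amazon. Pick the tool that fits the request."
)
```

To use it, pass tools = build_scraperapi_tools() to create_tool_calling_agent and AgentExecutor in place of the single-tool list, and use SYSTEM_PROMPT as the system message in the prompt template.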

Documentation

For complete parameter details and advanced usage, see the ScraperAPI documentation.
