Python SDK for Firecrawl API

These details have not been verified by PyPI

Project links

Project description

Firecrawl Python SDK

The Firecrawl Python SDK is a library that allows you to easily search, scrape, and interact with the web, and output the data in a format ready for use with language models (LLMs). It provides a simple and intuitive interface for the Firecrawl API.

Installation

To install the Firecrawl Python SDK, you can use pip:

pip install firecrawl-py

Usage

Get an API key from firecrawl.dev
Set the API key as an environment variable named FIRECRAWL_API_KEY or pass it as a parameter to the Firecrawl class.

Here's an example of how to use the SDK:

from firecrawl import Firecrawl
from firecrawl.types import ScrapeOptions

firecrawl = Firecrawl(api_key="fc-YOUR_API_KEY")

# Scrape a website (v2):
data = firecrawl.scrape(
  'https://firecrawl.dev', 
  formats=['markdown', 'html']
)
print(data)

# Crawl a website (v2 waiter):
crawl_status = firecrawl.crawl(
  'https://firecrawl.dev', 
  limit=100, 
  scrape_options=ScrapeOptions(formats=['markdown', 'html'])
)
print(crawl_status)

Scraping a URL

To scrape a single URL, use the scrape method. It takes the URL as a parameter and returns a document with the requested formats.

# Scrape a website (v2):
scrape_result = firecrawl.scrape('https://firecrawl.dev', formats=['markdown', 'html'])
print(scrape_result)

Parsing uploaded files

Use parse to upload local bytes/files (html, pdf, docx, etc.) as multipart form data and return the parsed document. parse does not support change tracking or browser-only options (actions, wait_for, location, mobile, screenshot, branding).

from firecrawl import Firecrawl
from firecrawl.v2.types import ParseOptions

firecrawl = Firecrawl(api_key="fc-YOUR_API_KEY")

doc = firecrawl.parse(
  b"<!DOCTYPE html><html><body><h1>Python Parse</h1></body></html>",
  filename="upload.html",
  content_type="text/html",
  options=ParseOptions(formats=["markdown"]),
)

print(doc.markdown)

Crawling a Website

To crawl a website, use the crawl method. It takes the starting URL and optional parameters as arguments. You can control depth, limits, formats, and more.

crawl_status = firecrawl.crawl(
  'https://firecrawl.dev', 
  limit=100, 
  scrape_options=ScrapeOptions(formats=['markdown', 'html']),
  poll_interval=30
)
print(crawl_status)

Asynchronous Crawling

Looking for async operations? Check out the Async Class section below.

To enqueue a crawl asynchronously, use start_crawl. It returns the crawl ID which you can use to check the status of the crawl job.

crawl_job = firecrawl.start_crawl(
  'https://firecrawl.dev', 
  limit=100, 
  scrape_options=ScrapeOptions(formats=['markdown', 'html']),
)
print(crawl_job)

Checking Crawl Status

To check the status of a crawl job, use the get_crawl_status method. It takes the job ID as a parameter and returns the current status of the crawl job.

crawl_status = firecrawl.get_crawl_status("<crawl_id>")
print(crawl_status)

Manual Pagination (v2)

Crawl and batch scrape status responses may include a next URL when more data is available. The SDK auto-paginates by default; to page manually, disable auto-pagination and pass the opaque next URL back to the SDK.

from firecrawl.v2.types import PaginationConfig

# Crawl: fetch one page at a time
crawl_job = firecrawl.start_crawl("https://firecrawl.dev", limit=100)
status = firecrawl.get_crawl_status(
  crawl_job.id,
  pagination_config=PaginationConfig(auto_paginate=False),
)
if status.next:
  page2 = firecrawl.get_crawl_status_page(status.next)

# Batch scrape: fetch one page at a time
batch_job = firecrawl.start_batch_scrape(["https://firecrawl.dev"])
status = firecrawl.get_batch_scrape_status(
  batch_job.id,
  pagination_config=PaginationConfig(auto_paginate=False),
)
if status.next:
  page2 = firecrawl.get_batch_scrape_status_page(status.next)

Cancelling a Crawl

To cancel an asynchronous crawl job, use the cancel_crawl method. It takes the job ID of the asynchronous crawl as a parameter and returns the cancellation status.

cancel_crawl = firecrawl.cancel_crawl(id)
print(cancel_crawl)

Map a Website

Use map to generate a list of URLs from a website. Options let you customize the mapping process, including whether to use the sitemap or include subdomains.

# Map a website (v2):
map_result = firecrawl.map('https://firecrawl.dev')
print(map_result)

Scrape-bound interactive browsing (v2)

Use a scrape job ID to keep interacting with the replayed browser context:

doc = firecrawl.scrape(
  "https://example.com",
  actions=[{"type": "click", "selector": "a[href='/pricing']"}],
)

scrape_job_id = doc.metadata_typed.scrape_id
if not scrape_job_id:
  raise RuntimeError("Missing scrape job id")

run = firecrawl.interact(
  scrape_job_id,
  code="print(await page.url())",
  language="python",
  timeout=60,
)
print(run.stdout)

firecrawl.stop_interaction(scrape_job_id)

{/* ### Extracting Structured Data from Websites

To extract structured data from websites, use the extract method. It takes the URLs to extract data from, a prompt, and a schema as arguments. The schema is a Pydantic model that defines the structure of the extracted data.

*/}

Crawling a Website with WebSockets

To crawl a website with WebSockets, use the crawl_url_and_watch method. It takes the starting URL and optional parameters as arguments. The params argument allows you to specify additional options for the crawl job, such as the maximum number of pages to crawl, allowed domains, and the output format.

# inside an async function...
nest_asyncio.apply()

# Define event handlers
def on_document(detail):
    print("DOC", detail)

def on_error(detail):
    print("ERR", detail['error'])

def on_done(detail):
    print("DONE", detail['status'])

    # Function to start the crawl and watch process
async def start_crawl_and_watch():
    # Initiate the crawl job and get the watcher
    watcher = app.crawl_url_and_watch('firecrawl.dev', exclude_paths=['blog/*'], limit=5)

    # Add event listeners
    watcher.add_event_listener("document", on_document)
    watcher.add_event_listener("error", on_error)
    watcher.add_event_listener("done", on_done)

    # Start the watcher
    await watcher.connect()

# Run the event loop
await start_crawl_and_watch()

Error Handling

The SDK handles errors returned by the Firecrawl API and raises appropriate exceptions. If an error occurs during a request, an exception will be raised with a descriptive error message.

Async Class

For async operations, you can use the AsyncFirecrawl class. Its methods mirror the Firecrawl class, but you await them.

from firecrawl import AsyncFirecrawl

firecrawl = AsyncFirecrawl(api_key="YOUR_API_KEY")

# Async Scrape (v2)
async def example_scrape():
  scrape_result = await firecrawl.scrape(url="https://example.com")
  print(scrape_result)

# Async Parse (v2)
async def example_parse():
  parse_result = await firecrawl.parse(
    b"<!DOCTYPE html><html><body><h1>Async Parse</h1></body></html>",
    filename="upload.html",
    content_type="text/html",
  )
  print(parse_result)

# Async Crawl (v2)
async def example_crawl():
  crawl_result = await firecrawl.crawl(url="https://example.com")
  print(crawl_result)

v1 compatibility

For legacy code paths, v1 remains available under firecrawl.v1 with the original method names.

from firecrawl import Firecrawl

firecrawl = Firecrawl(api_key="YOUR_API_KEY")

# v1 methods (feature‑frozen)
doc_v1 = firecrawl.v1.scrape_url('https://firecrawl.dev', formats=['markdown', 'html'])
crawl_v1 = firecrawl.v1.crawl_url('https://firecrawl.dev', limit=100)
map_v1 = firecrawl.v1.map_url('https://firecrawl.dev')

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

4.24.0

May 1, 2026

This version

4.23.1

Apr 30, 2026

4.23.0

Apr 22, 2026

4.22.3

Apr 21, 2026

4.22.2

Apr 15, 2026

4.22.1

Apr 7, 2026

4.22.0

Apr 3, 2026

4.21.1

Apr 2, 2026

4.21.0

Mar 25, 2026

4.20.0

Mar 23, 2026

4.19.0

Mar 12, 2026

4.18.1

Mar 7, 2026

4.18.0

Feb 26, 2026

4.17.2

Feb 26, 2026

4.17.1

Feb 26, 2026

4.17.0

Feb 26, 2026

4.16.3

Feb 25, 2026

4.16.2

Feb 24, 2026

4.16.1

Feb 24, 2026

4.16.0

Feb 18, 2026

4.15.1

Feb 17, 2026

4.15.0

Feb 17, 2026

4.14.1

Feb 12, 2026

4.14.0

Jan 30, 2026

4.13.4

Jan 23, 2026

4.13.3

Jan 22, 2026

4.13.2

Jan 22, 2026

4.13.1

Jan 20, 2026

4.13.0

Jan 14, 2026

4.12.0

Dec 19, 2025

4.11.3

Dec 17, 2025

4.11.2

Dec 17, 2025

4.11.1

Dec 16, 2025

4.11.0

Dec 14, 2025

4.10.5

Dec 13, 2025

4.10.4

Dec 9, 2025

4.10.3

Dec 7, 2025

4.10.2

Dec 4, 2025

4.10.1

Dec 3, 2025

4.10.0

Nov 30, 2025

4.9.0

Nov 26, 2025

4.8.0

Nov 12, 2025

4.7.0

Nov 12, 2025

4.6.0

Nov 5, 2025

4.5.0

Oct 17, 2025

4.4.0

Oct 13, 2025

4.3.7

Oct 10, 2025

4.3.6

Sep 7, 2025

4.3.5

Sep 5, 2025

4.3.4

Sep 5, 2025

4.3.3

Sep 3, 2025

4.3.2

Sep 3, 2025

4.3.1

Sep 1, 2025

4.3.0

Sep 1, 2025

4.2.0

Sep 1, 2025

4.1.1

Aug 30, 2025

4.1.0

Aug 30, 2025

4.0.0

Aug 29, 2025

3.4.0

Aug 27, 2025

3.3.3

Aug 27, 2025

3.3.2

Aug 23, 2025

3.3.0

Aug 23, 2025

3.2.1

Aug 22, 2025

3.2.0

Aug 22, 2025

3.1.0

Aug 21, 2025

3.0.3

Aug 19, 2025

2.16.5

Aug 6, 2025

2.16.3

Jul 29, 2025

2.16.2

Jul 22, 2025

2.16.1

Jul 15, 2025

2.16.0

Jul 14, 2025

2.15.0

Jul 4, 2025

2.14.0

Jul 2, 2025

2.13.0

Jul 1, 2025

2.12.0

Jun 27, 2025

2.11.0

Jun 27, 2025

2.10.0

Jun 26, 2025

2.9.0

Jun 20, 2025

2.7.1

May 28, 2025

2.7.0

May 20, 2025

2.6.0

May 16, 2025

2.5.4

May 8, 2025

2.5.3

Apr 29, 2025

2.5.2

Apr 29, 2025

2.5.1

Apr 29, 2025

2.5.0

Apr 29, 2025

2.4.3

Apr 28, 2025

2.4.2

Apr 28, 2025

2.4.1

Apr 28, 2025

2.4.0

Apr 26, 2025

2.3.0

Apr 24, 2025

2.2.0

Apr 22, 2025

2.1.2

Apr 19, 2025

2.1.1

Apr 18, 2025

2.1.0

Apr 18, 2025

2.0.2

Apr 18, 2025

2.0.1

Apr 18, 2025

2.0.0

Apr 18, 2025

1.17.0

Apr 17, 2025

1.16.0

Apr 12, 2025

1.15.0

Mar 24, 2025

1.14.1

Mar 17, 2025

1.14.0

Mar 17, 2025

1.13.5

Mar 5, 2025

1.13.3

Mar 5, 2025

1.13.2

Mar 2, 2025

1.13.1

Mar 2, 2025

1.13.0

Mar 2, 2025

1.12.0

Feb 13, 2025

1.11.1

Feb 6, 2025

1.11.0

Jan 31, 2025

1.10.2

Jan 21, 2025

1.10.1

Jan 20, 2025

1.10.0

Jan 18, 2025

1.9.0

Jan 7, 2025

1.8.0

Jan 2, 2025

1.7.1

Jan 2, 2025

1.7.0

Dec 27, 2024

1.6.8

Dec 19, 2024

1.6.7

Dec 19, 2024

1.6.4

Dec 12, 2024

1.6.3

Dec 4, 2024

1.6.2

Dec 3, 2024

1.6.1

Nov 26, 2024

1.6.0

Nov 26, 2024

1.5.0

Nov 11, 2024

1.4.0

Oct 23, 2024

1.3.1

Oct 13, 2024

1.3.0

Oct 10, 2024

1.2.4

Sep 17, 2024

1.2.3

Sep 4, 2024

1.2.2

Sep 3, 2024

1.2.1

Aug 31, 2024

1.2.0

Aug 31, 2024

1.1.1

Aug 30, 2024

0.0.20

Jul 1, 2024

0.0.19

Jul 1, 2024

0.0.16

Jul 1, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

firecrawl-4.23.1.tar.gz (179.4 kB view details)

Uploaded Apr 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

firecrawl-4.23.1-py3-none-any.whl (225.0 kB view details)

Uploaded Apr 30, 2026 Python 3

File details

Details for the file firecrawl-4.23.1.tar.gz.

File metadata

Download URL: firecrawl-4.23.1.tar.gz
Upload date: Apr 30, 2026
Size: 179.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for firecrawl-4.23.1.tar.gz
Algorithm	Hash digest
SHA256	`8d599b1f3bf38c6c74082d83981d540a88a7cc30df00d808a783763df2b35ab6`
MD5	`ff987da0d5bad65d20e54ad4bba18715`
BLAKE2b-256	`165e934a5ec9a23b72b0d2a58f12e8b42856db705b4d0ed60664dd4f3f9a0181`

See more details on using hashes here.

File details

Details for the file firecrawl-4.23.1-py3-none-any.whl.

File metadata

Download URL: firecrawl-4.23.1-py3-none-any.whl
Upload date: Apr 30, 2026
Size: 225.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for firecrawl-4.23.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e793d4c6cf29e6e0419ecb8de339e22c2adec6511d8839dc8addc6c836afd445`
MD5	`215a265186095d53d33c5a89eb51426a`
BLAKE2b-256	`17067120a7eb38a5e4560a8fb99070bf56147ae6ebdf5b5fcbccddb2849b72eb`

See more details on using hashes here.

firecrawl 4.23.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Firecrawl Python SDK

Installation

Usage

Scraping a URL

Parsing uploaded files

Crawling a Website

Asynchronous Crawling

Checking Crawl Status

Manual Pagination (v2)

Cancelling a Crawl

Map a Website

Scrape-bound interactive browsing (v2)

Crawling a Website with WebSockets

Error Handling

Async Class

v1 compatibility

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes