Parallel Tools: CLI and data enrichment utilities for the Parallel API

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

parallel-developers

These details have not been verified by PyPI

Project links

Documentation

Project description

Parallel-Web-Tools

CLI and data enrichment utilities for the Parallel API.

Note: This package provides the parallel-cli command-line tool and data enrichment utilities in the parallel-web-tools package. It depends on parallel-web, the official Parallel Python SDK, but does not contain it. Install parallel-web separately if you need direct SDK access.

Features

CLI for Humans & AI Agents - Works interactively or fully via command-line arguments
Web Search - AI-powered search with domain filtering and date ranges
Content Extraction - Extract clean markdown from any URL
Data Enrichment - Enrich CSV, DuckDB, and BigQuery data with AI
AI-Assisted Planning - Use natural language to define what data you want
Multiple Integrations - Polars, DuckDB, Snowflake, BigQuery, Spark

Installation

Standalone CLI (Recommended)

Install the standalone parallel-cli binary with everything bundled (no Python required):

curl -fsSL https://raw.githubusercontent.com/parallel-web/parallel-web-tools/main/install-cli.sh | bash

This automatically detects your platform (macOS/Linux, x64/arm64) and installs to ~/.local/bin.

Python Package

For programmatic usage or data enrichment integrations:

# Full install with CLI and all connectors
pip install parallel-web-tools[all]

# Library only (minimal dependencies)
pip install parallel-web-tools

# With specific connectors
pip install parallel-web-tools[cli]          # CLI only
pip install parallel-web-tools[polars]       # Polars DataFrame
pip install parallel-web-tools[duckdb]       # DuckDB
pip install parallel-web-tools[bigquery]     # BigQuery
pip install parallel-web-tools[spark]        # Apache Spark

CLI Overview

parallel-cli
├── auth                    # Check authentication status
├── login                   # OAuth login (or use PARALLEL_API_KEY env var)
├── logout                  # Remove stored credentials
├── search                  # Web search
├── extract                 # Extract content from URLs
└── enrich                  # Data enrichment commands
    ├── run                 # Run enrichment
    ├── plan                # Create YAML config
    ├── suggest             # AI suggests output columns
    └── deploy              # Deploy to cloud systems (BigQuery, etc.)

Quick Start

1. Authenticate

# Interactive OAuth login
parallel-cli login

# Or set environment variable
export PARALLEL_API_KEY=your_api_key

2. Search the Web

# Natural language search
parallel-cli search "What is Anthropic's latest AI model?" --json

# Keyword search with filters
parallel-cli search -q "bitcoin price" --after-date 2024-01-01 --json

# Search specific domains
parallel-cli search "SEC filings for Apple" --include-domains sec.gov --json

3. Extract Content from URLs

# Extract content as markdown
parallel-cli extract https://example.com --json

# Extract with a specific focus
parallel-cli extract https://company.com --objective "Find pricing info" --json

# Get full page content
parallel-cli extract https://example.com --full-content --json

4. Enrich Data

# Let AI suggest what columns to add
parallel-cli enrich suggest "Find the CEO and annual revenue" --json

# Create a config file (interactive)
parallel-cli enrich plan -o config.yaml

# Create a config file (non-interactive, for AI agents)
parallel-cli enrich plan -o config.yaml \
    --source-type csv \
    --source companies.csv \
    --target enriched.csv \
    --source-columns '[{"name": "company", "description": "Company name"}]' \
    --intent "Find the CEO and annual revenue"

# Run enrichment from config
parallel-cli enrich run config.yaml

# Run enrichment directly (no config file needed)
parallel-cli enrich run \
    --source-type csv \
    --source companies.csv \
    --target enriched.csv \
    --source-columns '[{"name": "company", "description": "Company name"}]' \
    --intent "Find the CEO and annual revenue"

5. Deploy to Cloud Systems

# Deploy to BigQuery for SQL-native enrichment
parallel-cli enrich deploy --system bigquery --project my-gcp-project

Non-Interactive Mode (for AI Agents & Scripts)

All commands support --json output and can be fully controlled via CLI arguments:

# Search with JSON output
parallel-cli search "query" --json

# Extract with JSON output
parallel-cli extract https://url.com --json

# Suggest columns with JSON output
parallel-cli enrich suggest "Find CEO" --json

# Plan without prompts (provide all args)
parallel-cli enrich plan -o config.yaml \
    --source-type csv \
    --source input.csv \
    --target output.csv \
    --source-columns '[{"name": "company", "description": "Company name"}]' \
    --enriched-columns '[{"name": "ceo", "description": "CEO name"}]'

# Or use --intent to let AI determine the columns
parallel-cli enrich plan -o config.yaml \
    --source-type csv \
    --source input.csv \
    --target output.csv \
    --source-columns '[{"name": "company", "description": "Company name"}]' \
    --intent "Find CEO, revenue, and headquarters"

Integrations

Integration	Type	Install	Documentation
Polars	Python DataFrame	`pip install parallel-web-tools[polars]`	Setup Guide
DuckDB	SQL + Python	`pip install parallel-web-tools[duckdb]`	Setup Guide
Snowflake	SQL UDF	`pip install parallel-web-tools[snowflake]`	Setup Guide
BigQuery	Cloud Function	`pip install parallel-web-tools[bigquery]`	Setup Guide
Spark	SQL UDF	`pip install parallel-web-tools[spark]`	Demo Notebook

Quick Integration Examples

Polars:

import polars as pl
from parallel_web_tools.integrations.polars import parallel_enrich

df = pl.DataFrame({"company": ["Google", "Microsoft"]})
result = parallel_enrich(
    df,
    input_columns={"company_name": "company"},
    output_columns=["CEO name", "Founding year"],
)
print(result.result)

DuckDB:

import duckdb
from parallel_web_tools.integrations.duckdb import enrich_table

conn = duckdb.connect()
conn.execute("CREATE TABLE companies AS SELECT 'Google' as name")
result = enrich_table(
    conn,
    source_table="companies",
    input_columns={"company_name": "name"},
    output_columns=["CEO name", "Founding year"],
)
print(result.result.fetchdf())

Programmatic Usage

from parallel_web_tools import run_enrichment, run_enrichment_from_dict

# From YAML file
run_enrichment("config.yaml")

# From dictionary
run_enrichment_from_dict({
    "source": "data.csv",
    "target": "enriched.csv",
    "source_type": "csv",
    "source_columns": [{"name": "company", "description": "Company name"}],
    "enriched_columns": [{"name": "ceo", "description": "CEO name"}]
})

YAML Configuration Format

source: input.csv
target: output.csv
source_type: csv  # csv, duckdb, or bigquery
processor: core-fast  # lite, base, core, pro, ultra (add -fast for speed)

source_columns:
  - name: company_name
    description: The name of the company

enriched_columns:
  - name: ceo
    description: The CEO of the company
    type: str  # str, int, float, bool
  - name: revenue
    description: Annual revenue in USD
    type: float

Environment Variables

Variable	Description
`PARALLEL_API_KEY`	API key for authentication (alternative to `parallel-cli login`)
`DUCKDB_FILE`	Default DuckDB file path
`BIGQUERY_PROJECT`	Default BigQuery project ID

Related Packages

parallel-web - Official Parallel Python SDK (this package depends on it)

Development

git clone https://github.com/parallel-web/parallel-web-tools.git
cd parallel-web-tools
uv sync --all-extras
uv run pytest tests/ -v

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

parallel-developers

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

0.3.1rc1 pre-release

Apr 24, 2026

0.3.0

May 6, 2026

0.3.0rc4 pre-release

May 6, 2026

0.3.0rc3 pre-release

May 6, 2026

0.3.0rc2 pre-release

May 5, 2026

0.3.0rc1 pre-release

May 5, 2026

0.2.0

Mar 12, 2026

0.1.3

Mar 12, 2026

0.1.2

Mar 11, 2026

0.1.2rc2 pre-release

Mar 11, 2026

0.1.2rc1 pre-release

Mar 8, 2026

0.1.1

Mar 6, 2026

0.1.0

Mar 6, 2026

0.1.0rc4 pre-release

Mar 6, 2026

0.1.0rc1 pre-release

Mar 6, 2026

0.0.15rc2 pre-release

Mar 6, 2026

0.0.15rc1 pre-release

Mar 6, 2026

0.0.14

Feb 9, 2026

0.0.13

Feb 7, 2026

0.0.13rc2 pre-release

Feb 7, 2026

0.0.13rc1 pre-release

Feb 7, 2026

0.0.12

Feb 3, 2026

0.0.11

Jan 31, 2026

0.0.10

Jan 28, 2026

0.0.9

Jan 27, 2026

0.0.9rc1 pre-release

Jan 27, 2026

0.0.8

Jan 27, 2026

0.0.7

Jan 26, 2026

0.0.6

Jan 26, 2026

0.0.5

Jan 25, 2026

0.0.4

Jan 23, 2026

0.0.3

Jan 23, 2026

0.0.2

Jan 23, 2026

This version

0.0.1

Jan 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parallel_web_tools-0.0.1.tar.gz (189.3 kB view details)

Uploaded Jan 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

parallel_web_tools-0.0.1-py3-none-any.whl (61.0 kB view details)

Uploaded Jan 23, 2026 Python 3

File details

Details for the file parallel_web_tools-0.0.1.tar.gz.

File metadata

Download URL: parallel_web_tools-0.0.1.tar.gz
Upload date: Jan 23, 2026
Size: 189.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for parallel_web_tools-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`57062d2c7a798a2969a62d8aea7bc54f9c803c1bfcb3099a542890d9507d3816`
MD5	`98f01269e178187ef311db9fc57e0264`
BLAKE2b-256	`69d7f139fd626f94773d4baaeecc6070b9e4a75b7d8868f97a0b381095ddee60`

See more details on using hashes here.

Provenance

The following attestation bundles were made for parallel_web_tools-0.0.1.tar.gz:

Publisher: publish.yml on parallel-web/parallel-web-tools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: parallel_web_tools-0.0.1.tar.gz
- Subject digest: 57062d2c7a798a2969a62d8aea7bc54f9c803c1bfcb3099a542890d9507d3816
- Sigstore transparency entry: 845900475
- Sigstore integration time: Jan 23, 2026
Source repository:
- Permalink: parallel-web/parallel-web-tools@05d7495be1a26b07a6591b938c9127dde06f39e1
- Branch / Tag: refs/tags/v0.0.1
- Owner: https://github.com/parallel-web
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@05d7495be1a26b07a6591b938c9127dde06f39e1
- Trigger Event: release

File details

Details for the file parallel_web_tools-0.0.1-py3-none-any.whl.

File metadata

Download URL: parallel_web_tools-0.0.1-py3-none-any.whl
Upload date: Jan 23, 2026
Size: 61.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for parallel_web_tools-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e7dc2a0c6d245e767bccb83a22aab73db4c6bc1a90b69dd71477432016e56356`
MD5	`007877411fb7a880b092281c53d8b72d`
BLAKE2b-256	`6b5d6fb1ab0b947b7549ba2cebe31f0be688ec0f0fc7525a418fd72cae6b0b6f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for parallel_web_tools-0.0.1-py3-none-any.whl:

Publisher: publish.yml on parallel-web/parallel-web-tools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: parallel_web_tools-0.0.1-py3-none-any.whl
- Subject digest: e7dc2a0c6d245e767bccb83a22aab73db4c6bc1a90b69dd71477432016e56356
- Sigstore transparency entry: 845900479
- Sigstore integration time: Jan 23, 2026
Source repository:
- Permalink: parallel-web/parallel-web-tools@05d7495be1a26b07a6591b938c9127dde06f39e1
- Branch / Tag: refs/tags/v0.0.1
- Owner: https://github.com/parallel-web
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@05d7495be1a26b07a6591b938c9127dde06f39e1
- Trigger Event: release

parallel-web-tools 0.0.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Parallel-Web-Tools

Features

Installation

Standalone CLI (Recommended)

Python Package

CLI Overview

Quick Start

1. Authenticate

2. Search the Web

3. Extract Content from URLs

4. Enrich Data

5. Deploy to Cloud Systems

Non-Interactive Mode (for AI Agents & Scripts)

Integrations

Quick Integration Examples

Programmatic Usage

YAML Configuration Format

Environment Variables

Related Packages

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance