CLI tool to scrape posts, comments, and reactions from private Facebook groups

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jwmoss

These details have not been verified by PyPI

Project description

Forage

CLI tool to scrape posts, comments, and reactions from private Facebook groups using browser automation.

Installation
Quick Start
Usage
Output Formats
Data Analysis
Development
Roadmap
Security
Support
License

Installation

From PyPI

# Install with pip
pip install ForageFacebook

# Or with uv
uv pip install ForageFacebook

# Install Playwright browsers
playwright install chromium

From Source

# Clone and install with uv
git clone https://github.com/jwmoss/forage.git
cd forage
uv sync

# Install Playwright browsers
uv run playwright install chromium

Quick Start

# Step 1: Log into Facebook (opens browser window)
uv run forage login

# Step 2: Scrape a group (last 7 days by default)
uv run forage scrape https://www.facebook.com/groups/your-group-id -o data.json

Usage

Login

# Default: opens Chromium browser
uv run forage login

# Use Firefox instead
uv run forage login --browser firefox

Scrape Posts

# Basic scrape (last 7 days, with comments)
uv run forage scrape your-group-slug

# Last 14 days, save to file
uv run forage scrape your-group-slug --days 14 -o posts.json

# Specific date range
uv run forage scrape your-group-slug --since 2024-01-01 --until 2024-01-15

# Skip comments (faster)
uv run forage scrape your-group-slug --skip-comments

# Only popular comments (5+ reactions)
uv run forage scrape your-group-slug --min-reactions 5

# Top 10 comments per post
uv run forage scrape your-group-slug --top-comments 10

# Watch the browser (debugging)
uv run forage scrape your-group-slug --no-headless -v

# Slower scraping to avoid rate limits
uv run forage scrape your-group-slug --delay 5.0

# Read group from stdin (for scripting)
echo "your-group-slug" | uv run forage scrape -

CLI Reference

forage [global flags] <command> [args]

Global Flags:
  -v, --verbose   Show progress and debug info
  -q, --quiet     Suppress non-error output
  --no-color      Disable colored output
  --version       Show version
  --help          Show help

Commands:
  login           Open browser for interactive Facebook login
  scrape          Scrape posts from a Facebook group

scrape flags

Flag	Default	Description
`--days`	`7`	Posts from last N days
`--since`	-	Start date (ISO 8601: YYYY-MM-DD)
`--until`	-	End date (ISO 8601: YYYY-MM-DD)
`--limit`	`0`	Max posts (0 = unlimited)
`--delay`	`2.0`	Seconds between page loads
`--min-reactions`	`0`	Min reactions for comments
`--top-comments`	`0`	Top N comments per post
`--skip-comments`	`false`	Skip comment fetching
`--skip-reactions`	`false`	Skip reaction counts
`-o, --output`	`-`	Output file (default: stdout)
`-f, --format`	`json`	Output format: json, sqlite, csv
`--no-headless`	`false`	Show browser window
`--browser`	`chromium`	Browser: chromium, firefox, webkit

SQLite Export

Export directly to SQLite for easier analysis:

# Export to SQLite database
uv run forage scrape your-group-slug -f sqlite -o data.db

# Query with sqlite3
sqlite3 data.db "SELECT content, reactions_total FROM posts ORDER BY reactions_total DESC LIMIT 10"

# Join posts with comments
sqlite3 data.db "SELECT p.content, c.content FROM posts p JOIN comments c ON c.post_id = p.id"

CSV Export

Export to CSV for spreadsheet analysis:

# Export to CSV (creates posts.csv and posts.comments.csv)
uv run forage scrape your-group-slug -f csv -o posts.csv

# Open in Excel/Numbers/Sheets or analyze with csvkit
csvstat posts.csv
csvcut -c author_name,content,reactions_total posts.csv | head -20

Output Format

{
  "group": {
    "id": "123456",
    "name": "My Group",
    "url": "https://www.facebook.com/groups/123456"
  },
  "scraped_at": "2024-01-20T15:30:00Z",
  "date_range": {
    "since": "2024-01-13",
    "until": "2024-01-20"
  },
  "posts": [
    {
      "id": "pfbid...",
      "author": {
        "name": "Jane Doe",
        "profile_url": "https://facebook.com/jane.doe"
      },
      "content": "Post text here...",
      "timestamp": "2024-01-19T12:00:00Z",
      "reactions": {
        "total": 42,
        "like": 0,
        "love": 0,
        "haha": 0,
        "wow": 0,
        "sad": 0,
        "angry": 0
      },
      "comments_count": 15,
      "comments": [
        {
          "id": "comment_...",
          "author": {"name": "John Smith", "profile_url": "..."},
          "content": "Comment text...",
          "timestamp": null,
          "reactions": {"total": 5},
          "replies": []
        }
      ]
    }
  ]
}

Data Analysis Examples

# Top 10 posts by reactions
uv run forage scrape mygroup --skip-comments | \
  jq '.posts | sort_by(.reactions.total) | reverse | .[0:10]'

# All post content
uv run forage scrape mygroup --skip-comments | \
  jq '.posts[].content'

# Posts with 50+ reactions
uv run forage scrape mygroup | \
  jq '.posts | map(select(.reactions.total >= 50))'

# Count posts per author
uv run forage scrape mygroup | \
  jq '.posts | group_by(.author.name) | map({author: .[0].author.name, count: length}) | sort_by(.count) | reverse'

Development

# Install dev dependencies
uv sync --extra dev

# Run type checker
uv run ty check src/

# Run tests
uv run pytest

Architecture

src/forage/
├── cli.py       # Click CLI commands
├── auth.py      # Session management (login, cookies)
├── scraper.py   # Core scraping logic
├── parser.py    # HTML parsing for posts/comments
└── models.py    # Pydantic data models

Limitations

Requires manual login (no automated auth)
Facebook's HTML structure changes frequently
Rate limiting may require slower scraping
Individual reaction types not broken out (only total)
Session cookies expire after ~30 days

Roadmap

Planned features and improvements:

High Priority

Cookie import - Import cookies from browser extensions (EditThisCookie, Netscape format)
Incremental scraping - Only fetch posts newer than last scrape
Progress persistence - Resume interrupted scrapes

Medium Priority

Multiple groups - Scrape multiple groups in one command
Media extraction - Download images/videos from posts
Reaction breakdown - Extract individual reaction types (like, love, etc.)
Author statistics - Aggregate stats per author
Scheduled scraping - Cron-friendly mode with locking

Nice to Have

Web UI - Local web interface for browsing scraped data
Webhook notifications - Notify on new posts matching criteria
Public group support - Scrape without login for public groups
Parallel scraping - Speed up multi-group scrapes

Contributing

See CONTRIBUTING.md for development setup and guidelines.

Security

See SECURITY.md for security considerations and best practices.

Support

If you find this tool useful, consider sponsoring development:

License

MPL-2.0 (Mozilla Public License 2.0)

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jwmoss

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.1.0 yanked

Jan 10, 2026

Reason this release was yanked:

Wrong release, need to do semver.

This version

1.0.9

May 24, 2026

1.0.8

Jan 26, 2026

1.0.7

Jan 26, 2026

1.0.6

Jan 17, 2026

1.0.5

Jan 13, 2026

1.0.4

Jan 10, 2026

1.0.3

Jan 10, 2026

1.0.2

Jan 10, 2026

1.0.1

Jan 10, 2026

1.0.0

Jan 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

foragefacebook-1.0.9.tar.gz (80.2 kB view details)

Uploaded May 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

foragefacebook-1.0.9-py3-none-any.whl (31.1 kB view details)

Uploaded May 24, 2026 Python 3

File details

Details for the file foragefacebook-1.0.9.tar.gz.

File metadata

Download URL: foragefacebook-1.0.9.tar.gz
Upload date: May 24, 2026
Size: 80.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for foragefacebook-1.0.9.tar.gz
Algorithm	Hash digest
SHA256	`610cab4e9d4c633ae98c6f0dd92d60fb48998f196a6c94e751062cd965f1cb06`
MD5	`f3a0026ef6c6b5aa52fb7ea577a644ac`
BLAKE2b-256	`2447e4dca50202e53018d8a6ca5c4e8c4a0c08651fa9ffec3792c9f200716da0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for foragefacebook-1.0.9.tar.gz:

Publisher: publish.yml on jwmoss/forage

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: foragefacebook-1.0.9.tar.gz
- Subject digest: 610cab4e9d4c633ae98c6f0dd92d60fb48998f196a6c94e751062cd965f1cb06
- Sigstore transparency entry: 1624480954
- Sigstore integration time: May 24, 2026
Source repository:
- Permalink: jwmoss/forage@ec491da9564dbd27babf35cb280c0dda02da4282
- Branch / Tag: refs/tags/v1.0.9
- Owner: https://github.com/jwmoss
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ec491da9564dbd27babf35cb280c0dda02da4282
- Trigger Event: release

File details

Details for the file foragefacebook-1.0.9-py3-none-any.whl.

File metadata

Download URL: foragefacebook-1.0.9-py3-none-any.whl
Upload date: May 24, 2026
Size: 31.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for foragefacebook-1.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`64870c33a4b25be903263d16c10bdec6b55e7384d245a7e24b72e648ed3360f3`
MD5	`1996ec09b017bd86293f90bacd88a7b6`
BLAKE2b-256	`4ca3b2204f8c32b8c26af5d49bee9ebe0e95268ff1d8b13c55d2806be193ec5b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for foragefacebook-1.0.9-py3-none-any.whl:

Publisher: publish.yml on jwmoss/forage

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: foragefacebook-1.0.9-py3-none-any.whl
- Subject digest: 64870c33a4b25be903263d16c10bdec6b55e7384d245a7e24b72e648ed3360f3
- Sigstore transparency entry: 1624481040
- Sigstore integration time: May 24, 2026
Source repository:
- Permalink: jwmoss/forage@ec491da9564dbd27babf35cb280c0dda02da4282
- Branch / Tag: refs/tags/v1.0.9
- Owner: https://github.com/jwmoss
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ec491da9564dbd27babf35cb280c0dda02da4282
- Trigger Event: release

ForageFacebook 1.0.9

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Forage

Table of Contents

Installation

From PyPI

From Source

Quick Start

Usage

Login

Scrape Posts

CLI Reference

scrape flags

SQLite Export

CSV Export

Output Format

Data Analysis Examples

Development

Architecture

Limitations

Roadmap

High Priority

Medium Priority

Nice to Have

Contributing

Security

Support

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance