Skip to main content

Influence Archaeology OSINT Tool for mapping information flow across digital communities

Project description

DataDagger - Influence Archaeology OSINT Tool

DataDagger is a command-line OSINT tool for mapping how ideas, narratives, and information spread through digital communities over time. It helps security researchers and analysts understand information flow patterns and identify potential threats.

Important API Changes (2023)

⚠️ Twitter API Update: As of February 2023, Twitter no longer offers free API access. Basic tier starts at $100/month.

Current Platform Support:

  • Reddit - Free API access available
  • ⚠️ Twitter - Paid API required ($100+/month)
  • 🆕 Mastodon - Free alternative to Twitter
  • 🔮 Future: Planning support for Truth Social, Telegram, and other platforms

Alternatives to Twitter:

  1. Mastodon API - Free, decentralized social network
  2. Web scraping - Use with caution and respect robots.txt
  3. Academic access - Some researchers may qualify for special access

Features

  • Narrative Tracking: Trace how specific ideas, memes, or narratives originated and evolved
  • Influence Mapping: Map which accounts first pushed certain narratives
  • Cross-Platform Analysis: Shows how information flows between different platforms
  • Timeline Visualization: Create timeline visualizations showing idea evolution
  • Network Analysis: Maps social networks of influence relationships
  • Content Evolution: Track how stories change as they spread

Quick Install

pip install datadagger

That's it! Skip to Quick Start if you want to try it immediately.

Manual Installation

  1. Clone the repository
  2. Install dependencies:
    pip install -r requirements.txt
    
  3. Set up your environment variables (see .env.example)

Quick Start

# Install DataDagger
pip install datadagger

# Try the demo (no setup required)
datadagger demo

# Check platform pricing and options
datadagger pricing

# Setup free APIs  
datadagger setup

# Search across free platforms
datadagger search "your topic" --platforms reddit,mastodon

Usage

Basic Commands

# Search for a narrative across free platforms
datadagger search "birds aren't real" --platforms reddit,mastodon --days 30

# Track narrative evolution (Reddit + Mastodon)  
datadagger track --query "covid lab leak" --start-date 2020-01-01 --end-date 2023-01-01

# Analyze influence networks
datadagger network --hashtag "#climatechange" --depth 3

# Generate timeline visualization
datadagger timeline --narrative "flat earth" --output timeline.html

# Export data for analysis
datadagger export --query "misinformation" --format csv --output data.csv

Advanced Features

# Sentiment analysis over time
datadagger sentiment --query "vaccine" --platform mastodon --timeline

# Find patient zero of a narrative
datadagger origin --query "specific conspiracy theory" --threshold 0.8

# Cross-platform correlation analysis
datadagger correlate --query1 "narrative A" --query2 "narrative B"

# Use Twitter (if you have paid access)
datadagger search "breaking news" --platforms twitter --limit 100

Configuration

Copy .env.example to .env and configure your API keys:

Free Platforms (Recommended)

# Reddit API (Free)
REDDIT_CLIENT_ID=your_reddit_client_id
REDDIT_CLIENT_SECRET=your_reddit_client_secret

# Mastodon API (Free) 
MASTODON_INSTANCE_URL=https://mastodon.social
MASTODON_ACCESS_TOKEN=your_token_here  # Optional for public data

Paid Platforms

# Twitter API (Paid - $100+/month since Feb 2023)
TWITTER_BEARER_TOKEN=your_twitter_bearer_token

API Cost Breakdown (as of 2025)

Platform Cost Features Recommendation
Reddit FREE Full API access Start here
Mastodon FREE Decentralized, Twitter-like Great alternative
Twitter $100+/month Official Twitter data ⚠️ Only if budget allows

Getting API Keys

  1. Reddit (FREE):

  2. Mastodon (FREE):

    • Choose an instance (e.g., mastodon.social)
    • Create account → Preferences → Development
    • Create new application
  3. Twitter (PAID):

Legal and Ethical Use

This tool is designed for:

  • Security research and threat intelligence
  • Academic research on information diffusion
  • Journalism and fact-checking
  • Understanding misinformation patterns

Important: Only use this tool on publicly available data and in compliance with platform terms of service and applicable laws.

License

MIT License - See LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datadagger-2.0.0.tar.gz (53.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datadagger-2.0.0-py3-none-any.whl (42.1 kB view details)

Uploaded Python 3

File details

Details for the file datadagger-2.0.0.tar.gz.

File metadata

  • Download URL: datadagger-2.0.0.tar.gz
  • Upload date:
  • Size: 53.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for datadagger-2.0.0.tar.gz
Algorithm Hash digest
SHA256 1985da14f90cf4f2b190c2f71cbcb670d806e399f8c311bf83a742f5c013d609
MD5 503d39691825206932e740438be6d31b
BLAKE2b-256 4de93cfc5a5216d990a64b2774abd0ac2ecb7deb50c7a34ec305cca7544f7f96

See more details on using hashes here.

File details

Details for the file datadagger-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: datadagger-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 42.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for datadagger-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 eea88e5e36690842dd14ae2349727582ef033a7581e0c3ec60e1052547544d55
MD5 d7f61f693f3621bf2e1a67a201debdeb
BLAKE2b-256 6d4c6d02d030168fee3e881783e2a141cc625a6695ee93427bbb08587faa9dfa

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page