Influence Archaeology OSINT Tool for mapping information flow across digital communities
Project description
DataDagger - Influence Archaeology OSINT Tool
DataDagger is a command-line OSINT tool for mapping how ideas, narratives, and information spread through digital communities over time. It helps security researchers and analysts understand information flow patterns and identify potential threats.
Important API Changes (2023)
⚠️ Twitter API Update: As of February 2023, Twitter no longer offers free API access. Basic tier starts at $100/month.
Current Platform Support:
- ✅ Reddit - Free API access available
- ⚠️ Twitter - Paid API required ($100+/month)
- 🆕 Mastodon - Free alternative to Twitter
- 🔮 Future: Planning support for Truth Social, Telegram, and other platforms
Alternatives to Twitter:
- Mastodon API - Free, decentralized social network
- Web scraping - Use with caution and respect robots.txt
- Academic access - Some researchers may qualify for special access
Features
- Narrative Tracking: Trace how specific ideas, memes, or narratives originated and evolved
- Influence Mapping: Map which accounts first pushed certain narratives
- Cross-Platform Analysis: Shows how information flows between different platforms
- Timeline Visualization: Create timeline visualizations showing idea evolution
- Network Analysis: Maps social networks of influence relationships
- Content Evolution: Track how stories change as they spread
Quick Install
pip install datadagger
That's it! Skip to Quick Start if you want to try it immediately.
Manual Installation
- Clone the repository
- Install dependencies:
pip install -r requirements.txt
- Set up your environment variables (see
.env.example)
Quick Start
# Install DataDagger
pip install datadagger
# Try the demo (no setup required)
datadagger demo
# Check platform pricing and options
datadagger pricing
# Setup free APIs
datadagger setup
# Search across free platforms
datadagger search "your topic" --platforms reddit,mastodon
Usage
Basic Commands
# Search for a narrative across free platforms
datadagger search "birds aren't real" --platforms reddit,mastodon --days 30
# Track narrative evolution (Reddit + Mastodon)
datadagger track --query "covid lab leak" --start-date 2020-01-01 --end-date 2023-01-01
# Analyze influence networks
datadagger network --hashtag "#climatechange" --depth 3
# Generate timeline visualization
datadagger timeline --narrative "flat earth" --output timeline.html
# Export data for analysis
datadagger export --query "misinformation" --format csv --output data.csv
Advanced Features
# Sentiment analysis over time
datadagger sentiment --query "vaccine" --platform mastodon --timeline
# Find patient zero of a narrative
datadagger origin --query "specific conspiracy theory" --threshold 0.8
# Cross-platform correlation analysis
datadagger correlate --query1 "narrative A" --query2 "narrative B"
# Use Twitter (if you have paid access)
datadagger search "breaking news" --platforms twitter --limit 100
Configuration
Copy .env.example to .env and configure your API keys:
Free Platforms (Recommended)
# Reddit API (Free)
REDDIT_CLIENT_ID=your_reddit_client_id
REDDIT_CLIENT_SECRET=your_reddit_client_secret
# Mastodon API (Free)
MASTODON_INSTANCE_URL=https://mastodon.social
MASTODON_ACCESS_TOKEN=your_token_here # Optional for public data
Paid Platforms
# Twitter API (Paid - $100+/month since Feb 2023)
TWITTER_BEARER_TOKEN=your_twitter_bearer_token
API Cost Breakdown (as of 2025)
| Platform | Cost | Features | Recommendation |
|---|---|---|---|
| FREE | Full API access | ✅ Start here | |
| Mastodon | FREE | Decentralized, Twitter-like | ✅ Great alternative |
| $100+/month | Official Twitter data | ⚠️ Only if budget allows |
Getting API Keys
-
Reddit (FREE):
- Go to https://www.reddit.com/prefs/apps/
- Create a "script" application
- Copy client ID and secret
-
Mastodon (FREE):
- Choose an instance (e.g., mastodon.social)
- Create account → Preferences → Development
- Create new application
-
Twitter (PAID):
- Go to https://developer.twitter.com/
- Apply for API access
- Choose Basic plan ($100/month) or higher
Legal and Ethical Use
This tool is designed for:
- Security research and threat intelligence
- Academic research on information diffusion
- Journalism and fact-checking
- Understanding misinformation patterns
Important: Only use this tool on publicly available data and in compliance with platform terms of service and applicable laws.
License
MIT License - See LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file datadagger-2.0.0.tar.gz.
File metadata
- Download URL: datadagger-2.0.0.tar.gz
- Upload date:
- Size: 53.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1985da14f90cf4f2b190c2f71cbcb670d806e399f8c311bf83a742f5c013d609
|
|
| MD5 |
503d39691825206932e740438be6d31b
|
|
| BLAKE2b-256 |
4de93cfc5a5216d990a64b2774abd0ac2ecb7deb50c7a34ec305cca7544f7f96
|
File details
Details for the file datadagger-2.0.0-py3-none-any.whl.
File metadata
- Download URL: datadagger-2.0.0-py3-none-any.whl
- Upload date:
- Size: 42.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
eea88e5e36690842dd14ae2349727582ef033a7581e0c3ec60e1052547544d55
|
|
| MD5 |
d7f61f693f3621bf2e1a67a201debdeb
|
|
| BLAKE2b-256 |
6d4c6d02d030168fee3e881783e2a141cc625a6695ee93427bbb08587faa9dfa
|