Twitter Lead Generation + Healthcare agent (OpenRouter + Airtable) with optional FastAPI and poller

These details have not been verified by PyPI

Project links

Project description

DocmapLabs Marketing Package

A pip-installable toolkit to discover relevant healthcare posts and extract leads using either:

an OpenRouter LLM for insights/summaries, and/or
your Hugging Face multi-head leads classifier (intent + symptoms + specialties).

Optional connectors save to Airtable/CSV/SQLite and sync contacts to HubSpot. Includes a FastAPI server, a rate-limit-aware CLI, and an optional poller.

Install

# core
python -m pip install --user docmaplabs
# extras
python -m pip install --user "docmaplabs[server]"   # FastAPI
python -m pip install --user "docmaplabs[ml]"       # HF model inference
python -m pip install --user "docmaplabs[scrape]"   # snscrape (optional)
# one-liner (all extras)
python -m pip install --user "docmaplabs[all]"

Python >= 3.8. Add user bin to PATH if needed: export PATH="$HOME/.local/bin:$PATH".

Environment variables (set only what you need)

OpenRouter (LLM)
- OPENROUTER_API_KEY (required for real LLM results)
- OPENROUTER_MODEL (default: meta-llama/llama-3.1-8b-instruct:free)
Twitter fetch (optional, for CLI fetch)
- TWITTER_BEARER_TOKEN (App-only bearer token)
Hugging Face classifier (optional)
- HF_LEADS_REPO_ID (e.g., your-org/docmap-leads-classifier-v1)
- LEADS_THRESHOLD (default 0.3), LEADS_DEVICE (auto|cpu|cuda), LEADS_BASE_MODEL (default microsoft/deberta-v3-base)
Airtable (optional)
- AIRTABLE_API_KEY, AIRTABLE_BASE_ID
- AIRTABLE_TABLE_HEALTHCARE_INSIGHTS (default Healthcare Insights)
- AIRTABLE_TABLE_LEADS (default Leads)
HubSpot (optional)
- HUBSPOT_ACCESS_TOKEN (private app token; contacts read/write)

Tokens are function-scoped. If a token is missing and a feature needs it, that feature is skipped or a clear error is raised. LLM calls fallback to simple heuristics if OPENROUTER_API_KEY is absent. You can interactively create a .env via --prompt-creds and/or load an env via --env-file.

CLI usage

Analyze existing posts (from file or stdin) and save insights:

docmaplabs-marketing analyze-healthcare \
  --tweets ./tweets.json \
  --keywords NHS "waiting times" GP \
  --region UK \
  --save csv:/tmp/insights.csv \
  --out /tmp/analysis.json

End-to-end: fetch → analyze → save insights and leads (rate-limit aware):

docmaplabs-marketing run-keywords \
  --keywords NHS "waiting times" GP \
  --region UK \
  --analysis-save csv:/tmp/insights.csv \
  --lead-csv /tmp/leads.csv \
  --lead-airtable \
  --lead-hubspot \
  --prompt-creds     # interactively prompt for missing tokens and optionally save to ~/.docmaplabs_marketing/.env
  --env-file /path/to/.env   # load a specific env profile (per account/workspace)

Storage targets:

Insights: airtable | csv:/path/file.csv | sqlite:/path/db.sqlite[::table]
Leads: --lead-csv path, --lead-airtable, --lead-hubspot

Notes:

If your Python lacks SQLite, use CSV/Airtable (the tool reports a clear error otherwise).
Twitter Free tier rate limits apply; the client backs off automatically using reset headers (429-aware with reset).
snscrape is provided as best-effort (--source snscrape). It may break due to upstream changes; prefer the Twitter API for reliability.

Server (FastAPI)

docmaplabs-marketing-api --host 0.0.0.0 --port 8000

Endpoints:

GET /health → { ok: true }
POST /analyze-healthcare
- Body: { "tweets": [string | {id, author|handle|user, text}], "keywords": [string], "region": "UK" }
- Returns HealthcareAnalysis and attempts to save insights to Airtable if configured
POST /generate-travel-advice
- Body: { "query": "Tokyo 3 days", "triggerUser": "@you" }
- Returns Advice
POST /classify-posts
- Body: { "texts": ["post text", ...], "threshold": 0.3 }
- Returns { predictions: [{ intent, symptoms[], specialties[] }, ...] }

Using the DocMap UK triage model (optional)

Recommended HF repo: rabbitfishai/docmap-uk-triage-merged-qwen2.5-7b
Configure via env or pyproject:
- Env: HF_TRIAGE_REPO_ID=... (or HF_TRIAGE_ENDPOINT for a private HF Inference Endpoint), optional HUGGINGFACEHUB_API_TOKEN.
- Pyproject defaults: under [tool.docmaplabs_marketing] set triage_repo_id, triage_system_prompt.
Intended use and safety:
- Informational guidance; not a diagnosis. For life-threatening emergencies call 999; for urgent concerns call 111. Not affiliated with the NHS.
- Add this disclaimer in your UI/API and logs.

Python API (library)

from docmaplabs_marketing_package.agent import analyze_healthcare_tweets, generate_travel_advice
from docmaplabs_marketing_package.leadgen import extract_leads, classify_posts_with_hf

# Analyze tweets
tweets = [
  {"id":"1","author":"@nhs_user","text":"GP appointments delayed again."},
  "Hospitals in London facing staffing issues",
]
analysis = analyze_healthcare_tweets(tweets, keywords=["NHS","waiting times"], region="UK")
leads = extract_leads(analysis, min_relevance=0.6)

# HF classifier
preds = classify_posts_with_hf(["Any advice on fever? Based in Glasgow"], threshold=0.3)

# Travel advice
advice = generate_travel_advice("Tokyo 3 days in October", "@you")
print(advice.summary)

Data models

HealthcareAnalysis → summary: str, insights: List[HealthcareInsight]
HealthcareInsight → tweetId, author, text, isHealthcareRelated, isUKRelated, categories[], painPoints[], sentiment, urgency, relevance, contacts[]
Lead → handle, name?, sourceTweetId?, relevance, notes?
Advice → title, summary, itinerary[{day,plan}], costEstimateUSD?, confidence?

Airtable schema (optional)

Create:

Healthcare Insights with: TweetId, Author, Text (long text), IsHealthcareRelated (checkbox), IsUKRelated (checkbox), Categories, PainPoints, Sentiment, Urgency (number), Relevance (number), Contacts, Summary (long text)
Leads with: Handle, Name, SourceTweetId, Relevance (number), Notes

Poller (optional, rate-limit aware)

Runs a loop that:

fetches with backoff (Twitter API),
analyzes with OpenRouter (or heuristic fallback),
extracts leads, and
persists since_id under ~/.cache/docmaplabs_marketing/ to avoid reprocessing.

docmaplabs-marketing-poller
# configure with env (.env or --env-file) and POLL_INTERVAL_SECONDS (default 60)

Each iteration prints a JSON line like:

{ "fetched": 50, "leads": 7, "newest_id": "1871234" }

You can also import and call poller.run(once=True, keywords=[...]) in Python.

Safety and Intended Use (UK)

This toolkit and any referenced models provide general information and lead identification; they are not medical devices.
Do not use for diagnosis or emergency triage. For life‑threatening emergencies, call 999; for urgent concerns, call 111.
Ensure GDPR/DPA compliance, publish a privacy policy, and obtain consent where required (e.g., direct outreach).
Advertising: follow CAP/ASA rules; avoid implying NHS endorsement.

Changelog

0.2.0: HF classifier integration, /classify-posts endpoint, snscrape optional source, Twitter API rate-limit backoff, env-file support, poller with since_id state, docs polish, packaging metadata, [all] extra.
0.1.0: Initial public version with CLI, FastAPI, Airtable/CSV/SQLite storage, HubSpot sync (optional), and Twitter fetch.

License

MIT Open Source Package for Marketing

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.1

Aug 19, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docmaplabs-0.2.1.tar.gz (22.5 kB view details)

Uploaded Aug 19, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docmaplabs-0.2.1-py3-none-any.whl (27.2 kB view details)

Uploaded Aug 19, 2025 Python 3

File details

Details for the file docmaplabs-0.2.1.tar.gz.

File metadata

Download URL: docmaplabs-0.2.1.tar.gz
Upload date: Aug 19, 2025
Size: 22.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.13

File hashes

Hashes for docmaplabs-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`907c9d53e16df5c4b3cccef9a052d97ed14c0459e85d65e536e415d84b031a6b`
MD5	`8b260a94fd5e71519df68b055b613de0`
BLAKE2b-256	`3253dedcf7b73e626c8b036a96589d6db738dd600c0487b59f29f364c97d2a53`

See more details on using hashes here.

File details

Details for the file docmaplabs-0.2.1-py3-none-any.whl.

File metadata

Download URL: docmaplabs-0.2.1-py3-none-any.whl
Upload date: Aug 19, 2025
Size: 27.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.13

File hashes

Hashes for docmaplabs-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5c4909019d0eaf3811589871f9b2c4e266021d41e59d13dc2b444e196be830d3`
MD5	`d89ce3c17f9bb732c63006de775f4980`
BLAKE2b-256	`610a2a723487b9d3d7511cc1f14079b898f7621eca0a7896d312f0be8e115e96`

See more details on using hashes here.

docmaplabs 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

DocmapLabs Marketing Package

Install

Environment variables (set only what you need)

CLI usage

Server (FastAPI)

Using the DocMap UK triage model (optional)

Python API (library)

Data models

Airtable schema (optional)

Poller (optional, rate-limit aware)

Safety and Intended Use (UK)

Changelog

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes