Skip to main content

A polite, RSS-first Economic Times news and sentiment collector for market research.

Project description

et_scraper_safe

A safe, polite Python library for collecting public Economic Times data (news headlines and summaries) for market and sentiment research.

Safety rules

  • Respects robots.txt before fetching any HTML page.
  • Does not bypass logins, paywalls, captchas, Cloudflare, or rate limits.
  • Prefers RSS feeds over HTML scraping.
  • Adds a delay between HTML requests.
  • Saves both raw and cleaned data as CSV.
  • Returns pandas DataFrames so it plugs directly into analysis pipelines.

Folder structure

et_scraper_safe/
├── requirements.txt
├── main.py
├── README.md
├── et_scraper/
│   ├── __init__.py
│   ├── config.py
│   ├── robots_checker.py
│   ├── rss_collector.py
│   ├── html_collector.py
│   ├── parser.py
│   ├── sentiment.py
│   └── storage.py
└── data/
    ├── raw/
    └── clean/

Install & run

cd et_scraper_safe
pip install -r requirements.txt
python main.py

Output example

source,category,title,summary,link,published,sentiment_score,sentiment_label
economic_times,stocks,Tata Motors shares rally...,...,link,...,2,Bullish
economic_times,economy,Rupee falls against dollar...,...,link,...,-1,Bearish

Use in a swing trading pipeline

Economic Times News
        ↓
Headline Sentiment
        ↓
Stock Symbol Mapping
        ↓
Technical Indicators
        ↓
Final Swing Score

Library usage

from et_scraper import fetch_all_rss_news, sentiment_score, sentiment_label

df = fetch_all_rss_news()
df["sentiment_score"] = df["title"].apply(sentiment_score)
df["sentiment_label"] = df["sentiment_score"].apply(sentiment_label)

Categories collected

Latest, Markets, Stocks, Economy, Business, IPO, Mutual Funds, Commodities, Forex.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

et_scraper_safe-0.1.0.tar.gz (6.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

et_scraper_safe-0.1.0-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file et_scraper_safe-0.1.0.tar.gz.

File metadata

  • Download URL: et_scraper_safe-0.1.0.tar.gz
  • Upload date:
  • Size: 6.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for et_scraper_safe-0.1.0.tar.gz
Algorithm Hash digest
SHA256 37c275cc32ae3c8f0972bb514e1bc104d8fd299e0bf48c4829744268a3ea1528
MD5 04aecd630773a2896f4e9ec7924c6349
BLAKE2b-256 865883cd60e2969a95aa9a70d5759caaaff0b82c5785de84d99cb38e977f9e52

See more details on using hashes here.

File details

Details for the file et_scraper_safe-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for et_scraper_safe-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b82866cc196524f8ba136112bbb2fc88b9f651521da00b1c1a9672e62cdd672f
MD5 41449cbc93aac1ac7a7e0d86803790f8
BLAKE2b-256 33c056df7ce6d6aa7af0d6db38e6ec3daf3243c63bb8cf3c5b904abfee61c4eb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page