A polite, RSS-first Economic Times news and sentiment collector for market research.
et_scraper_safe
A safe, polite Python library for collecting public Economic Times data (news headlines and summaries) for market and sentiment research.
Safety rules
- Respects `robots.txt` before fetching any HTML page.
- Does not bypass logins, paywalls, captchas, Cloudflare, or rate limits.
- Prefers RSS feeds over HTML scraping.
- Adds a delay between HTML requests.
- Saves both raw and cleaned data as CSV.
- Returns pandas DataFrames so it plugs directly into analysis pipelines.
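The first and fourth rules can be illustrated with the standard library alone. The sketch below is not the package's `robots_checker.py`; the names `build_robots_parser` and `PoliteGate`, the user agent string, and the two-second default delay are all assumptions made for illustration. It checks a URL against already-downloaded `robots.txt` text and enforces a minimum gap between permitted requests.

```python
import time
from urllib.robotparser import RobotFileParser


def build_robots_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


class PoliteGate:
    """Hypothetical helper: allow a request only if robots.txt permits it,
    and keep at least `min_delay_seconds` between permitted requests."""

    def __init__(self, parser: RobotFileParser, user_agent: str,
                 min_delay_seconds: float = 2.0):  # default delay is an assumption
        self.parser = parser
        self.user_agent = user_agent
        self.min_delay_seconds = min_delay_seconds
        self._last_request = None  # monotonic timestamp of the last permitted request

    def wait_for_turn(self, url: str) -> bool:
        """Return False if the URL is disallowed; otherwise sleep if needed, then return True."""
        if not self.parser.can_fetch(self.user_agent, url):
            return False
        if self._last_request is not None:
            remaining = self.min_delay_seconds - (time.monotonic() - self._last_request)
            if remaining > 0:
                time.sleep(remaining)
        self._last_request = time.monotonic()
        return True


if __name__ == "__main__":
    sample_rules = "User-agent: *\nDisallow: /private/\n"
    gate = PoliteGate(build_robots_parser(sample_rules), "et-scraper-demo", 0.5)
    for url in ("https://example.com/news/a", "https://example.com/private/x"):
        print(url, "allowed" if gate.wait_for_turn(url) else "blocked by robots.txt")
```

A caller would run `wait_for_turn` before every HTML request and skip the URL when it returns `False`.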
Folder structure
et_scraper_safe/
├── requirements.txt
├── main.py
├── README.md
├── et_scraper/
│ ├── __init__.py
│ ├── config.py
│ ├── robots_checker.py
│ ├── rss_collector.py
│ ├── html_collector.py
│ ├── parser.py
│ ├── sentiment.py
│ └── storage.py
└── data/
  ├── raw/
  └── clean/
Install & run
cd et_scraper_safe
pip install -r requirements.txt
python main.py
Output example
source,category,title,summary,link,published,sentiment_score,sentiment_label
economic_times,stocks,Tata Motors shares rally...,...,link,...,2,Bullish
economic_times,economy,Rupee falls against dollar...,...,link,...,-1,Bearish
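The integer scores and labels in the sample rows are consistent with a simple count-based scorer, though how the package actually computes them is not described here. The following is only a minimal sketch under that assumption: the word lists, the function names `keyword_score` and `label_for`, and the `Neutral` label for a zero score are all hypothetical.

```python
import re

# Illustrative word lists; the package's real vocabulary is not documented here.
BULLISH_WORDS = {"rally", "surge", "gain", "gains", "jumps", "upgrade", "profit"}
BEARISH_WORDS = {"fall", "falls", "slump", "loss", "downgrade", "plunge", "crash"}


def keyword_score(headline: str) -> int:
    """Count bullish words minus bearish words in a headline."""
    words = re.findall(r"[a-z]+", headline.lower())
    bullish = sum(word in BULLISH_WORDS for word in words)
    bearish = sum(word in BEARISH_WORDS for word in words)
    return bullish - bearish


def label_for(score: int) -> str:
    """Map a score to a label; 'Neutral' for zero is an assumption."""
    if score > 0:
        return "Bullish"
    if score < 0:
        return "Bearish"
    return "Neutral"


if __name__ == "__main__":
    for headline in ("Tata Motors shares rally", "Rupee falls against dollar"):
        score = keyword_score(headline)
        print(headline, score, label_for(score))
```

A count-based scorer like this is transparent and fast, but it ignores negation and context, so its labels are best treated as a rough signal.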
Use in a swing trading pipeline
Economic Times News
↓
Headline Sentiment
↓
Stock Symbol Mapping
↓
Technical Indicators
↓
Final Swing Score
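The package itself covers only the first two stages of this flow. As a rough sketch of how the later stages might be wired together, the code below maps a headline to a ticker and blends a sentiment score with a technical score. Everything in it is an assumption for illustration: the `SYMBOL_MAP` entries, the function names, the clipping range of -3 to 3, and the 0.4 sentiment weight.

```python
from typing import Optional

# Hypothetical lookup table: company name as it appears in headlines -> ticker.
SYMBOL_MAP = {
    "tata motors": "TATAMOTORS",
    "infosys": "INFY",
}


def map_symbol(headline: str) -> Optional[str]:
    """Return the first ticker whose company name appears in the headline."""
    text = headline.lower()
    for name, symbol in SYMBOL_MAP.items():
        if name in text:
            return symbol
    return None


def swing_score(sentiment: int, technical: float,
                sentiment_weight: float = 0.4) -> float:
    """Blend sentiment and technical scores into one value.

    `sentiment` is clipped to [-3, 3] and scaled to [-1, 1];
    `technical` is assumed to be in [-1, 1] already.
    """
    scaled_sentiment = max(-3, min(3, sentiment)) / 3
    return sentiment_weight * scaled_sentiment + (1 - sentiment_weight) * technical


if __name__ == "__main__":
    headline = "Tata Motors shares rally"
    print(map_symbol(headline), round(swing_score(sentiment=2, technical=0.5), 3))
```

Substring matching is the simplest possible mapping and will miss abbreviations and name variants; a production pipeline would need a more careful entity matcher.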
Library usage
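from et_scraper import fetch_all_rss_news, sentiment_score, sentiment_label

# Collect headlines and summaries from the RSS feeds into a pandas DataFrame.
df = fetch_all_rss_news()

# Score each headline, then turn the numeric score into a label.
df["sentiment_score"] = df["title"].apply(sentiment_score)
df["sentiment_label"] = df["sentiment_score"].apply(sentiment_label)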
Categories collected
Latest, Markets, Stocks, Economy, Business, IPO, Mutual Funds, Commodities, Forex.
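Each category is collected from RSS where possible. As a minimal sketch of what that step involves, the code below turns one RSS 2.0 document into rows that use the column names from the output example. It is not the package's `rss_collector.py`: the function name `parse_rss_items` and the inline sample feed are assumptions, and no real feed URL is used.

```python
import xml.etree.ElementTree as ET

# Inline sample so the sketch runs without network access.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Sample feed</title>
    <item>
      <title>Example headline one</title>
      <description>Short summary one.</description>
      <link>https://example.com/one</link>
      <pubDate>Mon, 01 Jan 2024 10:00:00 +0000</pubDate>
    </item>
    <item>
      <title>Example headline two</title>
      <link>https://example.com/two</link>
    </item>
  </channel>
</rss>"""


def parse_rss_items(xml_text: str, category: str) -> list:
    """Convert RSS <item> elements into dicts; missing fields become empty strings."""
    root = ET.fromstring(xml_text)
    rows = []
    for item in root.iter("item"):
        rows.append({
            "source": "economic_times",
            "category": category,
            "title": (item.findtext("title") or "").strip(),
            "summary": (item.findtext("description") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
            "published": (item.findtext("pubDate") or "").strip(),
        })
    return rows


if __name__ == "__main__":
    for row in parse_rss_items(SAMPLE_RSS, "markets"):
        print(row["category"], row["title"], row["link"])
```

A list of dicts in this shape can be passed straight to `pandas.DataFrame(rows)` to get the tabular form used elsewhere in this README.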
File details
Details for the file et_scraper_safe-0.1.0.tar.gz.
File metadata
- Download URL: et_scraper_safe-0.1.0.tar.gz
- Upload date:
- Size: 6.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 37c275cc32ae3c8f0972bb514e1bc104d8fd299e0bf48c4829744268a3ea1528 |
| MD5 | 04aecd630773a2896f4e9ec7924c6349 |
| BLAKE2b-256 | 865883cd60e2969a95aa9a70d5759caaaff0b82c5785de84d99cb38e977f9e52 |
File details
Details for the file et_scraper_safe-0.1.0-py3-none-any.whl.
File metadata
- Download URL: et_scraper_safe-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | b82866cc196524f8ba136112bbb2fc88b9f651521da00b1c1a9672e62cdd672f |
| MD5 | 41449cbc93aac1ac7a7e0d86803790f8 |
| BLAKE2b-256 | 33c056df7ce6d6aa7af0d6db38e6ec3daf3243c63bb8cf3c5b904abfee61c4eb |