Cryptocurrency exchange announcement news crawler for major crypto exchanges
Crypto Exchange News Crawler 🚀
A powerful and easy-to-use Python package for scraping cryptocurrency exchange announcements from major exchanges.
🎯 Features
- Multi-Exchange Support: Scrape from 12 major crypto exchanges
- Multiple Output Formats: JSON, CSV, and XML support
- Structured Data: Clean, standardized output format
- Rate Limiting: Built-in delays to respect exchange servers
- Extensible: Easy to add new exchanges
📦 Installation Options
Option 1: Direct Usage
git clone https://github.com/lowweihong/crypto-exchange-news-crawler.git
cd crypto-exchange-news-crawler
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -r requirements.txt
playwright install
scrapy crawl bybit -o output.json
Option 2: Install from PyPI
pip install crypto-exchange-news-crawler
playwright install
crypto-news crawl binance -o binance.json
# To route requests through proxies, enable the proxy middleware and pass a proxy list:
crypto-news crawl bybit -s DOWNLOADER_MIDDLEWARES='{"crypto_exchange_news.middlewares.MyProxyMiddleware": 610}' -s PROXY_LIST="http://proxy1:port,http://proxy2:port"
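For readers curious how a proxy-rotating downloader middleware can work in Scrapy, here is a minimal sketch. It is not the package's MyProxyMiddleware, whose source is not shown here; the class name RandomProxyMiddleware and the random-choice strategy are assumptions made for illustration. The only Scrapy behaviour it relies on is that the built-in HttpProxyMiddleware honours request.meta["proxy"].

```python
import random


class RandomProxyMiddleware:
    """Hypothetical example; NOT the package's MyProxyMiddleware."""

    def __init__(self, proxies):
        self.proxies = list(proxies)

    @classmethod
    def from_crawler(cls, crawler):
        # Scrapy calls this hook to build the middleware from the settings.
        raw = crawler.settings.get("PROXY_LIST") or []
        # "-s PROXY_LIST=a,b" arrives as one comma-separated string,
        # while settings.py may define a real list. Accept both.
        if isinstance(raw, str):
            raw = [p.strip() for p in raw.split(",") if p.strip()]
        return cls(raw)

    def process_request(self, request, spider):
        # Scrapy's built-in HttpProxyMiddleware reads request.meta["proxy"].
        if self.proxies:
            request.meta["proxy"] = random.choice(self.proxies)
        return None  # None lets the request continue down the chain
```

A priority of 610, as in the command above, places such a middleware before Scrapy's HttpProxyMiddleware (750 by default), so the proxy is already set when the built-in middleware runs.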
Supported Exchanges
| Exchange | Status |
|---|---|
| Bybit | ✅ |
| Binance | ✅ |
| OKX | ✅ |
| Bitget | ✅ |
| BingX | ✅ |
| Kraken | ✅ |
| Bitfinex | ✅ |
| XT | ✅ |
| Crypto.com | ✅ |
| MEXC | ✅ |
| Deepcoin | ✅ |
| KuCoin | ✅ |
Available spider names for the crawl command: bybit, binance, okx, bitget, bitfinex, xt, bingx, kraken, cryptocom, mexc, deepcoin, kucoin
📊 Output Format
Each scraped announcement includes:
{
  "news_id": "unique_identifier",
  "title": "Announcement title",
  "desc": "Announcement description",
  "url": "Full URL to announcement",
  "category_str": "Category (e.g., latest_activities, new_crypto)",
  "exchange": "Exchange name",
  "announced_at_timestamp": 1749235200,
  "timestamp": 1749232733
}
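As a rough illustration of consuming this output, the sketch below parses a JSON array of announcements (the shape Scrapy writes with -o output.json) and converts the two timestamp fields to datetimes. It assumes both fields are Unix timestamps in seconds interpreted as UTC; the helper name load_announcements and the added announced_at and crawled_at keys are hypothetical, not part of the package.

```python
import json
from datetime import datetime, timezone

# One record shaped like the example above, wrapped in a JSON array.
SAMPLE = """
[
  {
    "news_id": "unique_identifier",
    "title": "Announcement title",
    "desc": "Announcement description",
    "url": "Full URL to announcement",
    "category_str": "new_crypto",
    "exchange": "bybit",
    "announced_at_timestamp": 1749235200,
    "timestamp": 1749232733
  }
]
"""


def load_announcements(text):
    """Parse crawler output and attach timezone-aware datetimes.

    Assumption: both timestamp fields are Unix seconds in UTC.
    """
    items = json.loads(text)
    for item in items:
        item["announced_at"] = datetime.fromtimestamp(
            item["announced_at_timestamp"], tz=timezone.utc
        )
        item["crawled_at"] = datetime.fromtimestamp(
            item["timestamp"], tz=timezone.utc
        )
    return items


if __name__ == "__main__":
    for ann in load_announcements(SAMPLE):
        print(ann["exchange"], ann["announced_at"].isoformat(), ann["title"])
```

To read a real crawl result, pass the contents of the output file instead of SAMPLE, for example load_announcements(open("output.json", encoding="utf-8").read()).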
⚙️ Configuration
Key settings in settings.py:
- MAX_PAGE: Maximum number of pages to crawl (default: 2)
- DOWNLOAD_DELAY: Delay between requests in seconds (default: 3)
- CONCURRENT_REQUESTS: Number of concurrent requests (default: 8)
- USER_AGENT: List of user agents for rotation
- PROXY_LIST: List of proxies to rotate through; to use it, also uncomment the DOWNLOADER_MIDDLEWARES entry so that the proxy middleware is enabled
- PLAYWRIGHT_LAUNCH_OPTIONS: Browser configuration for Playwright spiders
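A settings.py excerpt using these names might look like the sketch below. The numeric values are the defaults stated above and the middleware path is the one used in the proxy command earlier in this README; the user-agent strings, proxy URLs, and the headless launch option are placeholder assumptions, not the package's shipped values.

```python
# settings.py (excerpt) - an illustrative sketch, not the shipped file.

MAX_PAGE = 2                # maximum number of pages to crawl
DOWNLOAD_DELAY = 3          # seconds between requests
CONCURRENT_REQUESTS = 8     # number of concurrent requests

# Placeholder user agents; replace with real browser strings.
USER_AGENT = [
    "Mozilla/5.0 (placeholder-agent-1)",
    "Mozilla/5.0 (placeholder-agent-2)",
]

# Placeholder proxies; fill in your own host and port values.
PROXY_LIST = [
    "http://proxy1:port",
    "http://proxy2:port",
]

# Enabling this entry is what activates the proxy middleware.
DOWNLOADER_MIDDLEWARES = {
    "crypto_exchange_news.middlewares.MyProxyMiddleware": 610,
}

# Passed to Playwright's browser launch; "headless" is an assumed example.
PLAYWRIGHT_LAUNCH_OPTIONS = {"headless": True}
```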
Custom Settings
You can override settings from the command line:
scrapy crawl bitget -s MAX_PAGE=5 -s DOWNLOAD_DELAY=2
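When crawls are launched from a script, the same overrides can be assembled programmatically. The following is a small sketch; the helper name settings_args is hypothetical, and it only builds the argument list rather than running anything.

```python
def settings_args(overrides):
    """Turn {"NAME": value} into ["-s", "NAME=value", ...] for the scrapy CLI."""
    args = []
    for key, value in overrides.items():
        args += ["-s", f"{key}={value}"]
    return args


# Equivalent to: scrapy crawl bitget -s MAX_PAGE=5 -s DOWNLOAD_DELAY=2
cmd = ["scrapy", "crawl", "bitget"] + settings_args(
    {"MAX_PAGE": 5, "DOWNLOAD_DELAY": 2}
)

if __name__ == "__main__":
    print(cmd)
```

Passing the resulting list to subprocess.run(cmd, check=True) from the project directory would start the crawl; using a list rather than a single string avoids shell quoting problems.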
🔧 Technical Requirements
- Python 3.7+
- Scrapy 2.11.0+
- Playwright (for Bitget spider)
- Chromium browser (automatically installed with Playwright)
🌐 Exchange URLs
Direct links to announcement pages:
⚖️ Legal & Ethical Usage
This crawler is designed for educational and research purposes. Please ensure you comply with:
- Applicable data protection laws
- Fair use guidelines
Always use the crawler responsibly and consider the impact on the target servers.
🤝 Contributing
Contributions welcome! Areas for improvement:
- Add support for more exchanges (Huobi, Gate.io, etc.)
- Implement real-time WebSocket feeds
- Add Telegram/Discord notification integrations
- Improve data parsing and categorization
Support
For issues, questions, or contributions, please create an issue in the repository.
File details
Details for the file crypto_exchange_news_crawler-0.1.8.tar.gz.
File metadata
- Download URL: crypto_exchange_news_crawler-0.1.8.tar.gz
- Upload date:
- Size: 20.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 648285aa5735e878c7cb03abd0c17e54125a58277fae0740323bd50cdb7415ef |
| MD5 | e40eefce54ee7f15d2bb3f52c17e689a |
| BLAKE2b-256 | 3169f13f314c3e295f9a2607fce129463b4a78f16aa42bef7e21c0b752f30686 |
File details
Details for the file crypto_exchange_news_crawler-0.1.8-py3-none-any.whl.
File metadata
- Download URL: crypto_exchange_news_crawler-0.1.8-py3-none-any.whl
- Upload date:
- Size: 27.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 38f6ece5004c4fbfce7ffe1700ec5c025cfbdc4fd72f31cbb41c1ca1caa06609 |
| MD5 | 3407da66b6d98a93ba36ea80ebbcdf20 |
| BLAKE2b-256 | f7ef5262a728e689e954849aeccfa523ba3408b5d1aa2bede100edd885c8c820 |