A web scraper for ticker symbols from yahoo finance
Project description
Produces .csv, .xlsx, .json and .yaml files (All files contain same data but in a different format) for stocks, ETF, futures, indexes, mutual funds, currency, warrants and bonds. The ticker symbol, company name and exchange are saved for all symbols.
It gets its data from https://finance.yahoo.com/lookup/. Please note: it is not possible to get all the symbols due to limitations set by Yahoo.
Requirements
Python 2.7 or Python 3.5+
Install
From python package manager (preferred):
pip install Yahoo-ticker-downloader
From source:
python setup.py install
Example Usage
usage: YahooTickerDownloader.py [-h] [-i] [-e] [-E EXCHANGE] [-s SLEEP] [-p]
[type]
positional arguments:
type The type to download, this can be: generic
optional arguments:
-h, --help show this help message and exit
-i, --insecure use HTTP instead of HTTPS
-e, --export export immediately without downloading (Only useful if
you already downloaded something to the .pickle file)
-E EXCHANGE, --Exchange EXCHANGE
Only export ticker symbols from this exchange (the
filtering is done during the export phase)
-s SLEEP, --sleep SLEEP
The time to sleep in seconds between requests
-p, --pandantic Stop and warn the user if some rare assertion fails
For example to download all stock symbols you run it like:
YahooTickerDownloader.py
The program takes a few weeks before it is finished. The program supports suspending and resuming a download. Press CTRL+C to suspend download. Restart the program in the same working directory to resume downloading. It is possible to export partially downloaded results using the -e flag.
Example of CSV output:
Ticker Name Exchange exchangeDisplay Type TypeDisplay
JNUG Direxion Daily Jr Gld Mnrs Bull 3X ETF ASE NYSE MKT E ETF
DWDP DowDuPont Inc. NYQ NYSE S Equity
E Eni S.p.A. NYQ NYSE S Equity
EQH AXA Equitable Holdings, Inc. NYS NYSE S Equity
XOM Exxon Mobil Corporation NYQ NYSE S Equity
ETP Energy Transfer Partners, L.P. NYQ NYSE S Equity
ES=F E-mini S&P 500 Index Futures,Jufuture CME Chicago Mercantile Exchange F Futures
NQ=F E-mini Nasdaq 100 Index Futuresfuture CME Chicago Mercantile Exchange F Futures
GE=F Eurodollar Futures,Sep-2018future CME Chicago Mercantile Exchange F Futures
...ect
Depending on the type you are downloading, you will get between 3.000 and 100.000+ entries.
Further resources
Download history for symbols: ystockquote
Changelog
Version 3.0.1 (2018-12-01)
Removed reppy dependency
Version 3.0.0 (2018-05-27)
Switched over to different JSON api (searchassist)
Version 2.2.0 (2018-01-31)
Continue exporting to different formats if one export fails ( #41 )
Check robots.txt ( anti-feature )
Version 2.1.1 (2017-08-02)
A too old requests dependency was listed ( #35 )
Version 2.1.0 (2017-05-10)
Added market parameter ( pull request #33 )
Version 2.0.1 (2017-05-07)
Fixed issue where all downloads except stock and currency stopped working.
Version 2.0.0 (2017-05-05)
Switched over to JSON api
Version 1.0.0 (2017-04-04)
Reverted some changes from 0.10.0. Bond is back. Reverted back to English site instead of German.
Resolved CSV issue again. Closes #23 and #16.
Merged #26 Workaround Y! b>2000 limit
Scraper now scrapes a lot more at the expense of runtime.
Support for python2 is back. Latest python 2 & 3 are supported.
Removed xls support
Added xlsx support (#29)
Version 0.10.1 (2017-02-04)
More descriptive help message
Version 0.10.0 (2017-02-02)
Removed bond downloading option.
Uses different yahoo source. Fixes #18
Removed python2 from classifiers. Related to #16
Version 0.9.0 (unreleased)
Added a flag to restrict output to specific stock exchanges.
Version 0.8.1 (2016-08-17)
Workaround for #7 : downloading interruption
Solution for #9 : UnicodeEncodeError
Version 0.7.0 (2016-03-20)
Added –export option. It will transcode the .pickle file immediately to the desired output formats.
Version 0.6.0 (unreleased)
Add 3 retries with an exponential back-off if HTTPError or ChunkedEncodingError is raised when processing _fetchHtml.
Version 0.5.0 (2015-08-16)
Allows downloading using a insecure connection.
The temporarily download file-names now include the ticker type.
Version 0.4.0 (2014-10-28)
Warrant symbols can now be downloaded.
Bond symbols can now be downloaded.
Version 0.3.0 (2014-08-14)
Use HTTPS instead of HTTP
Retry to fetch a page if it contains no symbols (A “fix” for issue #4)
Renamed all ‘Curreny’ to ‘Currency’
Relative imports are used
Fix: .csv file it outputs is encoded in UTF-8 when using python2
Performance: Considerable reduced memory consumption
It now outputs .json, .yaml and .xls files in addition to .csv
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file Yahoo-ticker-downloader-3.0.1.tar.gz
.
File metadata
- Download URL: Yahoo-ticker-downloader-3.0.1.tar.gz
- Upload date:
- Size: 8.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Python-urllib/2.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f2e3ea8c65342e2122d3c251eada97727a352a2268f29ce6f286b5a6ccdb17d |
|
MD5 | 87c12e30064a351fe82a3250a5bc2b8a |
|
BLAKE2b-256 | 26c6012180ba63223f39bfea186f04e19840381a9bc22a2fc15a7aecb0b8d90d |