Skip to main content

PoC for scraping Yahoo News with sentiment analysis

Project description

StockNews

Scrape financial News from Yahoo and analyse the sentiment (PoC)

Summary

With stocknews, you can scrape news data from the Yahoo Financial RSS Feed and store them with the sentiment of the headline and the summary. Depending on the initialization 1 or 2 files are output as csv. No. 1 is the scraped news (optional) and no. 2 is the summary, having the summarized sentiment of news for the given date (see options) and the values.

Install

To install the package, run pip install stocknews

Usage

In order to use stocknews to scrape news data and prepare them for your model you simply need this:

from stocknews import StockNews
...
stocks = ['AAPL', 'MSFT', 'NFLX']
sn = StockNews(stocks, wt_key='MY_WORLD_TRADING_DATA_KEY')
df = sn.summarize()
...

This returns a pandas DataFrame and saves it to data/data.csv by default (see options)

Options

  • stocks: A list of stocks to check. See http://eoddata.com/symbols.aspx for all symbols available
  • news_file='news.csv': filename of the saved news
  • summary_file='data.csv': filename of the saved dataset, including sentiment and value per day and stock
  • save_news=True: save the news file or scrape and analyse on the fly for recent news
  • closing_hour=20: Close of the exchange (NASDAQ in this case). News after closing will be taken for next trading day (skips the weekend as well)
  • closing_minute=0: Same as closing_hour
  • wt_key=None: Your worldtradingdata.com API Key. Get one here. Not needed if read_rss is called directly.

Dependencies

Tests

python setup.py test

ToDo

  • add more news sources
  • add more tests

Changes

0.9.11

  • Fixed another pathing issue...

0.9.10

  • removed the exit() rolleyes

0.9.9

  • path issues fixed. For real now...

0.9.8

  • fixed some path issues

0.9.7

  • tried to fix too many requests, added a counter for made requests to keep track

0.9.6

  • Suppress ntlk download messages
  • renamed test.py

0.9.5

  • "Initial Release"

<0.9.5:

  • Testing

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
stocknews-0.9.11.tar.gz (5.4 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page