Skip to main content
Help improve PyPI by participating in a 5-minute user interface survey!

Scrape data of all the episodes of a Tv Series from IMDB

Project Description

Tvstats

Scrape data of all the episodes of a Tv Series from IMDB.

Installation

Run

python setup.py install

Dependencies

tvstats is based on Python 2.7. Requires BeautifulSoup4 for parsing, requests for downloading html. Matplotlib is required(optional) for using graph module.

Usage

Run the simple command

tvstats url

to generate json data. URL should point to homepage of a tv series. eg. http://www.imdb.com/title/tt0108778/?ref_=fn_al_tt_1

For options and help run

tvstats -h

Why?

Here are my reasons:

  • I was bored and had time to kill.
  • I love watching Tv Series. Thought it would be good to analyse some data before starting a new one.
  • Graphs are fun.
  • Lastly, I wanted to test out BeautifulSoup4 :).

Issues, Bugs, Graphs?

Let me knwow about the issues at https://github.com/leosartaj/tvstats/issues. Feel free to add new graphs or improve.

Examples

All the datasets can be found here. Graphs were made using graph function in ‘graph.py’.

Friends

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/friends.png

Game Of Thrones

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/gameOfThrones.png

Breaking Bad

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/breakingBad.png

The Big Bang Theory

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/tbbt.png

How I Met Your Mother

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/himym.png

Prison Break

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/prisonBreak.png

Hannibal

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/hannibal.png

Suits

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/suits.png

Dexter

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/dexter.png

Arrow

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/arrow.png

Person Of Interest

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/personOfInterest.png

Homeland

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/homeland.png

House Of Cards

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/houseOfCards.png

How to Get Away With Murder

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/howToGetAwayWithMurder.png

Orange Is The New Black

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/orangeIsTheNewBlack.png

Shameless

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/shameless.png

Sons Of Anarchy

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/sonsOfAnarchy.png

Spartacus

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/spartacus.png

The Walking Dead

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theWalkingDead.png

Vikings

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/vikings.png

Flash

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/flash.png

The Wire

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theWire.png

Continuum

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/continuum.png

Lost

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/lost.png

The Sopranos

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theSopranos.png

Releases

Release history Release notifications

This version
History Node

0.0.2

History Node

0.0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
tvstats-0.0.2.tar.gz (5.6 kB) Copy SHA256 hash SHA256 Source None Jul 1, 2015

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page