Skip to main content
This is a pre-production deployment of Warehouse. Changes made here affect the production instance of PyPI (pypi.python.org).
Help us improve Python packaging - Donate today!

Scrape data of all the episodes of a Tv Series from IMDB

Project Description

Tvstats

Scrape data of all the episodes of a Tv Series from IMDB.

Installation

Run

python setup.py install

Dependencies

tvstats is based on Python 2.7. Requires BeautifulSoup4 for parsing, requests for downloading html. Matplotlib is required(optional) for using graph module.

Usage

Run the simple command

tvstats url

to generate json data. URL should point to homepage of a tv series. eg. http://www.imdb.com/title/tt0108778/?ref_=fn_al_tt_1

For options and help run

tvstats -h

Why?

Here are my reasons:

  • I was bored and had time to kill.
  • I love watching Tv Series. Thought it would be good to analyse some data before starting a new one.
  • Graphs are fun.
  • Lastly, I wanted to test out BeautifulSoup4 :).

Issues, Bugs, Graphs?

Let me knwow about the issues at https://github.com/leosartaj/tvstats/issues. Feel free to add new graphs or improve.

Examples

All the datasets can be found here. Graphs were made using graph function in ‘graph.py’.

Friends

Game Of Thrones

Breaking Bad

The Big Bang Theory

How I Met Your Mother

Prison Break

Hannibal

Suits

Dexter

Arrow

Person Of Interest

Homeland

House Of Cards

How to Get Away With Murder

Orange Is The New Black

Shameless

Sons Of Anarchy

Spartacus

The Walking Dead

Vikings

Flash

The Wire

Continuum

Lost

The Sopranos

Releases

Release History

Release History

This version
History Node

0.0.2

History Node

0.0.1

Download Files

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
tvstats-0.0.2.tar.gz (5.6 kB) Copy SHA256 Checksum SHA256 Source Jul 1, 2015

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting