Scrape data for all the episodes of a TV series from IMDb

Tvstats

Scrape data for all the episodes of a TV series from IMDb.

Installation

Run

python setup.py install

Dependencies

tvstats is based on Python 2.7. It requires BeautifulSoup4 for parsing and requests for downloading HTML. Matplotlib is optional and is needed only for the graph module.
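Because Matplotlib is optional, a script built around tvstats may want to check for it before trying to plot. The following is a minimal sketch that assumes nothing about tvstats internals; the helper name `has_module` is hypothetical and not part of the package.

```python
import importlib


def has_module(name):
    """Return True if the named module can be imported, else False."""
    try:
        importlib.import_module(name)
    except ImportError:
        return False
    return True


# Hypothetical check, not part of tvstats: report whether plotting is possible.
if has_module("matplotlib"):
    print("matplotlib found: graphs can be drawn")
else:
    print("matplotlib missing: install it before using the graph module")
```

The same check works for the required packages by passing `"bs4"` or `"requests"` as the module name.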

Usage

Run the simple command

tvstats url

to generate JSON data. The URL should point to the IMDb homepage of a TV series, e.g. http://www.imdb.com/title/tt0108778/?ref_=fn_al_tt_1
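The example URL ends in a query string (`?ref_=fn_al_tt_1`) that appears to be a referrer parameter rather than part of the series address. As a rough sketch, the title ID can be extracted and the URL reduced to a plain homepage form. The function name `series_homepage` is hypothetical and not part of tvstats, and the pattern assumes IMDb title IDs are `tt` followed by digits. Whether tvstats itself needs this cleanup is not stated; the sketch only illustrates the expected URL shape.

```python
import re

# Assumption: an IMDb title ID is "tt" followed by one or more digits.
_TITLE_ID = re.compile(r"/title/(tt\d+)")


def series_homepage(url):
    """Return the series homepage URL without any query string."""
    match = _TITLE_ID.search(url)
    if match is None:
        raise ValueError("not an IMDb title URL: %r" % url)
    return "http://www.imdb.com/title/%s/" % match.group(1)


print(series_homepage("http://www.imdb.com/title/tt0108778/?ref_=fn_al_tt_1"))
# prints http://www.imdb.com/title/tt0108778/
```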

For options and help run

tvstats -h

Why?

Here are my reasons:

  • I was bored and had time to kill.

  • I love watching TV series and thought it would be good to analyse some data before starting a new one.

  • Graphs are fun.

  • Lastly, I wanted to test out BeautifulSoup4 :).

Issues, Bugs, Graphs?

Let me know about any issues at https://github.com/leosartaj/tvstats/issues. Feel free to add new graphs or improve the existing ones.

Examples

All the datasets can be found here. The graphs were made using the graph function in ‘graph.py’.

Friends

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/friends.png

Game Of Thrones

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/gameOfThrones.png

Breaking Bad

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/breakingBad.png

The Big Bang Theory

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/tbbt.png

How I Met Your Mother

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/himym.png

Prison Break

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/prisonBreak.png

Hannibal

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/hannibal.png

Suits

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/suits.png

Dexter

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/dexter.png

Arrow

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/arrow.png

Person Of Interest

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/personOfInterest.png

Homeland

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/homeland.png

House Of Cards

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/houseOfCards.png

How to Get Away With Murder

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/howToGetAwayWithMurder.png

Orange Is The New Black

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/orangeIsTheNewBlack.png

Shameless

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/shameless.png

Sons Of Anarchy

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/sonsOfAnarchy.png

Spartacus

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/spartacus.png

The Walking Dead

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theWalkingDead.png

Vikings

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/vikings.png

Flash

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/flash.png

The Wire

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theWire.png

Continuum

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/continuum.png

Lost

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/lost.png

The Sopranos

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theSopranos.png
