Skip to main content

Machine learning movie recommender

Project description

<h1 align="center">
Moviebox
<a href="https://github.com/klauscfhq/moviebox">
<img src="media/header.png" alt="Moviebox" width="90%">
</a>
</h1>

<h4 align="center">
🎥 Machine learning movie recommender
</h4>

[![Build Status](https://travis-ci.org/klauscfhq/moviebox.svg?branch=master)](https://travis-ci.org/klauscfhq/moviebox)

## Contents

- [Description](#description)
- [CLI](#cli)
- [Usage](#usage)
- [API](#api)
- [Development](#development)
- [Team](#team)
- [License](#license)

## Description

Moviebox is a content based machine learning recommending system build with the powers of [`tf-idf`](https://en.wikipedia.org/wiki/Tf%E2%80%93idf) and [`cosine similarities`](https://en.wikipedia.org/wiki/Cosine_similarity).

Initially, a natural number, that corresponds to the ID of a unique movie title, is accepted as input from the user. Through `tf-idf` the plot summaries of 5000 different movies that reside in the dataset, are analyzed and vectorized. Next, a number of movies is chosen as recommendations based on their `cosine similarity` with the vectorized input movie. Specifically, the cosine value of the angle between any two non-zero vectors, resulting from their inner product, is used as the primary measure of similarity. Thus, only movies whose story and meaning are as close as possible to the initial one, are displayed to the user as recommendations.

The [dataset](moviebox/dataset/movies.csv) in use is a random subset of the [Carnegie Mellon Movie Summary Corpus](http://www.cs.cmu.edu/~ark/movie$-data/), and it consists of `5000` movie titles along with their respective categories and plots.

The nature of the project is heavily educational.

## Install

**`Python 3+`** is required to install or build the code. Support for **`Python 2+`** will be added soon.

```
pip3 install moviebox
```

## CLI

```
$ moviebox --help

🎥 Machine learning movie recommender

Usage
$ moviebox [<options> ...]

Options
--help, -h Display help message
--movie, -m Input movie ID
--plot, -p Display movie plot
--interactive, -i Display process info
--recommendations, -r Number of movie recommendations
--version, -v Display installed version

Examples
$ moviebox --help
$ moviebox --movie 2874
$ moviebox -m 2874 --recommendations 3
$ moviebox -m 2874 -r 3 --plot
$ moviebox -m 2874 -r 3 -p --interactive
```

## Usage

```python
from moviebox.recommender import recommender

movieID = 2874 # Movie ID of `Asterix & Obelix: God save Britannia`
recommendationsNumber = 3 # Get 3 movie recommendations
showPlots = True # Display the plot of each recommended movie
interactive = True # Display process info while running

# Generate the recommendations
recommender(
movieID=movieID,
recommendationsNumber=recommendationsNumber,
showPlots=showPlots,
interactive=interactive)
```

## API

#### recommender`(movieID, recommendationsNumber, showPlots, interactive)`

**E.g.** `recommender(movieID=2874, recommendationsNumber=3, showPlots=True, interactive=True)`

#### `movieID`

- Type: `Integer`

- Default Value: `2874`

- Optional: `True`

Input movie ID. Any natural number between `[0, 4999]` can be selected.

#### `recommendationsNumber`

- Type: `Integer`

- Default Value: `3`

- Optional: `True`

Number of movie recommendations to be generated. Any natural number between `[1, 30]` can be selected.

#### `showPlots`

- Type: `Boolean`

- Default Value: `False`

- Optional: `True`

Display the plot summary of each recommended movie.

#### `interactive`

- Type: `Boolean`

- Default Value: `False`

- Optional: `True`

Display process-related information while running.

## Development

- [Clone](https://help.github.com/articles/cloning-a-repository/) this repository to your local machine
- Navigate to your clone `cd moviebox`
- Install the dependencies `fab i`/`pip3 install -r requirements.txt`
- Check for errors `fab test`
- Run the API `fab start`
- Build the package `fab dist`
- Cleanup compiled files `fab clean`

## Team

- Mario Sinani ([@mariocfhq](https://github.com/mariocfhq))
- Klaus Sinani ([@klauscfhq](https://github.com/klauscfhq))

## License

[MIT](https://github.com/klauscfhq/moviebox/blob/master/license.md)


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

moviebox-0.0.0.tar.gz (3.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

moviebox-0.0.0-py3-none-any.whl (3.6 MB view details)

Uploaded Python 3

File details

Details for the file moviebox-0.0.0.tar.gz.

File metadata

  • Download URL: moviebox-0.0.0.tar.gz
  • Upload date:
  • Size: 3.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for moviebox-0.0.0.tar.gz
Algorithm Hash digest
SHA256 350d46767fadfc82309204114e0f7a0e4b694b7bea3cb1fd49728afc24e85759
MD5 d9acdd0113833c6599d412a8c9bf3cd5
BLAKE2b-256 fb104ecd2083f5b8580101461b5bfcfa38e2e0588042bc2c072b9f94c3f7fd40

See more details on using hashes here.

File details

Details for the file moviebox-0.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for moviebox-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b12549838048af31d52343eeb0f79af6b895f2e17986fe2a606b3abf26506a9c
MD5 e2eda4ff927a2e631ad4743d47f3ca14
BLAKE2b-256 303dd2a8b372809765ff551cf013da3950c8f1bb83debd52e9dcd48334f82c14

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page