Hybrid Movie Recommendation System
Hybrid Movie Recommendation System
Every year new movies are released with a varied story-line or a genre which could
be of potential interest to viewers. Various online movie or video streaming platforms
can keep the customers engaged by recommending movies of the viewer's
preference. Hybrid Recommendation System for Movies uses the combination of
collaborative and content based filtering in the context of web-based recommender
systems. Recommendation systems constitute a specific type of information filtering
that attempt to present items according to the interests expressed by a user , in order
to reduce the effect of such dependencies. Our project proposes a hybrid
recommendation system with sentiment analysis of movie tweets. The movie tweets
have been collected from micro blogging websites to understand the current trends
and user response of the movie. Through different investigations, we will show the
ability to expand utilization, just as the qualities of how to foresee blended longings.
Ideas for particular kinds of channel data endeavor to decide factors dependent on
the business' advantages to limit the effect of protection. Our venture requires an
arrangement of correspondence and passionate examination of tweets. The film's
tweets have been gathered on a microblog webpage to stay aware of the most recent
patterns and arrangements from film clients. Leading examination in the state files
gives positive outcomes.
v
Table of Content
ABSTRACT v
1 INTRODUCTION 1
2 LITERATURE REVIEW 3
3 METHODOLOGY 5
vi
3.6 RECOMMENDER SYSTEM TYPES AND TECHNIQUES 17
3.6.3CONTENT-BASED RECOMMENDATION 21
3.8 IMPLEMENTATION 27
3.8.2 KNN 28
3.8.3 SVD 29
5.1 CONCLUSION 32
APPENDICES 34
A.SOURCE CODE 34
B.SCREENSHOTS 39
vii
LIST OF FIGURES
B SCREENSHOTS 39
viii
LIST OF TABLES
ix
CHAPTER 1
INTRODUCTION
1.1 RELEVANCE OF THE PROJECT
Movies are a part and parcel of life. There are different types of movies like
some for entertainment, some for educational purposes, some are animated movies
for children, and some are horror movies or action films. Movies can be easily
differentiated through their genres like comedy, thriller, animation, action etc. Other
way to distinguish among movies can be either by releasing year, language, director
etc. Watching movies online, there are a number of movies to search in our most liked
movies .
1
1.2 PROBLEM STATEMENT
The goal of the project is to recommend a movie to the user. Providing related
content out of relevant and irrelevant collection of items to users of online service
providers.
2
CHAPTER 2
LITERATURE SURVEY
2.1.RELATED WORK
The work of Nandagawali and Patil [1] provides a solution for the various
recommender systems by using collaborative filtering algorithms with the
community based user domain model. The main purpose is to satisfy the
customer„s product needs by providing them recommendation based on
products. It deals with the Amazon dataset. Better hardware configuration may
result in better output.
The work of Jatinder et al., [2] deals with the review of several techniques that
are used by a system for recommending electronic products. They have also
worked with the Amazon dataset. Time complexity of the whole model is pretty
high, making it less convenient.
In the work of Eklaspur and Pashupatimath, [3] a formal representation of
social network is present where text mining is taken as a perspective. A
framework is proposed that can recommend friend using an efficient Algorithm.
Their work is based on Facebook data. The notable points in their work can be
that the accuracy is high, and works fine with any size of data. Analysis of the
classes and its classification is also highly précised. But it lacks to meet the
standard when it comes to time of execution.
Haruna, et al., [4] in their work, have proposed a research paper recommender
system that transforms all the recommending papers into a paper-citation
relations matrix. The key points that makes their model better can be faster
results. Their algorithm which is Brute Force based, but fails to meet the
compatibility in real time application as it will be inefficient for very large
dataset, as the number of rows to column ratio will increase abnormally.
In the research work carried out by Kumar et al., [5] they have proposed a
movie recommendation system named MOVREC. The system sorts the ratings
by implementing K-means clustering algorithm. The pros of the system include
3
that the time taken is less, and almost every time the result of the classification
of data is accurate [21][25]. But the cons can be that creating classes amongst
closely related data which may belong to the same cluster can be redundant
and misleading.
According to the work of Chen et al. [6], they have incorporated Singular Value
Decomposition (SVD) in user based collaborative filtering technique for movie
recommendation system. Although, they have got good accuracy with their
model, but additional user or item features can be applied along with a larger
dataset for better preparation of model.
The work of Reddy, et al. [7] deals with genre correlation-oriented content-
based movie recommendation system, that only focusses on the genre of a
movie. On the contrary, for any person, choice might not be completely based
on one genre.
In [8], C. S. M. Wu, D. Garg, and U. Bhandary proposed a recommendation
system using collaborative filtering where a user‟s rating is used to suggest the
list. The authors have used the Apache Mahout framework and essentially
compared the performances and efficiency of us er-based & item-based
recommendations.
In [9],R. E. Nakhli, H. Moradi, and M. A. Sadeghi proposed the percentage
view approach for recommending movies to the users, it finds relevant movies
for the customer and then compares the performance with a random movie
recommendation system for showing the accuracy of the project.
In [10], a content-based recommendation system is proposed by H. W. Chen,
Y. L. Wu, M. K. Hor, and C. Y. Tang using neural networks. In recent years,
these are top topics for the researchers to work on when they want to build a
movie recommendation system.
4
CHAPTER 3
METHODOLOGY
In the existing system, collaborative filtering and content based filtering have
been used.Collaborative filtering system maintains a database of many users ratings
of a variety of items. It makes use of user data, ignoring content/item data.Content
based filtering uses only the item data maintaining a profile for each item. Each user
assumed to operate independently . No need for data on other users.
Cold start.
Sparsity.
First rater.
Popularity bias.
In the proposed hybrid recommender system takes advantage from both the
representation of the content as well as the similarities among users. The intuition
behind this technique is to use a content-based predictor to fill the user-rating matrix
that is sparsely distributed.
5
A web crawler is used to download necessary movie content for our dataset.
After the preprocessing the movie content database is stored. The dataset consists of
a user-rating matrix.
Content-based predictions are used to train each user-rating vector in the user-
rating matrix and convert it into a pseudo rating matrix which combines actual rating
with the predicted ratings. Collaborative filtering is then applied to this full pseudo
user-rating matrix to make recommendation for an active user.The movie tweets have
been collected from microblogging websites to understand the current trends and
user response of the movie.
This involves both the hardware and software requirements needed for the
project and detailed explanation of the specifications.
A PC with Windows/Linux OS
Processor with 1.7-2.4gHz speed
Minimum of 8gb RAM
2gb Graphic card