Professional Documents
Culture Documents
INTERNSHIP REPORT
INFORMATION TECHNOLOGY
TABLE OF CONTENTS
1
INTRODUCTION.............................................................................................4
1.1 About Knowledge Technology Laboratory...................................................4
1.2 About Topic: Collaborative Filtering techniques in Recommender Systems
........................................................................................................................
4
COLLABORATIVE FILTERING.....................................................................4
2.1 Recommender Systems.................................................................................4
2.2 Collaborative Filtering..................................................................................5
2.2.1
Overview.............................................................................................5
2.2.2
2.3.1
Cosine Similarity.................................................................................6
2.3.2
2.3.3
EXPERIMENT..................................................................................................8
3.1 Recommendation Engine : RecDB...............................................................8
3.2 Experiment....................................................................................................9
REFERENCES................................................................................................15
ACKNOWLEDGMENTS
TABLE OF FIGURE
Figure 1. Collaborative Filtering Process..............................................................6
Figure 2. Turn on database server........................................................................10
Figure 3. Create and run database movielensdb...............................................11
Figure 4. Import initmovielens1mdatabase.sql....................................................12
Figure 5. Check list of relations...........................................................................12
Figure 6. Top-10 movies recommendation based on the rating predicted using
Item-Item Collaborative Filtering............................................................................14
Figure 7. Recommends the top 5 action movies to user 1...................................15
Figure 8. Recommends the top 5 action movies to user 2...................................16
Figure 9. Recommends the top 5 action movies to user 3...................................17
1 INTRODUCTION
1.1 About Knowledge Technology Laboratory
Knowledge and Technology laboratory is under Faculty of Information
Technology. There are some main fields in researching:
Text Mining, Web Mining, Opinion Mining, Social Media mining,
and Natural Language Processing Vietnamese Entity/Object Search
Vietnamese Entity/Object Search
Process Mining, Knowledge Technology and Service Science
The head of Knowledge and Technology laboratory is Associate Professor,
Doctor Ha Quang Thuy.
1.2 About Topic: Collaborative Filtering techniques in
Recommender Systems
Recommender Systems can be divided into two main categories, Contentbased systems and Collaborative Filtering systems [1] [2] [3]. In my
internship course, I choose the Collaborative Filtering approach, there are
some reasons:
Firstly, Collaborative Filtering is based on simple ideal, so it is easy
to comprehend and implement.
Secondly, although Collaborative Filtering is simple but it is effect
intuition, and using in widely, such as: Amazon.com, Yahoo,
Cinemax.com
Last, Collaborative Filtering is the basic method, it is proven about
the performance, and it can be improved.
2 COLLABORATIVE FILTERING
2.1 Recommender Systems
Recommender Systems are a subclass of Information Filtering system
that use to predict the preference that user would give to an item [1] [4]
(movies, books, music, news, Web page, images ).
Typically, Recommender Systems produce a list of recommendations in
one of two ways: through Collaborative or Content-based Filtering [5] [1]
4
. Each entry
(ratings) of the
ai , j
in
th user on the
within a numerical scale and it can as well be 0 indicating that the user has
not yet rated that item.
There are many algorithm can be used for Collaborative Filtering. In this
paper, I will focus on Cosine Similarity, Pearson Correlation Similarity,
Singular Value Decomposition.
6
If the value of similarity is 1, two vectors are the same orientation, if that
value is 0, two vector is crossed, item i and j are distinct. And if this value is
-1, two is not similarity.
2.3.2 Pearson Correlation Similarity
Pearson Correlation Similarity is a Model-based algorithm for making
recommendations [1]. In this case, the similarities between two item i , j is
measured by computing Pearson Correlation
corr i , j
(Ru ,i R i)( R u , j R j )
( i, j ) =corr i , j=
u U
(R
u U
Where
Ru ,i
u, i
R i ) .
(R
u U
u, j
R j )
to item
R i
is the
, with rank
SVD ( A)
, is
defined as
SVD ( Am n )=U m m Sm n V T n n
Where matrix
nonzero entries,
m r
, called
Ak
. It is possible to
3 EXPERIMENT
3.1 Recommendation Engine : RecDB
In this section, I am doing some experiment using RecDB Recommendation Engine Built Entirely Inside PostgreSQL 9.2 of Mohamed
Sarwat of University of Minnesota. RecDB allows application developers to
build recommendation applications in a heartbeat through a wide variety of
built-in recommendation algorithms like user-user Collaborative Filtering,
8
Step 2: Create and run new database has name movielensdb with
command line in new terminal: perl scripts/pgfrontend.pl movielensdb
The address of the host server running the PostgreSQL backend is localhost
(default)
10
11
12
13
Figure 6. Top-10 movies recommendation based on the rating predicted using Item-Item
Collaborative Filtering
DESC LIMIT 5;
As can be seen from the Figure 7, we can see five action movie with the
highest rating value. That mean, the system can make recommendation for
user 1 five movies in action type.
To compare, I will make top 5 action movies for user 2 and user 3. For
user 2, I user query:
SELECT r.itemid, i.name, i.genre, r.ratingval
FROM ml_ratings r, ml_items i
RECOMMEND r.itemid
TO r.userid
ON r.ratingval
USING itemcoscf
15
5 REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8] .
Paul Resnick, Neophytos Iacovou, Mitesh Suchak, Peter
Bergstrom, John Riedl, "GroupLens: An Open Architecture for
.
Collaborative Filtering of Netnews," CSCW, pp. 175-186, 1994.
.
.
[9]
Mustansar Ali Ghazanfar, Adam Prgel-Bennett, Sndor
.
Szedmk, "Kernel-Mapping Recommender system algorithms,"
.
Inf. Sci., pp. 81-104, 2012.
.
[10].
Berry, M. W., Dumais, S. T., and OBrian, G. W., "Using Linear
Algebra for Intelligent Information Retrieval," SIAM Review, vol.
.
37, no. 4, pp. 573-595, 1995.
.
[11]
Hanoi, ./../2014
Lecturer
19