Beruflich Dokumente
Kultur Dokumente
2010
IEEE
IEEE
Fourth
International
International
Conference
Conference
on Semantic
on Semantic
Computing
Computing
Zhuli Xie
Yahoo! Inc.
3333 West Empire Avenue
Burbank, CA, USA
zxie@yahoo-inc.com
I. I NTRODUCTION
Recommender systems have become a vital part of most
web companies that have a significant online presence. These
systems form a specific type of information filtering technique
that attempts to present items (movies, music, books, news,
images, web pages) that are likely of interest to the user by
comparing the users profile to reference characteristics such
as the information from the item or from the users social
environment. Input is taken either directly or indirectly from
users and based on user needs, preferences and usage patterns,
recommendations for personalized products and services are
provided. The goal is to ease the information search, discovery
and decision processes for the user.
In this paper, we explore both Collaborative Filtering (CF)
and Content-based (CB) methods for video recommendation.
Our recommendation approach is item-based, i.e, the recommendations are per-item, rather than per-user based. An
item-based collaborative filtering method and a content-based
approach to find similar videos is presented. To find similar
videos, both methods are employed where the item-based
collaborative filtering utilizes the video views data and the
content-based method makes use of the video metadata. The
results from two approaches are then blended to generate the
recommendation indexes, which are then improved through
a refinement stage. Our system can be easily modified to
978-0-7695-4154-9/10 $26.00 2010 IEEE
DOI 10.1109/ICSC.2010.79
126
and aural relevance using 13K online videos. A film recommendation system, which uses low level audio-visual features
extracted to classify MPEG coded films is presented in [16].
This differs from our approach as we do not use low level
physical features, primarily to avoid issues related to proprietary nature of data and complexity given the large size
of data set. Another approach, Music Video Miner, which
performs video segmentation and feature extraction to get
an abstract representation of video using audio, visual and
transcript features, is presented in [4]. An approach providing
video suggestions using user-video graph is presented in [6].
Our approach differs from theirs in that we use semantic
information obtained from video metadata.
The rapid growth in Recommender Systems has seen approaches attempting to deal with the challenges of scalability,
accommodating to new data, comprehensibility and improving accuracy of recommendations. An approach addressing
the scalability issue is presented in [8] using a CF based
incremental clustering. In our recommendation approach we
leverage map-reduce framework to provide a scalable solution
for processing large dataset and make use of an alternate way
to achieve improvements in the quality of recommendation
results through a refinement stage. In terms of handling new
data, we use incremental learning and updating. The results
in [10] indicate that allowing users to know more about
the result generating process can help them understand the
strengths and weaknesses of the recommender system. With
this knowledge, users can make low-risk decisions. We provide
explanations in terms of common features using concepts of
semantic similarity when providing corresponding recommendations for candidate items which in our study are videos.
127
one user who just watched a clip of her friend dancing Hanna
Montana, she would be likely to watch similar mimic shows
from other unknown people and have some more laughs. In
such situation, we will provide the users with similar videos
which are determined from the textual data associated with
the videos, such as the title, description, and tags entered by
the users. At the same time, the user views data provide us
some other evidences that some videos are still of peoples
interests even though they cannot be determined as related
from the associated text. In Section V we will show that our
recommendation system combines the recommendation results
generated from our pipeline which deal with the user views
data and video Metadata separately. Below we describe two
kind of data used by our pipeline.
A. Co-Viewed Videos
MySpace allows users to watch the videos on the website
without logging into the site with their MySpace user accounts.
If we only use the user views data from authenticated users,
more than 40% of the total user views will not be used. In
our recommendation pipeline, we use cookies to represent
users instead of users accounts. When two videos are watched
by a number of users within a certain period of time, it
is an indication that both videos interest the users in some
way. And such user views data are usually collected by the
recommendation systems based on Collaborative Filtering,
such as the one used by Amazon.com [11]. We used the
video views data collected over a moving window of last
90-day period. It was also shown in [13] that the majority
of the videos uploaded to the social website accumulated a
very low percentage of their total views after 90 days since
uploaded. During an averaged 90-day period, there were more
than 382 million video views in which 13.4 million videos
were consumed by 113 million users (both authenticated users
and guests). Nearly 4.1 million videos were viewed only once
while the most popular videos were watched by nearly 1
millions of users. On average, a video was viewed by about 30
users. A similar picture can be seen from another aspect: more
than 65 million users watched only 1 video during the 90 days
period. On average, one user watched about 3 videos. About
166 million video views (nearly 43% of total views) were on
only 3.8K popular videos (with 10K or more viewers). Based
on above view data there were 382 million recommendations
generated, covering 2.2 millions videos which is 16% of total
videos that were viewed at least once in the 90-day period.
On average, there are 56 recommendations per video.
Fig. 1.
B. Video Metadata
Fig. 2.
128
2.
Input:
Input:
V. A PPROACHES
A. Collaborative Filtering
We use item-to-item based collaborative filtering method
that is similar to [11]. As discussed in previous section, we
are generating recommendation not on user level, but finding
videos with similar content, as half of our videos consumed
are from visitors (We can always provide more personalized
recommendation for users, by using their video watching
history in 90-day window). Its possible to compute the similarity between two videos in various ways, including cosine
measure, local popularity and Jaccard index [6]. We choose
local popularity because it provides the highest recommendation coverage without significantly losing the recommendation
quality. The coverage is defined as the percentage of videos
that receive recommendations among all videos. The following
algorithm calculates the similarity between video pairs using
local popularity and generates the recommendation index.
ALGORITHM 1: Recommendation Index Generation
129
Fig. 3.
TABLE I
F EATURE WEIGHT GIVEN BY METADATA TYPE OR POS TAGS
(1)
|D|
log
|dj : ti dj |
(2)
Weight
Tag
Noun Phrase
0.8
Proper Noun
0.8
0.8
Noun
Verb
0.7
Adjective
0.5
Adverb
0.2
130
TABLE II
C OMPUTATIONAL T IME AND C OVERAGE R ATE A NALYSIS .
Running Time (mins)
Initial
Coverage Rate
Subsequent
CF
16.21%
CB
27
16
58.22%
131
TABLE III
U SER - DEFINED C ATEGORY BASED M EAN AVERAGE P RECISION
A CCURACIES .
Categories
Animals
Animation/CGI
Automotive
Comedy and Humor
Entertainment
Extreme Videos
Instructional
Music
News and Politics
Schools and Education
Science and Technology
Video Blogging
Sports
Travel and Vacations
Video Games
Weird Stuff
Overall
CF
0.3421
0.5540
0.7149
0.7003
0.7621
0.6599
0.4093
0.7345
0.4347
0.4720
0.2577
0.7262
0.7987
0.6494
0.4799
0.7200
0.5885
CFprm
0.5911
0.7738
0.6198
0.7364
0.7840
0.4641
0.8087
0.6720
0.4166
0.3315
0.3423
0.6264
0.7025
0.3568
0.7595
0.5227
0.5943
CB
0.2217
0.3026
0.4103
0.3788
0.4341
0.2758
0.2944
0.5382
0.2524
0.2028
0.3247
0.2857
0.4137
0.2721
0.2903
0.3084
0.3254
CBprm
0.5061
0.6730
0.5866
0.6103
0.9120
0.4264
0.5838
0.6572
0.5205
0.3900
0.3878
0.6647
0.7426
0.4718
0.5082
0.4608
0.5689
Fig. 5.
B. Click-through Evaluation
In this section we show the click-through comparison study
of the proposed video recommendation system on MySpace
Video site of one days data on Nov 16, 2009. This is only one
days worth of data, but other days show similar performance.
On MySpace Video site, for each public viewable video,
recommendations is showing in a module along with another
module showing editorially-selected Featured Videos. This
set up helps us to measuring absolute performance of recommendation, in addition to comparing it to the manually
created list. Figure 6 shows information for the top 5000
videos shown in US for that day. Recommended videos have
an average click-through rate of 11.3%, with over 25% of the
recommended videos having a click-through rate of larger than
15%. By comparison, Featured Videos had a click-through
rate of 0.36%.
132
Fig. 6.
VII. C ONCLUSIONS
133