WI: IWI
August 31st, 2010
• Introduction
• Analysis
• Methodology
• Results and Application
• Challenges and Future Work
• Conclusion
Introduction
Twitter as a news channel
[Figure: example Twitter timeline with posts from Lisa (12:30 AM) and John (12:15 AM), including the message "Earthquake in Tokyo!" and a retweet (RT)]
Methodology
Method for Collecting, Indexing and Grouping
‣ Collecting
Conditions
• A message in a group must be related to the first story
• Later messages can build upon previous messages
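The two grouping conditions above can be sketched as a greedy, single-pass procedure. This is only an illustration under stated assumptions: the word-overlap similarity, the `threshold` value, and the names `tokens` and `group_messages` are hypothetical and do not come from the slides.

```python
import re


def tokens(text):
    """Return the set of lowercase word tokens in a message."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def group_messages(messages, threshold=0.5):
    """Greedily assign each message to a group or start a new one.

    Assumed reading of the two conditions:
    - a message must share at least one term with the group's FIRST message;
    - similarity is measured against all terms seen so far in the group,
      so later messages can build on earlier ones.
    """
    groups = []  # each group: {"first": set, "terms": set, "messages": list}
    for text in messages:
        words = tokens(text)
        if not words:
            continue
        best, best_score = None, 0.0
        for g in groups:
            if not (words & g["first"]):
                continue  # not related to the first story
            # Fraction of this message's terms already present in the group.
            score = len(words & g["terms"]) / len(words)
            if score > best_score:
                best, best_score = g, score
        if best is not None and best_score >= threshold:
            best["messages"].append(text)
            best["terms"] |= words  # the group's vocabulary grows
        else:
            groups.append({"first": words, "terms": set(words), "messages": [text]})
    return groups


if __name__ == "__main__":
    sample = [
        "Earthquake in Tokyo!",
        "RT Earthquake in Tokyo!",
        "Tokyo earthquake magnitude 6",
        "New phone released today",
    ]
    for g in group_messages(sample):
        print(g["messages"])
```

A real system would likely remove stop words and use a weighted similarity; the plain overlap here only keeps the example short.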
Jenny Rose Finkel, Trond Grenager, and Christopher Manning. 2005. Incorporating Non-local Information
into Information Extraction Systems by Gibbs Sampling. Proceedings of the 43rd Annual Meeting of the
Association for Computational Linguistics (ACL 2005), pp. 363-37
Method for Group Ranking
• A group score is based on reliability, popularity and freshness factors.
‣ Reliability comes from the number of followers of the user who posted a message.
‣ Popularity comes from the number of retweets.
‣ Freshness is computed from the difference between the current time and the time at which a message was posted.
The score for each group is computed as follows:
[Equation: group score formula]
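As a rough illustration of how the three factors could be combined, the sketch below scores a group from follower counts, retweet counts and message age. The exact formula on the slide is not reproduced here: the logarithmic damping, the exponential half-life decay, and the names `Message`, `group_score` and `half_life_hours` are all assumptions made for this example.

```python
import math
from dataclasses import dataclass


@dataclass
class Message:
    followers: int    # followers of the posting user (reliability signal)
    retweets: int     # number of retweets (popularity signal)
    posted_at: float  # POSIX timestamp in seconds


def group_score(messages, now, half_life_hours=6.0):
    """Sum per-message scores; older messages contribute less.

    Assumed form: (log(1 + followers) + log(1 + retweets)) * freshness,
    where freshness halves every `half_life_hours`.
    """
    total = 0.0
    for m in messages:
        age_hours = max(0.0, now - m.posted_at) / 3600.0
        freshness = 0.5 ** (age_hours / half_life_hours)
        reliability = math.log1p(m.followers)
        popularity = math.log1p(m.retweets)
        total += (reliability + popularity) * freshness
    return total


if __name__ == "__main__":
    now = 1_000_000.0
    fresh = [Message(followers=1000, retweets=10, posted_at=now)]
    stale = [Message(followers=1000, retweets=10, posted_at=now - 12 * 3600)]
    print(group_score(fresh, now), group_score(stale, now))
```

With these assumptions, two otherwise identical groups differ only by freshness, so the older one ranks lower.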
Results and Application
• A prototype application called Hotstream was developed.
Detection Effectiveness
[Table: detection rates by method. Methods compared: search query; grouping by TF-IDF with proper noun term boosting]
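To make "TF-IDF with proper noun term boosting" concrete, here is a minimal sketch that weights terms by TF-IDF and multiplies the weight of likely proper nouns by a constant. The deck cites a named-entity recognizer (Finkel et al., 2005); this example swaps in a crude capitalization heuristic instead, and the `boost` value and the function names `weighted_vectors` and `cosine` are assumptions.

```python
import math
import re
from collections import Counter


def weighted_vectors(messages, boost=2.0):
    """Return one {term: weight} dict per message.

    Weight = term frequency * smoothed IDF, multiplied by `boost` when the
    term looks like a proper noun (capitalized and not the first word).
    """
    docs = []
    for text in messages:
        words = re.findall(r"[A-Za-z0-9']+", text)
        tf = Counter(w.lower() for w in words)
        proper = {w.lower() for i, w in enumerate(words) if i > 0 and w[0].isupper()}
        docs.append((tf, proper))

    n = len(docs)
    df = Counter(t for tf, _ in docs for t in tf)  # document frequency per term

    vectors = []
    for tf, proper in docs:
        vec = {}
        for term, count in tf.items():
            idf = math.log((1 + n) / (1 + df[term])) + 1.0  # smoothed IDF
            weight = count * idf
            if term in proper:
                weight *= boost
            vec[term] = weight
        vectors.append(vec)
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


if __name__ == "__main__":
    msgs = ["Earthquake in Tokyo!", "Strong quake shakes Tokyo", "Earthquake drill in school"]
    plain = weighted_vectors(msgs, boost=1.0)
    boosted = weighted_vectors(msgs, boost=2.0)
    print(cosine(plain[0], plain[1]), cosine(boosted[0], boosted[1]))
```

In this toy data the first two messages share only the proper noun "Tokyo", so boosting raises their similarity and makes them more likely to be grouped together.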
Example Dataset
Messages-Named Entities
Top 18 stories and their keywords from Hotstream as of July 21st, 2010
Red nodes = keywords, Yellow nodes = message groups
Community Detection Experiment
Network Characteristics
Network Type Edge betweenness
No. Clusters 40
Purity 0.67
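For readers unfamiliar with the purity figure, the sketch below computes it under the conventional definition: the fraction of items that belong to the majority class of their cluster. It assumes the experiment used this standard definition; the function name `purity` and the toy labels are hypothetical.

```python
from collections import Counter, defaultdict


def purity(cluster_ids, true_labels):
    """Fraction of items whose true label is the majority label of their cluster."""
    if len(cluster_ids) != len(true_labels):
        raise ValueError("cluster_ids and true_labels must have the same length")
    if not cluster_ids:
        return 0.0
    by_cluster = defaultdict(Counter)
    for cluster, label in zip(cluster_ids, true_labels):
        by_cluster[cluster][label] += 1
    majority_total = sum(max(counts.values()) for counts in by_cluster.values())
    return majority_total / len(cluster_ids)


if __name__ == "__main__":
    clusters = [0, 0, 0, 1, 1, 1]
    labels = ["a", "a", "b", "b", "b", "c"]
    print(purity(clusters, labels))
```

A purity of 1.0 means every cluster contains a single class; values closer to 0 indicate heavily mixed clusters.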
Conclusion
• Introduced Twitter as a means of conveying news