
On Summarization and Timeline Generation for Evolutionary Tweet Streams

Abstract:

Short-text messages such as tweets are being created and shared at an unprecedented rate.
Tweets in their raw form, while informative, can also be overwhelming. For both end users
and data analysts, it is a nightmare to plow through millions of tweets that contain an
enormous amount of noise and redundancy. In this paper, we propose a novel continuous
summarization framework called Sumblr to alleviate the problem. In contrast to traditional
document summarization methods, which focus on static and small-scale data sets, Sumblr is
designed to deal with dynamic, fast-arriving, and large-scale tweet streams. Our proposed
framework consists of three major components. First, we propose an online tweet stream
clustering algorithm to cluster tweets and maintain distilled statistics in a data structure called
the tweet cluster vector (TCV). Second, we develop a TCV-Rank summarization technique for
generating online summaries and historical summaries of arbitrary time durations. Third, we
design an effective topic evolution detection method, which monitors summary-based/volume-based
variations to produce timelines automatically from tweet streams. Our experiments on
large-scale real tweets demonstrate the efficiency and effectiveness of our framework.
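
As a rough illustration of the first component, the sketch below shows one way an online clustering step could maintain compact per-cluster statistics. It is not the Sumblr algorithm itself. The abstract only says that a TCV holds "distilled statistics", so the fields of `ClusterStats`, the bag-of-words term counts, the cosine-similarity measure, and the `threshold` parameter are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical, simplified stand-in for a tweet cluster vector (TCV).
final class ClusterStats {
    final Map<String, Integer> termCounts = new HashMap<>(); // summed term frequencies
    int tweetCount = 0;   // number of tweets absorbed so far
    long firstTimestamp;  // earliest tweet time seen in this cluster
    long lastTimestamp;   // latest tweet time seen in this cluster

    void absorb(Map<String, Integer> terms, long timestamp) {
        if (tweetCount == 0) {
            firstTimestamp = timestamp;
            lastTimestamp = timestamp;
        } else {
            firstTimestamp = Math.min(firstTimestamp, timestamp);
            lastTimestamp = Math.max(lastTimestamp, timestamp);
        }
        terms.forEach((term, count) -> termCounts.merge(term, count, Integer::sum));
        tweetCount++;
    }
}

final class OnlineTweetClusterer {
    private final double threshold; // assumed minimum similarity for joining a cluster
    private final List<ClusterStats> clusters = new ArrayList<>();

    OnlineTweetClusterer(double threshold) {
        this.threshold = threshold;
    }

    List<ClusterStats> clusters() {
        return clusters;
    }

    // Assigns the tweet to the most similar cluster, or opens a new one.
    // Returns the index of the cluster that absorbed the tweet.
    int add(String text, long timestamp) {
        Map<String, Integer> terms = tokenize(text);
        int best = -1;
        double bestSim = -1.0;
        for (int i = 0; i < clusters.size(); i++) {
            double sim = cosine(terms, clusters.get(i).termCounts);
            if (sim > bestSim) {
                bestSim = sim;
                best = i;
            }
        }
        if (best < 0 || bestSim < threshold) {
            clusters.add(new ClusterStats());
            best = clusters.size() - 1;
        }
        clusters.get(best).absorb(terms, timestamp);
        return best;
    }

    static Map<String, Integer> tokenize(String text) {
        Map<String, Integer> counts = new HashMap<>();
        for (String tok : text.toLowerCase(Locale.ROOT).split("[^a-z0-9#@]+")) {
            if (!tok.isEmpty()) {
                counts.merge(tok, 1, Integer::sum);
            }
        }
        return counts;
    }

    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0, normA = 0, normB = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            normA += (double) e.getValue() * e.getValue();
            Integer other = b.get(e.getKey());
            if (other != null) {
                dot += (double) e.getValue() * other;
            }
        }
        for (int v : b.values()) {
            normB += (double) v * v;
        }
        if (normA == 0 || normB == 0) {
            return 0.0;
        }
        return dot / Math.sqrt(normA * normB);
    }
}
```

Each incoming tweet touches only the aggregated statistics, never the stored tweets themselves, which is the property that makes a single pass over a stream feasible. A real system would also need cluster merging and expiry, which this sketch leaves out.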

HARDWARE REQUIREMENTS

PROCESSOR: Intel Core i3
RAM: 3 GB DDR RAM
HARD DISK: 40 GB

SOFTWARE REQUIREMENTS

The software requirements document is the specification of the system. It should include both a
definition and a specification of requirements. It states what the system should do rather than
how it should do it. The software requirements provide a basis for creating the software
requirements specification, which is useful for estimating cost, planning team activities,
performing tasks, and tracking the team's progress.

OPERATING SYSTEM: Any OS (Java is platform independent)
WEB TECHNOLOGIES: HTML, CSS, JavaScript, jQuery
FRONT END: JSP
BACK END: JEE (Servlets, JDBC)
DB: MySQL
SERVER: Tomcat
IDE: NetBeans

Modules:

1. Secured Credentials:

In this module, users must register with the application. Registration secures access and
also makes it easy to track user activity.

After registering, a user can log in with his or her credentials, and the application renders
the home page that corresponds to the user type. The e-mail address serves as the unique login
ID. If the user enters invalid details, an error page is displayed.

After a successful login, a normal user is offered options such as add friend, invite friends,
post tweets, and summarize tweets.
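
The login flow described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the `UserType` values, the page names `userHome.jsp`, `adminHome.jsp`, and `error.jsp`, and the in-memory map (standing in for the USERDETAILS table accessed through JDBC) are all hypothetical. Storing a salted PBKDF2 hash instead of the plain password is likewise a choice made here, not something the source specifies.

```java
import java.security.GeneralSecurityException;
import java.security.MessageDigest;
import java.security.SecureRandom;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import javax.crypto.SecretKeyFactory;
import javax.crypto.spec.PBEKeySpec;

final class LoginService {
    enum UserType { NORMAL, ADMIN } // hypothetical user types

    private static final class Account {
        final UserType type;
        final byte[] salt;
        final byte[] hash;

        Account(UserType type, byte[] salt, byte[] hash) {
            this.type = type;
            this.salt = salt;
            this.hash = hash;
        }
    }

    // In-memory stand-in for the USERDETAILS table, keyed by e-mail address.
    private final Map<String, Account> accountsByMail = new HashMap<>();
    private final SecureRandom random = new SecureRandom();

    // Returns false if the e-mail address is already registered.
    boolean register(String mail, char[] password, UserType type) {
        String key = normalize(mail);
        if (accountsByMail.containsKey(key)) {
            return false;
        }
        byte[] salt = new byte[16];
        random.nextBytes(salt);
        accountsByMail.put(key, new Account(type, salt, hash(password, salt)));
        return true;
    }

    // Returns the page to render: a home page chosen by user type, or the error page.
    String login(String mail, char[] password) {
        Account account = accountsByMail.get(normalize(mail));
        if (account == null
                || !MessageDigest.isEqual(account.hash, hash(password, account.salt))) {
            return "error.jsp";
        }
        return account.type == UserType.ADMIN ? "adminHome.jsp" : "userHome.jsp";
    }

    private static String normalize(String mail) {
        return mail.trim().toLowerCase(Locale.ROOT);
    }

    private static byte[] hash(char[] password, byte[] salt) {
        try {
            PBEKeySpec spec = new PBEKeySpec(password, salt, 10_000, 256);
            return SecretKeyFactory.getInstance("PBKDF2WithHmacSHA256")
                    .generateSecret(spec)
                    .getEncoded();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("PBKDF2 is not available", e);
        }
    }
}
```

In a servlet, the string returned by `login` would typically be handed to a request dispatcher or redirect. The same error page is returned for an unknown e-mail address and a wrong password, so the response does not reveal which addresses are registered.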

2. Post Tweets:

After a successful login, the user can tweet messages and post views on the wall.

3. Follow Friends:

The application lets a user follow other users, adding them as friends in order to follow
their tweets.

4. Retweet Posts:

Users can reply to or retweet a posted comment, including comments posted by their
followers.

5. Summarize the tweets:

Many people post tweets and comments, far more than a user can read, analyze, or use to
form an opinion. The application therefore offers a summarization option, which
summarizes the tweets posted between a chosen from date and to date.
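
One simple way to realize a date-range summary is sketched below. It is an extractive heuristic chosen for illustration and is not the TCV-Rank technique named in the abstract. The `Tweet` class, the `maxTweets` parameter, and the scoring rule (the average number of in-range tweets that share each of a tweet's terms) are assumptions.

```java
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

final class DateRangeSummarizer {
    // Hypothetical in-memory view of a row in the TWEETS table.
    static final class Tweet {
        final String message;
        final LocalDate date;

        Tweet(String message, LocalDate date) {
            this.message = message;
            this.date = date;
        }
    }

    // Returns up to maxTweets representative messages posted between from and to (inclusive).
    static List<String> summarize(List<Tweet> tweets, LocalDate from, LocalDate to, int maxTweets) {
        // 1. Keep only tweets inside the requested date range.
        List<Tweet> inRange = new ArrayList<>();
        for (Tweet t : tweets) {
            if (!t.date.isBefore(from) && !t.date.isAfter(to)) {
                inRange.add(t);
            }
        }

        // 2. Count, for each term, how many in-range tweets contain it.
        Map<String, Integer> docFreq = new HashMap<>();
        for (Tweet t : inRange) {
            for (String term : terms(t.message)) {
                docFreq.merge(term, 1, Integer::sum);
            }
        }

        // 3. Rank tweets so that those made of widely shared terms come first.
        inRange.sort(Comparator.comparingDouble((Tweet t) -> score(t, docFreq)).reversed());

        // 4. Take the top tweets, skipping exact duplicates to reduce redundancy.
        List<String> summary = new ArrayList<>();
        Set<String> seen = new HashSet<>();
        for (Tweet t : inRange) {
            if (summary.size() >= maxTweets) {
                break;
            }
            if (seen.add(t.message.trim().toLowerCase(Locale.ROOT))) {
                summary.add(t.message);
            }
        }
        return summary;
    }

    private static double score(Tweet t, Map<String, Integer> docFreq) {
        Set<String> terms = terms(t.message);
        if (terms.isEmpty()) {
            return 0.0;
        }
        double sum = 0;
        for (String term : terms) {
            sum += docFreq.getOrDefault(term, 0);
        }
        return sum / terms.size();
    }

    private static Set<String> terms(String message) {
        Set<String> out = new HashSet<>();
        for (String tok : message.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!tok.isEmpty()) {
                out.add(tok);
            }
        }
        return out;
    }
}
```

Averaging over a tweet's distinct terms keeps long tweets from winning on length alone. In the application, the `from` and `to` values would come from the summarization form, and the tweet list from a query on TWEET_DT.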

Screenshots: Home page, Register page, Login page, User Success Login page, Post New Tweet.

Diagrams: Use Case Diagram, Sequence Diagram, Activity Diagram, Collaboration Diagram.

Database Structure:

USERDETAILS:

USERID  NAME   MAIL         MOBILE      PASSWORD
1       USER1  a@gmail.com  9898999998  *****
2       USER2  b@gmail.com  9637634636  *****

FOLLOWERS:

ID  FOLLOWER_ID  FOLLOWED_BY_ID
1   2            1
2   3            1

TWEETS:

TWEET_ID  TWEETED_BY_ID  TWEET_MSG  TWEET_DT

RETWEET:

RETWEET_ID  TWEET_ID  RE_TWEET_MSG  RE_TWEET_DT
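
The four tables above could be created through JDBC along the following lines. The source lists only column names, so every column type, length, key, and foreign-key constraint in this sketch is an assumption, as are the class name `SchemaSketch` and the method `createAll`. The PASSWORD column is assumed to hold a password hash, not the plain text.

```java
import java.sql.Connection;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

final class SchemaSketch {
    // Column names follow the document; types and constraints are assumed.
    static final String USERDETAILS =
        "CREATE TABLE USERDETAILS ("
        + " USERID INT AUTO_INCREMENT PRIMARY KEY,"
        + " NAME VARCHAR(100) NOT NULL,"
        + " MAIL VARCHAR(255) NOT NULL UNIQUE,"   // mail is the unique login ID
        + " MOBILE VARCHAR(15),"
        + " PASSWORD VARCHAR(255) NOT NULL"       // assumed to store a hash
        + ")";

    static final String FOLLOWERS =
        "CREATE TABLE FOLLOWERS ("
        + " ID INT AUTO_INCREMENT PRIMARY KEY,"
        + " FOLLOWER_ID INT NOT NULL,"
        + " FOLLOWED_BY_ID INT NOT NULL,"
        + " FOREIGN KEY (FOLLOWER_ID) REFERENCES USERDETAILS(USERID),"
        + " FOREIGN KEY (FOLLOWED_BY_ID) REFERENCES USERDETAILS(USERID)"
        + ")";

    static final String TWEETS =
        "CREATE TABLE TWEETS ("
        + " TWEET_ID INT AUTO_INCREMENT PRIMARY KEY,"
        + " TWEETED_BY_ID INT NOT NULL,"
        + " TWEET_MSG VARCHAR(255) NOT NULL,"
        + " TWEET_DT DATETIME NOT NULL,"
        + " FOREIGN KEY (TWEETED_BY_ID) REFERENCES USERDETAILS(USERID)"
        + ")";

    static final String RETWEET =
        "CREATE TABLE RETWEET ("
        + " RETWEET_ID INT AUTO_INCREMENT PRIMARY KEY,"
        + " TWEET_ID INT NOT NULL,"
        + " RE_TWEET_MSG VARCHAR(255),"
        + " RE_TWEET_DT DATETIME NOT NULL,"
        + " FOREIGN KEY (TWEET_ID) REFERENCES TWEETS(TWEET_ID)"
        + ")";

    // Ordered so that referenced tables are created before the tables that reference them.
    static final List<String> ALL =
        Collections.unmodifiableList(Arrays.asList(USERDETAILS, FOLLOWERS, TWEETS, RETWEET));

    // Runs every CREATE TABLE statement on an already opened connection.
    static void createAll(Connection conn) throws SQLException {
        try (Statement st = conn.createStatement()) {
            for (String ddl : ALL) {
                st.executeUpdate(ddl);
            }
        }
    }
}
```

An index on TWEETS.TWEET_DT would likely help the from-date/to-date summarization query, but whether the project defines one is not stated.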
