Beruflich Dokumente
Kultur Dokumente
choose hurricane data, here many accurate models for predicting track
movement has been developed but coming to intensity prediction are
still inaccurate. So we proposed the hurricane intensity prediction
based on weighted feature learning and extensible markov model.
Here we used Java to build the algorithms, R for testing and
visualization. I am the first person in team to analyze and understand
the data. Its in-unstructured format, it took like couple of days to
understand the data and its dimensionality. Here the interesting point
is not all the parameters are affecting in predicting the intensity
prediction. So applied the weighted feature learning method where we
initially used the traditional genetic algorithm to give the necessary
weights to the features and later on extended to different weight
feature learning methods. I even implemented the existing algorithm
called SHIPS that was developed in python. Our method was
outperformed with the existing method, to evaluate we applied
absolute error and k-fold cross validation. NOAA along with other
intensity prediction models has used the developed model. This is the
major achievement by our team to forecast the intensity prediction.
1.
can analyze the moving object behavior which is nothing but drivers
behavior.
2.
Sudheer
Chelluboina
and
Michael
Hahsler,
Trajectory
Methodology:
What
are
methods
by
which
you
could
reduce
the
Suppose that you have access to the the following data for all
trips on Uber (date and time, trip origin (lat, long), destination
(lat, long), travel distance, Uber car type, driver ID). Describe a
sampling methodology that would produce independent and
identically distributed (IID) samples for the travel distance.
How would you demonstrate that the data is IID?
From the definition it is evident that the variable travel distance is
independent and identically distributed if they have the same
probability distribution and also they are independent to each reading.
To satisfy the sample data is independent and identically distributed:
systematic trend
Variance over the data will not show increasing or decreasing or
systematic trend
frequent trips.
Now overlaid the data points with orgin location of the trip. Here
the data to be considered is the time and origin. For time
attribute I create the labels as hourly, for example time between
11:00AM and 12:00PM will be labeled as 11, so this attribute will
have 24 labels for 24 hours. These labels can be increased or
decreased based on the requirement. We know how the
probability works based on the more and less labels for time.
Now clustering process, by considering the observation as
<time_label, lat, long>. We apply the distance based hierarchal
clustering, that this algorithms should group the origins by some
distance and on top of if by time_label. By this algorithm we can
find the probability of passengers at that particular location is
some value at that particular time. We can set the probabilities
the trip.
As having the mobile app building experience, the abovecalculated values at the system can be represented in an
understanding way. Like, In a app a options are given to the
driver to check the probability of passengers taking the trip at
particular location or current location. This is for the users who
are not used to visual representation.