Presented by
Batch A6
B. Mounika (15U91A0511)
M. Maneesha (15U91A0576)
G. Jeevana Jyothi (15U91A0532)
A. Durga Tirumala Rayudu (15U91A0503)
B. Vijay Sai (15U91A0512)
Under the guidance of
Mr. G. Dilip Kumar, M.Tech.,
Assistant Professor
Contents
• Abstract
• Introduction
• Existing System
• Proposed System
• Modules
• Architecture
• System Design
• Implementation
• Code
• System Requirement Specifications
* Functional Requirements
* Non Functional Requirements
->Software Requirements
->Hardware Requirements
• Conclusion
• Future Enhancement
ABSTRACT
Our project aims to analyse commuting patterns,
neighbourhoods, traffic, tipping patterns, taxi fares, and more
in urban communities. The purpose is to extract useful insights
so that any solutions we derive can be mapped to other large
cities. The dataset includes records of all trips completed in
green taxis in NYC in 2015. Records include fields capturing
pick-up and drop-off dates/times, pick-up and drop-off
locations, trip distances, itemized fares, rate types, payment
types, and driver-reported passenger counts.
INTRODUCTION
• The dataset contains close to 200 gigabytes of New York City
Yellow Cab and Green Taxi trip records.
• It holds detailed records of over 1.1 billion individual taxi
trips in the city from January 2009 through December 2016 [2].
• Each record includes pick-up and drop-off dates/times, precise
pick-up and drop-off location coordinates, trip distances,
itemized fares, and payment method.
• Figure 1 depicts a heat map of the NYC taxi trips [1].
Figure 1: NYC taxi trips heat map
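To make the record structure concrete, here is a minimal sketch in base R. The column names and the five sample rows are hypothetical stand-ins, not the real dataset schema; the sketch only illustrates how one tipping summary could be derived from the fare and payment fields.

```r
# Hypothetical sample records; column names are simplified assumptions,
# not the actual field names of the NYC taxi dataset.
trips <- data.frame(
  payment_type  = c("card", "card", "cash", "card", "cash"),
  trip_distance = c(1.2, 4.8, 0.9, 3.1, 2.4),   # miles
  fare_amount   = c(10, 20, 8, 15, 12),         # dollars
  tip_amount    = c(2, 5, 0, 3, 0)              # dollars
)

# Tip as a percentage of the fare, per trip.
trips$tip_pct <- 100 * trips$tip_amount / trips$fare_amount

# Average tip percentage for each payment type.
avg_tip <- aggregate(tip_pct ~ payment_type, data = trips, FUN = mean)
print(avg_tip)
```

In this made-up sample the cash rows carry no tip, so any comparison between payment types should be read with that in mind.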
EXISTING SYSTEM
• In the existing system, the data is analysed using worksheets,
which does not yield adequate results.
• As a result, the company cannot determine actual figures, peak
periods, or its earnings.
• All details are stored in a database, which leaves the data
cluttered and makes it difficult to analyse the trips people
make and to count them.
DISADVANTAGES
• The application is not user friendly and lacks a good user
interface, which leaves users with a poor impression of it.
• Daily updates have to be loaded into the dataset and
processed.
• No proper methods are used in the application.
• No logical packages are available for filtering text.
• The libraries are not efficient for analysis, and user data is
never considered when enhancing the product.
PROPOSED SYSTEM
• The proposed system provides a dynamic interface for exploring
the data.
• It finds the peak hours in the trip data.
• We use R to analyse the log files. R has many built-in methods
and libraries to work with, and programming in it is easy.
• The application is developed with newer technologies and
specific methods from these libraries.
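Finding the peak hours can be sketched in a few lines of base R. The pick-up timestamps below are invented sample values and the variable names are assumptions; the project's actual code may differ.

```r
# Hypothetical pick-up timestamps standing in for the trip data.
pickup_times <- as.POSIXct(
  c("2015-01-05 08:05:00", "2015-01-05 08:40:00",
    "2015-01-05 17:10:00", "2015-01-05 17:25:00",
    "2015-01-05 17:55:00", "2015-01-05 23:30:00"),
  tz = "America/New_York"
)

# Extract the hour of day (0-23) from each timestamp.
pickup_hours <- as.integer(format(pickup_times, "%H"))

# Count trips per hour, keeping hours with zero trips in the table.
trips_by_hour <- table(factor(pickup_hours, levels = 0:23))

# The peak hour is the hour with the highest trip count.
peak_hour <- as.integer(names(trips_by_hour)[which.max(trips_by_hour)])
print(peak_hour)
```

Keeping all 24 levels in the table means the result can be plotted directly with `barplot(trips_by_hour)` without gaps for quiet hours.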
ADVANTAGES
• R has a huge number of packages that work efficiently on
heavy data and on many data formats.
• R can be connected to Hadoop to work on big data.
• It is a user-friendly programming language.
• Packages are available from the CRAN repository.
• Most of the analysis is done with packages used as built-in libraries.
MODULES:
[Module diagram; labels recovered from the figure]
• Components: Client, Shiny, Server
• Activities: Maintain data, Update data, Install shiny, Load shiny,
Load data to buffer, Generate the results, Process the data,
Create user interface, Take the data from R buffer, Serves UI.R,
Serves server.R, Process the request, Generate instant results,
Store data, Manage data, Store results
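The module labels name a UI definition (UI.R) and server logic (server.R). A minimal Shiny sketch of that split follows. The sample data, the helper `trips_per_hour()`, and the input and output ids are all hypothetical, and the Shiny part is guarded so the helper still runs where the shiny package is not installed.

```r
# Hypothetical sample standing in for the trips loaded into the R buffer.
trips <- data.frame(
  pickup_hour = c(8, 8, 9, 17, 17, 17, 18, 23),
  fare_amount = c(7.5, 12.0, 9.0, 15.5, 6.0, 22.0, 11.0, 30.0)
)

# Hypothetical helper: number of trips starting in each hour of a range.
trips_per_hour <- function(data, from = 0, to = 23) {
  in_range <- data$pickup_hour >= from & data$pickup_hour <= to
  table(factor(data$pickup_hour[in_range], levels = from:to))
}

if (requireNamespace("shiny", quietly = TRUE)) {
  library(shiny)

  # UI part (what ui.R would hold): a slider and a plot placeholder.
  ui <- fluidPage(
    titlePanel("Taxi trips by hour"),
    sliderInput("hours", "Pick-up hour range",
                min = 0, max = 23, value = c(0, 23)),
    plotOutput("hourPlot")
  )

  # Server part (what server.R would hold): process the request and
  # regenerate the plot whenever the slider changes.
  server <- function(input, output) {
    output$hourPlot <- renderPlot({
      counts <- trips_per_hour(trips, input$hours[1], input$hours[2])
      barplot(counts, xlab = "Pick-up hour", ylab = "Trips")
    })
  }

  # Build the app object; it is not started here.
  app <- shinyApp(ui = ui, server = server)
}
```

Calling `runApp(app)` in an interactive session would start the app. Keeping the counting logic in a plain function makes it checkable without launching Shiny.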
CLASS DIAGRAM:
SEQUENCE DIAGRAM:
COLLABORATION DIAGRAM:
STATE CHART DIAGRAM:
COMPONENT DIAGRAM:
DEPLOYMENT DIAGRAM:
IMPLEMENTATION
• R is a programming language and software environment
supported by the R Foundation for Statistical Computing.
• The R language is widely used among statisticians and data
miners for developing statistical software and for data
analysis.
• R was created by Ross Ihaka and Robert Gentleman at the
University of Auckland and is developed by the R Development
Core Team.
• R can handle date variables in several ways. There are built-in
R functions for working with date variables, and there are
also some useful contributed packages.
• One such R package, lubridate, contributed by Grolemund,
provides much more user-friendly functionality for handling
dates and times, with time zone support.
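The two approaches to date handling can be compared in a short sketch. The timestamps are invented sample values; the lubridate part is guarded because it is a contributed package that may not be installed.

```r
# Built-in date handling: parse timestamps with an explicit time zone.
pickup  <- as.POSIXct("2015-03-14 08:15:00", tz = "America/New_York")
dropoff <- as.POSIXct("2015-03-14 08:42:30", tz = "America/New_York")

# Trip duration in minutes.
duration_min <- as.numeric(difftime(dropoff, pickup, units = "mins"))

# Hour of day and day of week (0 = Sunday ... 6 = Saturday).
pickup_hour <- as.integer(format(pickup, "%H"))
pickup_wday <- as.POSIXlt(pickup)$wday

# The same instant shown in another time zone.
pickup_utc <- format(pickup, "%Y-%m-%d %H:%M", tz = "UTC")

# The lubridate equivalents, if the package is available.
if (requireNamespace("lubridate", quietly = TRUE)) {
  pickup_lub      <- lubridate::ymd_hms("2015-03-14 08:15:00",
                                        tz = "America/New_York")
  pickup_hour_lub <- lubridate::hour(pickup_lub)
  pickup_utc_lub  <- lubridate::with_tz(pickup_lub, "UTC")
}
```

The lubridate calls read closer to plain language (`hour()`, `with_tz()`), which is the ease of use the slide refers to.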
Cont..