Beruflich Dokumente
Kultur Dokumente
Big Data
BY
V. KRISHNA CHAITANYA
Evolution of Big Data Platform
1. Interest and excitement around Big data has always been on the rise over the past few years with
full blown implementations.
2. Mining petabytes of data for value is essential for businesses today which calls for a scalable storage
and distributed approach.
3. Most of the big data systems (e.g. Hadoop) available today have batch oriented nature for querying
data, which is time consuming which Hadoop vendors find it challenging in analyzing the data.
4. Also, a significantly different skill-set (scala, R, python etc..) is required in consuming this kind of
data.
Questions that are in the minds of customers
Can I use my SQL developers to work on Hadoop data?
Can my data scientists access Hadoop data without needing to worry about data
integration issues?
Can I run queries on Hadoop data without waiting hours for the result?
So clearly there is a need for a platform that can remove data latency and quickly analyze data in a single,
user friendly environment by providing a unifying layer to work with big data.
This will enable users to seamlessly integrate different temperatures of data and give all business personas
2. In simpler terms Vora is technically a plug-in for Apache Spark that produces accelerated results by
processing Hadoop data in memory.
3. Vora uses SPARK SQL library and HANA computing engine to provide interactive analysis for the end
users.
4. VORA handles OLAP analysis & hierarchical queries very well as it does layers in few
enhancements to Spark SQL.
1. SAP HANA Vora provides a distributed computing framework at enterprise scale to be able to
accelerate, innovate, and simplify your data.
2. Being able to bring these two together, Vora provides that layer on top of Hadoop.
HANA VORA Engine Architecture
1. HDFS (Hadoop Distributed File System) is the
core of Hadoop and storage part that can store
huge volumes of data i.e. Terabytes &
Petabytes whether it is structured or
unstructured.
Use Case 2: Predictive maintenance for automobiles using sensor data in Hadoop Companies can track automotive
performance though the continuous monitoring of sensor data. With Vora, companies can integrate real-time streaming data from
devices with customer master and transaction data stored in HANA/ERP to help improve vehicular safety. The ability to infuse enterprise
data with up-to-the-moment data from external sensors allows business to make contextually savvy decisions and improve processes.
Use Case 3: Track lost airline baggage using RFID With RFID tracking enabled for all airline baggage, the data stored in Hadoop
can be queried within minutes using Voras boosted SQL performance. This helps airlines improve the lost baggage metric and reduce
costs. Vora developers can quickly build queries on Hadoop tables, which enables businesses to keep pace with up-to-date operational
scenarios.
VORA Modeler
1. VORA modeler consists of three main components:
Data browser
SQL editor
Modeler
Data Browser
The data browser allows you to quickly display the relations such as tables, views, and cubes in
SAP HANA Vora without having to write a query.
SQL Editor
In the SQL editor, you can execute SQL only by typing in the SQL queries and execute those queries by
clicking the execute button after selecting the respective query.
Modeler
Data modeling refers to an activity of refining or dividing data in database tables by creating views that
portrays a business scenario.
THANK YOU