You are on page 1of 28

SAP HANA Enterprise Information Management (EIM) Smart data integration & Smart data quality with HANA

SPS09 at Intel
ASUG Annual Conference

BI 1155

2014 SAP AG or an SAP affiliate company. All rights reserved.

May, 2015

Disclaimer
This presentation outlines our general product direction and should not be relied on in making
a purchase decision. This presentation is not subject to your license agreement or any other
agreement with SAP.
SAP has no obligation to pursue any course of business outlined in this presentation or to
develop or release any functionality mentioned in this presentation. This presentation and
SAPs strategy and possible future developments are subject to change and may be changed
by SAP at any time for any reason without notice.
This document is provided without a warranty of any kind, either express or implied, including
but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or
non-infringement. SAP assumes no responsibility for errors or omissions in this document,
except if such damages were caused by SAP intentionally or grossly negligent.

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

Agenda SAP HANA EIM Innovations


1.

Overview- EIM capabilities in SAP HANA - smart data integration, smart data quality and smart
data preparation*

2.

Intels co-innovation journey with SAP on HANA EIM

3.

Summary

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

SAP is simplifying the landscape with HANA while lowering


latency
Transactions

Traditional
SAP HANA
ETL

Mix of Potentially Stale


and Current Data

SAP HANA In
Memory Platform
Federation

Aggregate

Current Data

EIM Services
smart data quality (SDQ)
smart data integration (SDI)

Replication
Multiple Data Sources

On Premise and Cloud sources of Data

Separate ETL, Replication and Federation

Integrated Federation, ETL and


Replication
Op
RDBMS

Other
Sources

Simplified: Federation, Replication and Transformation, all within the SAP HANA platform
Open: Supports data any shape, style and size with an SDK for new data sources
Accelerated: In memory performance; lower latency
2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

SAP HANA EIM: Delivering EIM Capabilities Natively in HANA as Centralized Services
SAP Applications

One framework to support all styles of data


delivery/provisioning in a unified framework
Real time replication
Bulk/batch
Data federation
Enterprise knowledge graph with entity semantic
services to crawl, discover and infer relationships
automatically
EIM services in HANA consumed by SAP and partner
applications

Partner Applications

SAP HANA
EIM Services
smart data quality
Metadata & Semantics

DQ Assessment

Data Cleansing

Matching

Best Record

3rd Party Enrichment

Rules and Policies

Hierarchies

smart data integration

On Premise and Cloud sources of data , SAP and Non-SAP


Relational, semi-structured, and unstructured
Planned Innovations
2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

Innovate
SAP HANA Smart Data Integration
Simplifies the landscape, Reduces latency, Open framework
Simplified landscape with native EIM
capabilities

SAP HANA In Memory


Platform
SDQ
SDI

Integrated ETL, Replication and SDA

Integrated modeling environment with HANA studio and


HANA Web based development workbench
Built-in adapters for common sources ensuring
transactional consistency and guaranteed delivery
Open SDK for ecosystem to build custom adapters

2014 SAP SE or an SAP affiliate company. All rights reserved.

SDI Agent
ECC Adapter

Op
RDBMS

3rd party Db Adapters


HIVE adapter
Twitter, Flat file..

Twitter

Public

On Premise and Cloud sources of


Data

TCP/IP or HTTPS

Architected for On-Premise, Cloud and Hybrid


deployments

Open framework:

Current Data

EIM Services

Extends SAP HANA platform

Supports all styles of data delivery real-time, batch and


federation
Centralized, native in-memory services in unified
framework

Adapter
Framework

SAP HANA EIM Adapters and Adapter SDK


Delivers adapters for common sources
Real time-capable adapters ensure transactional
consistency and guaranteed delivery

SAP HANA In Memory


Platform

3rd party DBMSs: Oracle, MSSQL and DB2

Current
Data

EIM Services
SDQ

SAP ECC1 on top of the above DBMS

SDI

Twitter

Integrated ETL, Replication and SDA


TCP/IP or HTTPS

SDI Agent

OData, Hadoop, Flat file

SDI Adapter SDK for customers and partners to


build new adapters

Adapter
Framework

ECC Adapters
Adapters for 3rd party DB

RDBMS

HIVE Adapter
Twitter, Flat file..

Twitt
er

Architected for both on premise and cloud

On Premise and Cloud sources of


Data

Batch adapters

HANA DP agent communication occurs without


needing to open a port in the on premise firewall
1. Minimum supported version is ECC6; Details on NetWeaver version dependency in PAM
2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

SAP HANA EIM


Extending HANA by integrating real time delivery mode
SDI provides real time mode to replicate sources
On selected sources with change data capture (CDC) capability
Leverages proven SAP Replication Server technology

Provides Transactional Integrity for real time


By listening to changes in the DBMS transaction logs and only replicating committed changes

Provides batch pull mode for all types of sources


Extends HANAs federation technology (SDA)
Can define SDA virtual tables for any remote table read through any SDI adapters
SDI adapters can access data outside enterprise firewall

Provides Guaranteed Delivery for real-time replication where possible


SDI can resume processing if replication stream is halted or disrupted
SDI can continue to operate during a temporary absence of the HANA target

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

Innovate
SAP HANA Smart Data Quality
Simplifies the landscape, Reduces data latency and delivers breakthrough performance

Extends HANA platform by integrating data


quality natively into HANA

Provides simple user interface

Cleanse person, firm and address data


Geocoding to enrich address data with latitude and
longitude information

Parse, standardize, validate, correct and enhance


person, firm, address in ONE transformation in HANA

Breakthrough performance with in-memory


data quality services

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

SAP HANA EIM


Available transformations SDI and SDQ
Basic SQL
Filter, Join, Union, Sort

Aggregation SQL
Aggregation, lookup, sort, case, and pivot/unpivot

Addressing the data movement lifecycle


Row generation, date generation, table comparison, map, and history preserving

Executing code
Procedure, AFL function

Transformations enriching data

Cleanse
Parse, standardize and enrich person, title, phone, firm, email and address information within a specified input source.

Geocode
Enrich address data with associated latitude and longitude information
2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

10

Planned Innovation

Offering: Smart Data Preparation

POWERED BY HANA AVAILABLE ON-CLOUD AND ON-PREMISE FOR BUSINESS, DATA STEWARDS AND IT
Discover

Define

Search, explore,
profile local, ungoverned and enterprise datasets

Collaborate with business users to define shared rules,


business terms, policies and ownership

smart
data preparation

Prepare
Refine, merge,
cleanse, enrich, analyze datasets

Assess & Improve


Data quality and proactive governance

IT
Business

Data
Steward

Govern Access

Share

Manage access to enterprise datasets, semantics


and collective knowledgebase

Share and reuse datasets

Automate
Proactively deliver frequently used datasets

Monitor
Track dataset usage and data prep. actions, Anticipate demand

Increase business agility by enabling self-service data preparation for the business
Simplifies information governance for IT
Enable data stewards to collaborate, assess, define, monitor, remediate and improve data quality

2014 SAP SE or an SAP affiliate company. All rights reserved.

Business
IT
Data Steward
Public

11

SAP HANA EIM Architecture


On premise
HANA WebIDE

HANA Studio

http(s)

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

12

SAP HANA EIM Architecture


Cloud Ready
HANA WebIDE

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

13

SAP HANA EIM Roadmap


EIM services available natively in HANA & Consumed by HANA Applications
SAP HANA smart data integration (Modeling and Monitoring)
Real time from Suite and leading 3rd party databases
SAP Cloud Applications (e.g. SFSF)
oData , Hive and flat file adapters
Adapter SDK
Transformations
SAP HANA smart data quality
Cleanse, Geocoding

Planned Innovation
SAP HANA smart data integration
Design-time support of browser-based
Bi-directional replication (distribution)
New adapters (e.g. Teradata, Sybase ASE)
Enterprise readiness (monitoring, failover,
scheduling)
Delta support for consuming app (BW)
DDL schema replication

EIM
Application

SAP HANA smart data quality


Semantic profiling
Data quality assessment
Match, Best Record
smart data preparation* (Customer Preview)
Interactively discover, search, manipulate, profile,
cleanse and share datasets
Data usage monitoring analytics
Operationalization by IT (on premise only)

Applications
consuming EIM
services in HANA

EIM Services in HANA

HANA SPS09+ (Available now)

Data Quality Management for SAP SP06

* Final name TBD

2011 SAP AG. All rights reserved.

smart data preparation and stewardship


Additional connectivity
Stewardship capabilities (collaborate, assess,
monitor data quality and remediate)

LoB Applications & Analytics


Cloud for Planning
Master Data Governance
Customer Engagement Intelligence
Operational Process Intelligence
SAP BW
SAP Lumira
Demand Signal

Future Direction (Planned)


SAP HANA smart data integration
Non-relational data sources (e.g.
hierarchical/ semi structured data)
SAP extractor support
Cloud app connectivity
Task/workflow orchestration
SAP HANA smart data quality
Support additional domains
Policy Management and Advanced rules
Custom cleansing rules
Enrichment (e.g. D&B)
smart data preparation and stewardship
Syndicated data sets load and merging
Additional domains

Partner Applications

This is the current state of planning and may be changed by SAP at any time.

Internal

14

Intel

Legal Notices
INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR
OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS
OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING
TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE,
MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.
A "Mission Critical Application" is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD
YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS
SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS
COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT
LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS
SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS.
Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of
any features or instructions marked "reserved" or "undefined". Intel reserves these for future definition and shall have no responsibility whatsoever for
conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this
information.

The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published
specifications. Current characterized errata are available on request.
Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or
go to: http://www.intel.com/design/literature.htm%20
Intel, the Intel logo, Itanium, and Xeon are trademarks of Intel Corporation in the U.S. and/or other countries.
Other names and brands may be claimed as the property of others.

16

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

16

Intel Corporation

The Worlds Largest Semiconductor Manufacturer

Leading Manufacturer of Computer, Networking & Communications Products


Founded by Gordon Moore and Robert Noyce in 1968
Headquartered in Santa Clara, California
$52.7B in Annual Revenues - 25+ Consecutive Years of Positive Net Income
More than 160 Sites in 68 Countries

Over 106,000 Employees 82,000 technical roles, 10,000 Masters in Science,


6,000 PhDs, 4,500 MBAs

12th Most Valuable Brands in the World by Interbrand


Ranked #12 on Forbes Worlds Most Reputable Companies
Largest Voluntary Purchaser of Green Power in the United States since 2008
Invests $100 Million Each Year in Education Across More than 100 Countries
4 Million Hours of Volunteer Service toward improving education over the past
decade

Intel Confidential for internal use only

INTEL INFORMATION TECHNOLOGY

17

2014 Intel IT Vital Statistics

>6,065 IT employees
50 global IT sites

>106,000 Intel employees1


170 Intel sites in 66 Countries

61 Data Centers

(91 Data Centers in 2010)


80% of servers virtualized
(42% virtualized in 2010)

>119,000+ Devices

100% of laptops encrypted


100% of laptops with SSDs
>53,700 handheld devices
164 mobile applications developed
Source: Information provided by Intel IT as of Jan 2015
1Total

employee count does not include wholly owned subsidiaries that


Intel IT does not directly support
Copyright
2015,
Intel Corporation.
Allreserved.
rights reserved.
Copyright
2015,Intel
Corporation.
All rights

18

Agenda

To be landscape

Current HANA DI options

Proposed HANA DI options

Anticipated benefits from CUV

Opens from CUV

2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

19

To be BI Landscape
*DSA: Downstream
Applications

Reporting

Extractors
(Batch)
BW Schema

Near real tme and Batch)

HANA

HANA
Models

Traditional BI

SAP NetWeaver &


Business Suites

Apps
Apps
DSA

Enterprise
Data Ware
House

Non-SAP
Other Sources

Advanced Analytics
& BIG data

Log
Social

Click

XLS

Unstructured Content

Intel Information Technology

2014 SAP SE or an SAP affiliate company. All rights reserved.


Intel Confidential for internal use

Big Data
Adv.
Analytics,
Visualizations

PAP
PAP

20

Public

20

Current DI options for HANA


Landscape
Transformation

Trigger Based
Near real
Time

ODBC/
JDBC

ODBC

Queries

User Queries,
Applications

ODBC/JDBC

(SLT)
SAP Biz.Suites
(ECC, CRM, SCM,
MDG)

SAP Live Cache


Replication
(LCR)

SCM AP0

XML
ETL Batch

Enterprise Data
Warehouse

OHD

ETL Tool
Files

BW Extractors

(Files, XML,
Spreadsheets...)

API
ETL Batch

(BWE)
Extractors
Batch

Direct Extractor
Connection
Cloud/Web Services

http

ETL Batch

Data Services

Non-SAP Data
Targets

ETL Batch

Data Services

(DXC)

Big Data

ETL Batch

(DBs, Files...)

(DS)
Messages/Services

RFC
http

(DS)
Non-SAP Data
Sources
ODBC/
JDBC

Smart Data Access


Data
Virtualization

Intel Information Technology

(SDA)

2014 SAP SE or an SAP affiliate company. All rights reserved.


Intel Confidential for internal use

Queries

Data Virtualization

21

Cloud/Web Services
Queries

Public

21

Proposed DI options for HANA

ODBC/
JDBC

SAP Biz.Suites
(ECC, CRM, SCM,
MDG)

Queries

User Queries,
Applications

ODBC/JDBC

Log/Trigger
Based
Near real Time
Extractors
Batch

ETL Batch

HANA EIM

Files
(Files, XML,
Spreadsheets...)
ETL Batch

OHD/
ETL Batch

ETL Tools
Example
Data Services
(DS)

ETL Batch

ETL Batch

Cloud/Web Services

Enterprise Data
Warehouse

Big Data

Non-SAP Data
Targets
(DBs, Files...)

Messages/Services
http

Non-SAP Data
Sources
Data
Virtualization

Intel Information Technology

2014 SAP SE or an SAP affiliate company. All rights reserved.


Intel Confidential for internal use

Queries

ODBC/
JDBC

Data Virtualization

22

Cloud/Web Services
Queries

Public

22

Anticipated benefits

Simplified Data integration landscape

Less infrastructure & tools to support

Integrated data profiling & quality

Better reliability & recoverability

Intel Information Technology

2014 SAP SE or an SAP affiliate company. All rights reserved.


Intel Confidential for internal use

23

Public

23

Opens from CUV

Impact on shared HANA platform (On-premises and cloud)

Troubleshooting and supportability (Traces, logs, recoverability,


restartability)

Extracting data out from HANA

Near Real Time from Non-SAP sources/Databases

Lifecycle management/Transport capabilities

Scheduling and monitoring capabilities

Intel Information Technology

2014 SAP SE or an SAP affiliate company. All rights reserved.


Intel Confidential for internal use

24

Public

24

Sharing Intel IT Best Practices


With the World

Learn more about Intel ITs Initiatives at

www.intel.com/IT
Copyright 2015, Intel Corporation. All rights reserved.

Summary of Value Proposition

SAP HANA EIM Value Proposition


Lower TCO

Open & Extensible

Real-time

Simplified Landscape, Integrated


modeling environment

Open framework
Data any style, shape and size-

Ability to replicate and transform


data in real time

SAP and non-SAP


On premise and cloud sources

Transactional consistency and


guaranteed delivery1

Single product covering multiple


use cases

Deploy on premise or on cloud

2014 SAP SE or an SAP affiliate company. All rights reserved.

Breakthrough Performance
(natively built in HANA)
Public

27

Further Information
HANA Academy Videos
YouTube Playlist
https://www.youtube.com/playlist?list=PLkzo92owKnVwQ_preA3cxlQjn_v3W0Eh5
SPS09 Whats new Playlist
https://www.youtube.com/playlist?list=PLkzo92owKnVwADqaEp2-YhXFRKDVoUQNL
Help Documentation

http://help.sap.com/hana_options_eim
Contact Information
Subha.ramachandran@sap.com
Yatish.k.goel@intel.com

SESSION 1155
2014 SAP SE or an SAP affiliate company. All rights reserved.

Public

28