Sie sind auf Seite 1von 29

Welcome to:

Unit 1 - Introduction to Big Data Analytics

© Copyright IBM Corporation 2015 9.1


Unit Objectives IBM ICE (Innovation Centre for Education)
IBM Power Systems

After completing this unit, you should be able to:


• What is ‘Big Data’
• The three V’s of ‘Big Data’
• The Importance of Big Data
• The Risks of Big Data
• The Need for Big Data
• The Structure Of Big Data
• The Need for Standards
• What is Big Data Analytics
• Big Data Analytics Adoption Structure
• Benefits of Big Data Analytics
• Barriers to Big Data Analytics
• Trends for Big Data Analytics

© Copyright IBM Corporation 2015


What is Big Data? IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Big Data is a general term used to describe the voluminous amount


of unstructured and semi-structured data.

• Processing and loading big data into a relational database for


analysis will take too much time and would cost too much money.

• Big Data term is often used when speaking about


Petabytes and Exabytes of data.

• A primary goal for looking at big data is to discover repeatable


business patterns.

• It’s generally accepted that unstructured data, most of it located in


text files, accounts for at least 80% of an organization’s data
© Copyright IBM Corporation 2015
Three V’s of Big Data IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Volume
– Volume indicates the amount of data for analysis.
– characteristic most associated with big data, volume refers to the mass
quantities of data.
– Data volumes continue to increase at an unprecedented rate.
• Variety
– Variety is about managing the complexity of multiple data types, including
structured, semi-structured and unstructured data.
– Organizations need to integrate and analyze data from a complex array of
both traditional and non-traditional information sources.
– Explosion of sensors, smart devices and social collaboration
technologies, generates data in countless forms like text, web data,
tweets, sensor data, audio, video and more.

© Copyright IBM Corporation 2015


Three V’s of Big Data (Cont.) IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Velocity
– Data in motion
– The speed at which data is created, processed and analyzed continues to
accelerate.
– Velocity impacts latency – the lag time between when data is created or
captured, and when it is accessible.
– Data is continually being generated at a pace that is impossible for
traditional systems to capture, store and analyze.

© Copyright IBM Corporation 2015


Three V’s of Big Data (Cont.) IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


Need for Big Data IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Big Data can unlock significant value by making information


transparent
• As organizations create and store more transactional data in digital
form
• Big Data allows ever-narrower segmentation of customers and
therefore much more precisely tailored products or services
• Sophisticated analytics can substantially improve decision-making,
minimize risks, and unearth valuable insights that would otherwise
remain hidden
• Big Data can be used to develop the next generation of products and
services

© Copyright IBM Corporation 2015


Benefits & Barrier of Big Data Analytics IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Benefits of Big Data Analytics


– Anything involving customers could benefit from big data analytics
– Business intelligence in general can benefit from big data analytics
– Specific analytic applications are likely beneficiaries of big data analytics

• Barriers to Big Data Analytics


– Inadequate staffing and skills are the leading barriers to big data analytics
– A lack of business support can hinder a big data analytics program
– Problems with database software can be barriers to big data analytics

© Copyright IBM Corporation 2015


Intrinsic Property of Data…it grows IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


A Growing Interconnected and
Instrumental World IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


Characteristics of Big Data IBM ICE (Innovation Centre for Education)
IBM Power Systems

• V4 = Volume Velocity Variety Veracity

© Copyright IBM Corporation 2015


Commoditization of Hardware Enabling New
Analytics IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Low cost compute platform


– 1 petabyte Hadoop cluster for approx $1 million
– Hadoop architecture
• Optimized for high data volumes
– Clusters of affordable machines running a Distributed File System (HDFS)
and MapReduce processing
• Hardware failure is expected and managed
• Hardware Appliance
– Up and Running with new cluster in hours
• Cloud
– Up and Running with new cluster in minutes
– Pay what you use

© Copyright IBM Corporation 2015


The 5 Key Big Data Use Cases IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


More Ways – Wide Ranging Analytics and
Techniques IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


Big Data and Complexity in Health
Care IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Medical information
is doubling every 5
years, much of
which is
unstructured
• 81% of physicians
report spending 5
hours or less per
month reading
medical journals
“Medicine has become too complex (and only) about 20 percent of the knowledge clinicians use
today is evidence-based”
– Steven Shapiro, Chief Medical and Scientific Officer, UPMC
…to keep up with the state of the art, a doctor would have to devote 160 hours a week to
perusing papers…”
– The Economist Feb 14th 2013
© Copyright IBM Corporation 2015
Big Data Platform and Application
Frameworks IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


An Example of Big Data Platform in
Practice IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


A Big Data Platform Manifesto IBM ICE (Innovation Centre for Education)
IBM Power Systems

© Copyright IBM Corporation 2015


Use Cases for a Big Data Platform IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Financial services
– Problem:
• Manage the several Petabytes of data which is growing at 40-100% per year
under increasing pressure to prevent frauds and complain to regulations.
– How big data analytics can help:
• Fraud detection
• Risk management
• 360°View of the Customer

© Copyright IBM Corporation 2015


Use Cases for a Big Data Platform IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Telecommunication services
– Problem:
• Legacy systems are used to gain insights from internally generated data
facing issues of high storage costs, long data loading time, and long
administration process.
– How big data analytics can help:
• CDR processing
• Churn prediction
• Geomapping / marketing
• Network monitoring

© Copyright IBM Corporation 2015


Use Cases for a Big Data Platform IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Transportation services
– Problem:
• Traffic congestion has been increasing worldwide as a result of
increased urbanization and population growth reducing the
efficiency of transportation infrastructure and increasing travel
time and fuel consumption.
– How big data analytics can help:
• Real time analysis to weather and traffic congestion data streams
to identify traffic patterns reducing transportation costs.

© Copyright IBM Corporation 2015


Use Cases for a Big Data Platform IBM ICE (Innovation Centre for Education)
IBM Power Systems

• Healthcare and Life Sciences


– Problem:
• Vast quantities of real-time information are starting to come from
wireless monitoring devices that postoperative patients and those
with chronic diseases are wearing at home and in their daily
lives.
– How big data analytics can help:
• Epidemic early warning
• Intensive Care Unit and remote monitoring

© Copyright IBM Corporation 2015


Checkpoint (1 of 3) IBM ICE (Innovation Centre for Education)
IBM Power Systems

1. Big data generally refers to


a. Voluminous structured data
b. Voluminous unstructured data
c. Voluminous semi structured data
d. Both b & c

2. The three V’s in Big data refers to


a. Velocity
b. Volume
c. Variety
d. All the above

© Copyright IBM Corporation 2015


Checkpoint solution (1 of 3) IBM ICE (Innovation Centre for Education)
IBM Power Systems

1. Big data generally refers to


a. Voluminous structured data
b. Voluminous unstructured data
c. Voluminous semi structured data
d. Both b & c

2. The three V’s in Big data refers to


a. Velocity
b. Volume
c. Variety
d. All the above

© Copyright IBM Corporation 2015


Checkpoint (2 of 3) IBM ICE (Innovation Centre for Education)
IBM Power Systems

3. Big Data Analytics Adoption structure is


a. Educate ->Explore -> Engage -> Execute
b. Explore -> Educate -> Engage -> Execute
c. Explore -> Engage -> Execute
d. Explore -> Educate -> Execute -> Engage

4. Benefits of Big data includes the following


a. Analytics
b. Business Intelligence
c. Handling Volumes and data
d. All the above

© Copyright IBM Corporation 2015


Checkpoint solution (2 of 3) IBM ICE (Innovation Centre for Education)
IBM Power Systems

3. Big Data Analytics Adoption structure is


a. Educate ->Explore -> Engage -> Execute
b. Explore -> Educate -> Engage -> Execute
c. Explore -> Engage -> Execute
d. Explore -> Educate -> Execute -> Engage

4. Benefits of Big data includes the following


a. Analytics
b. Business Inteligence
c. Handling Volumes and data
d. All the above

© Copyright IBM Corporation 2015


Checkpoint (3 of 3) IBM ICE (Innovation Centre for Education)
IBM Power Systems

5. The barriers to Big Data includes


a. Lack of skilled staff
b. Lack of Sufficient Knowledge
c. Lack of software to handle the volume of data
d. Both A & B

© Copyright IBM Corporation 2015


Checkpoint solution (3 of 3) IBM ICE (Innovation Centre for Education)
IBM Power Systems

5. The barriers to Big Data includes


a. Lack of skilled staff
b. Lack of Sufficient Knowledge
c. Lack of software to handle the volume of data
d. Both A & B

© Copyright IBM Corporation 2015


Unit Summary IBM ICE (Innovation Centre for Education)
IBM Power Systems

Having completed this unit, you should be able to:


• What is ‘Big Data’
• The three V’s of ‘Big Data’
• The Importance of Big Data
• The Risks of Big Data
• The Need for Big Data
• The Structure Of Big Data
• The Need for Standards
• What is Big Data Analytics
• Big Data Analytics Adoption Structure
• Benefits of Big Data Analytics
• Barriers to Big Data Analytics
• Trends for Big Data Analytics

© Copyright IBM Corporation 2015

Das könnte Ihnen auch gefallen