Beruflich Dokumente
Kultur Dokumente
YARN SPINS
NEW FLEXIBILITY
MORE TO IT
THAN MORE NODES
AVOID DASHED
EXPECTATIONS
NEED FOR
ANALYTICS SPEED
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
Organizations must be
prepared for frequent
changesmost immediately
in the Hadoop 2 release.
But Cosentino added in an email
that implementations of the existing
Hadoop architecture are restricted
by its batch-processing orientation,
which makes it more akin to a truck
than a sports car on performance.
Hadoop is ideally suited where
time latency is not an issue and
where significant amounts of data
need to be processed, he said.
In its HDFS-MapReduce configuration, Hadoop is very good
at analysis of very large, static
THE INS AND OUTS OF HARNESSING HADOOP
INTRODUCTION
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
and staging areas for the data gushing into organizations. In such cases,
data can be pared down by MapReduce, transformed into or summarized in a relational structure and
moved along to an enterprise data
warehouse or data marts for analysis by business users and analytics
professionals. That approach also
provides increased flexibility: The
raw data can be kept in a Hadoop
system and modeled for analysis
as needed, using extract, load and
transform processes.
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
No
current
plans to
add
29%
Plan to
add
9%
In use
now
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
Hadoop isnt the first choice among the top technologies being used by
organizations to support their big data environments.
55%
52%
46%
41%
BASED ON RESPONSES FROM 222 IT, BUSINESS INTELLIGENCE, ANALYTICS AND BUSINESS PROFESSIONALS IN ORGANIZATIONS WITH ACTIVE OR
PLANNED BIG DATA MANAGEMENT AND ANALYTICS PROGRAMS; RESPONDENTS WERE ASKED TO CHOOSE ALL TECHNOLOGIES THAT APPLIED;
SOURCE: TECHTARGETS 2013 ANALYTICS & DATA WAREHOUSING READER SURVEY
AVOID DASHED
EXPECTATIONS
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
For example, RAID Level 0 striping of data across a disk array, typically turned on by default in Hadoop
systems, can shackle I/O speeds
to the rate of the slowest drive in
an array. In addition, a single disk
failure can take down an entire
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
Hadoop initially was the province of the large Internet companies that created it, and the likes of eBay, Facebook, LinkedIn, Twitter and Yahoo remain
marquee users of the technology. But the number of other types of organizations that are looking to ride the Hadoop surge is growing.
NASA is using Hadoop to make climate data available via a cloud-based service to researchers outside its walls. To help predict and improve crop yields,
agricultural and chemical company Monsanto is loading geospatial data from
internal and external sources into Hadoop for processing and then moving the
files to HBase, its companion NoSQL database, for analysis. Data storage technology vendor NetApp uses Hadoop to cull log data from sensors to monitor
the performance of its equipment at customer sites. Telecom service provider
China Mobile Group Guangdong built a Hadoop-based system to support online bill payments and provide new data analytics capabilities internally.
Marketing and advertising analysis is another common application for Hadoop
and related big data technologies. Edmunds.com, which publishes automobile
pricing data and vehicle reviews online, deployed a combination of Hadoop
and HBase to help business analysts fine-tune its paid-search marketing and
keyword bidding processes. Retailer Kohls plans to use Hadoop to enable
business users to analyze store and website data. And Luminar, a company
that analyzes data about Hispanic consumers in the U.S. for retailers, manufacturers and other clients, replaced a traditional data warehouse with a Hadoop system to power its analytical modeling.
But Colin White, president of consultancy BI Research, thinks Hadoop has the
potential to spark new and innovative applications, not just to step in and take
the place of traditional systems. My concern is in seeing Hadoop used for a
bunch of workloads in which it is reinventing the wheel, he said. Id rather
see it moving in the direction of solving problems we havent solved.
JACK VAUGHAN
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
MORE TO IT THAN
MORE NODES
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
NEED FOR
ANALYTICS SPEED
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
10
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
11
HOME
YARN SPINS
NEW FLEXIBILITY
AVOID DASHED
EXPECTATIONS
MORE TO IT
THAN MORE
NODES
NEED FOR
ANALYTICS
SPEED
SearchBusinessAnalytics.com;
in that position, he covers
business intelligence, analytics and data visualization
technologies and topics. He
previously was a news writer
for TechTargets SearchHealthIT.com website,
and he has also written for a variety of daily and
weekly newspapers in eastern Massachusetts.
Email him at eburns@techtarget.com.
12