Beruflich Dokumente
Kultur Dokumente
Hortonworks DataFlow
www.hortonworks.com
2016 Hortonworks
Contents
Data Collection
Real Time Decisions
Operational Efficiency
Security and Provenance
Bi-directional DataFlow
About Hortonworks
12
A single integrated platform for data acquisition, simple event processing, transport and delivery
mechanism from source to storage.
Hortonworks DataFlow enables the real time collection and
processing of perishable insights.
ENRICH CONTENT
PERISHABLE
INSIGHTS
HISTORICAL
INSIGHTS
Hortonworks DataFlow is designed to securely collect and transport data from highly diverse data sources be they big or
small, fast or slow, always connected or intermittently available.
Figure 1: Hortonworks DataFlow
DataFlow was designed inherently to meet the practical challenges of collecting data from a wide range of disparate data sources;
securely, efficiently and over a geographically disperse and possibly fragmented network. Because the NSA encountered many of the
issues enterprises are facing now, this technology has been field-proven with built-in capabilities for security, scalability, integration,
reliability and extensibility and has a proven track record for operational usability and deployment.
HORTONWORKS
DATAFLOW
ENABLES
ENTERPRISES
TO
HORTONWORKS DATAFLOW
ENABLES
ENTERPRISES
TO
Leverage Operational Efficiency
evolving dataflows
for coding
bi-directional dataflows
DATA COLLECTION
OPERATIONAL EFFICIENCY
data as needed
BI-DIRECTIONAL DATAFLOW
(data provenance)
endpoint capacity
Hortonworks DataFlow accelerates time to insight by securely enabling off-the shelf, flow based
programming for big data infrastructure and simplifying the current complexity of secure data
acquisition, ingestion and real time analysis of distributed, disparate data sources.
An ideal framework for collection of data and management of dataflows, the most popular uses
of Hortonworks DataFlow are for: simplified, streamlined big data ingest, increased security for
collection and sharing of data with high fidelity chain of custody metadata and as the underlying
infrastructure for the internet of things.
UI
UI
Figure 3: Current big data ingest solutions are complex and operationally inefficient
WHAT IS DATA
PROVENANCE?
Provenance is defined as
the place of origin or earliest
known history of some thing.
In the context of a dataflow,
data provenance is the ability
to trace the path of a piece of
data within a dataflow from
its place of creation, through
to its final destination. Within
Hortonworks DataFlow data
provenance provides the ability
to visually verify where data
came from, how it was used,
who viewed it, whether it was
sent, copied, transformed
or received. Any system or
person who came in contact
with a specific piece of data is
captured in its completeness
in terms of time, date, action,
precedents and dependents for
a complete picture of the chain
of custody for that precise
piece of data within a dataflow.
This provenance metadata
information is used to support
data sharing compliance
requirements as well as for
dataflow troubleshooting and
optimization.
Figure 4: Secure from source to storage with high fidelity data provenance
10
11
About Hortonworks
Hortonworks is a leading innovator at creating, distributing and supporting enterprise-ready open data platforms. Our mission is to
manage the worlds data. We have a single-minded focus on driving innovation in open source communities such as Apache Hadoop, NiFi,
and Spark. Our open Connected Data Platforms power Modern Data Applications that deliver actionable intelligence from all data: datain-motion and data-at-rest. Along with our 1600+ partners, we provide the expertise, training and services that allows our customers to
unlock the transformational value of data across any line of business. We are Powering the Future of Data.
Contact
For further information visit
For more information,
visit www.hortonworks.com.
+1 408 675-0983
+1 855 8-HORTON
INTL: +44 (0) 20 3826 1405
12
2011-2016 Hortonworks Inc. All Rights Reserved.
Privacy Policy | Terms of Service
A16_023