Big Data

Hochgeladen von

asvi1010

0% fanden dieses Dokument nützlich (0 Abstimmungen)

99 Ansichten24 Seiten

About big data and its application

Copyright

Verfügbare Formate

PPTX, PDF, TXT oder online auf Scribd lesen

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Dieses Dokument melden

About big data and its application

Copyright:

Verfügbare Formate

Als PPTX, PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

0% fanden dieses Dokument nützlich (0 Abstimmungen)

99 Ansichten24 Seiten

Big Data

Hochgeladen von

asvi1010

About big data and its application

Copyright:

Verfügbare Formate

Als PPTX, PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

Zu Seite

Sie sind auf Seite 1von 24

Im Dokument suchen

Big Data Analytics

A crash course
What is Big Data
Large and complex datasets
Structured, semi-structured or unstructured
Typically does not fit in memory to be
processed
Distributed storage structure
3Vs of Big Data
Velocity
Volume
Variety
Velocity
Low latency real-time speed
Examples
Telephone call records
Social media
Retail sales

Volume
Size of dataset
KB, MB, GB, TB, PB
Facebook
40 PB of data
100 TB/day
Twitter
8 TB/day
Yahoo
60 PB of data
Big Data size varies from company to company
Variety
Text
Audio
Video
Photos
Documents
Big Data Stack
Physical Infrastructure
Hardware & Network
Performance
Availability
Resilient & redundant
Scalability
Flexibility
Cost

Security Infrastructure
Data access
Application Access
Data encryption
Threat detection
Cloud and Big Data
IaaS Amazon EC2
PaaS Heruku, Pagodabox
SaaS GotoMeeting, SalesForce
DaaS Amazon
Major Providers
Cloudera, Amazon, Azure, Google, Openstack
Databases
Organizing Data Services
Distributed File System
Serialization & Coordination
ETL Tools
Workflow
Big Data Applications
Log Data Applications
Splunk, Loggly
Ad/Media Applications
Bluefin, DataXu
Marketing Applications
Bloomreach, Myrrix
Apache Hadoop
Open source framework for processing and
querying vast amounts of data on large
clusters of commodity hardware
Enterprise-ready cloud computing technology
Industry standard for Big Data
Jave based but abstractions available for
various languages
Concurrency, Scalability, Reliability
HDFS
Hadoop Distributed File System
File system to store large datasets
Blocks of 64 MB instead of 4-32 KB
Optimized for throughput over latency
High availability through replication instead of
redundancy
Optimized for read-many and write-once
DataNode and NameNode
MapReduce
Data processing paradigm
How data will input (Map)
How data will output (Reduce)
Works with arbitrarily large datasets
Integrates tightly with HDFS
Parallel processing
Divide and conquer
Key-value pair instead of RDBMS Schemas
Job tracker and task tracker
Other components
Mahout Machine learning
Pig High level language for interacting with
Hadoop
Hive Data warehousing
HBase Distributed, column-oriented DB
Sqoop SQL to Hadoop and vice versa
Ambari Web based Hadoop cluster
management
R + Hadoop
Hadoop for data storage, computation power
R for advanced analytics, visualization, data
loading
Cloud based
RHadoop
Data mining with R
Regression
lm
Classification
glm, ksvm, svm, randomforest, glmnet
Clustering
knn, kmeans, dist, pvclust, Mclust
Recommendation
recommenderlab
Hadoop
Linux based
Cloudera based
Java required
Singlenode or multinode

RHIPE
R and Hadoop Integrated Programming
Environment
Divide and Recombine Technique
RHadoop
Revolution Analytics
Rhdfs
Rmr
Rhbase

MapReduce in R
Real Time Data Streaming
IBM Infosphere
Twitter Storm
Apache S4 (Simple Scalable Streaming System)
Data Management at Enterprise

Das könnte Ihnen auch gefallen

Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Von Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Bewertung: 4 von 5 Sternen
4/5 (895)
Never Split the Difference: Negotiating As If Your Life Depended On It
Von Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Bewertung: 4.5 von 5 Sternen
4.5/5 (838)
The Yellow House: A Memoir (2019 National Book Award Winner)
Von Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Bewertung: 4 von 5 Sternen
4/5 (98)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Von Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Bewertung: 4 von 5 Sternen
4/5 (5794)
Shoe Dog: A Memoir by the Creator of Nike
Von Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Bewertung: 4.5 von 5 Sternen
4.5/5 (537)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Von Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Bewertung: 4.5 von 5 Sternen
4.5/5 (266)
The Little Book of Hygge: Danish Secrets to Happy Living
Von Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Bewertung: 3.5 von 5 Sternen
3.5/5 (400)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Von Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Bewertung: 4.5 von 5 Sternen
4.5/5 (474)
Yes Please
Von Everand
Yes Please
Amy Poehler
Bewertung: 4 von 5 Sternen
4/5 (1891)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Von Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Bewertung: 3.5 von 5 Sternen
3.5/5 (231)
Grit: The Power of Passion and Perseverance
Von Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Bewertung: 4 von 5 Sternen
4/5 (588)
The Emperor of All Maladies: A Biography of Cancer
Von Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Bewertung: 4.5 von 5 Sternen
4.5/5 (271)
The Unwinding: An Inner History of the New America
Von Everand
The Unwinding: An Inner History of the New America
George Packer
Bewertung: 4 von 5 Sternen
4/5 (45)
On Fire: The (Burning) Case for a Green New Deal
Von Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Bewertung: 4 von 5 Sternen
4/5 (74)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Von Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Bewertung: 4.5 von 5 Sternen
4.5/5 (345)
Team of Rivals: The Political Genius of Abraham Lincoln
Von Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Bewertung: 4.5 von 5 Sternen
4.5/5 (234)
Principles: Life and Work
Von Everand
Principles: Life and Work
Ray Dalio
Bewertung: 4 von 5 Sternen
4/5 (599)
Angela's Ashes: A Memoir
Von Everand
Angela's Ashes: A Memoir
Frank McCourt
Bewertung: 4.5 von 5 Sternen
4.5/5 (440)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Von Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Bewertung: 4 von 5 Sternen
4/5 (1090)
Fear: Trump in the White House
Von Everand
Fear: Trump in the White House
Bob Woodward
Bewertung: 3.5 von 5 Sternen
3.5/5 (738)
Steve Jobs
Von Everand
Steve Jobs
Walter Isaacson
Bewertung: 4.5 von 5 Sternen
4.5/5 (806)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Von Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Bewertung: 3.5 von 5 Sternen
3.5/5 (2259)
Rise of ISIS: A Threat We Can't Ignore
Von Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Bewertung: 3.5 von 5 Sternen
3.5/5 (137)
John Adams
Von Everand
John Adams
David McCullough
Bewertung: 4.5 von 5 Sternen
4.5/5 (2409)
The Glass Castle: A Memoir
Von Everand
The Glass Castle: A Memoir
Jeannette Walls
Bewertung: 4.5 von 5 Sternen
4.5/5 (1713)
Bad Feminist: Essays
Von Everand
Bad Feminist: Essays
Roxane Gay
Bewertung: 4 von 5 Sternen
4/5 (1016)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Von Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Bewertung: 4.5 von 5 Sternen
4.5/5 (121)
The Outsider: A Novel
Von Everand
The Outsider: A Novel
Stephen King
Bewertung: 4 von 5 Sternen
4/5 (1839)
The Light Between Oceans: A Novel
Von Everand
The Light Between Oceans: A Novel
M.L. Stedman
Bewertung: 4.5 von 5 Sternen
4.5/5 (789)
The Woman in Cabin 10
Von Everand
The Woman in Cabin 10
Ruth Ware
Bewertung: 3.5 von 5 Sternen
3.5/5 (2322)
A Man Called Ove: A Novel
Von Everand
A Man Called Ove: A Novel
Fredrik Backman
Bewertung: 4.5 von 5 Sternen
4.5/5 (4609)
Wolf Hall: A Novel
Von Everand
Wolf Hall: A Novel
Hilary Mantel
Bewertung: 4 von 5 Sternen
4/5 (3811)
Manhattan Beach: A Novel
Von Everand
Manhattan Beach: A Novel
Jennifer Egan
Bewertung: 3.5 von 5 Sternen
3.5/5 (792)
Brooklyn: A Novel
Von Everand
Brooklyn: A Novel
Colm Toibin
Bewertung: 3.5 von 5 Sternen
3.5/5 (1937)
The Perks of Being a Wallflower
Von Everand
The Perks of Being a Wallflower
Stephen Chbosky
Bewertung: 4.5 von 5 Sternen
4.5/5 (2104)
The Art of Racing in the Rain: A Novel
Von Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Bewertung: 4 von 5 Sternen
4/5 (4200)
Little Women
Von Everand
Little Women
Louisa May Alcott
Bewertung: 4 von 5 Sternen
4/5 (104)
A Tree Grows in Brooklyn
Von Everand
A Tree Grows in Brooklyn
Betty Smith
Bewertung: 4.5 von 5 Sternen
4.5/5 (1929)
The Constant Gardener: A Novel
Von Everand
The Constant Gardener: A Novel
John le Carré
Bewertung: 3.5 von 5 Sternen
3.5/5 (104)
Sing, Unburied, Sing: A Novel
Von Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Bewertung: 4 von 5 Sternen
4/5 (1103)
Her Body and Other Parties: Stories
Von Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Bewertung: 4 von 5 Sternen
4/5 (821)
BCMT Module 5 - Monitoring and Evaluating Tech4ED Centers
Dokument30 Seiten
BCMT Module 5 - Monitoring and Evaluating Tech4ED Centers
Cabaluay NHS
0% (1)
List Object
Dokument60 Seiten
List Object
CRA Esuela Claudio Arrau
Noch keine Bewertungen
Write About Global Catalog. How To View Replication Properties For AD Properties
Dokument19 Seiten
Write About Global Catalog. How To View Replication Properties For AD Properties
Chandan Kumar
Noch keine Bewertungen
IEEE Authorship Webinar
Dokument1 Seite
IEEE Authorship Webinar
Anonymous PxkbBVw
Noch keine Bewertungen
Week 2: Introduction To Discrete-Time Stochastic Processes: 15.455x Mathematical Methods of Quantitative Finance
Dokument58 Seiten
Week 2: Introduction To Discrete-Time Stochastic Processes: 15.455x Mathematical Methods of Quantitative Finance
Bruné
100% (1)
Premier Integration With Powerflex Drives - Presentation
Dokument42 Seiten
Premier Integration With Powerflex Drives - Presentation
Ron Bentley
Noch keine Bewertungen
Book PDF Reliability Maintainability and Risk Practical Methods For Engineers PDF Full Chapter
Dokument18 Seiten
Book PDF Reliability Maintainability and Risk Practical Methods For Engineers PDF Full Chapter
mary.turner669
100% (8)
95 Shortcuts For Windows Run Commands Keyboard Shortcuts PDF
Dokument7 Seiten
95 Shortcuts For Windows Run Commands Keyboard Shortcuts PDF
sreekanth
Noch keine Bewertungen
Design of PLC Based Automatic Flat Bottle Label Adjuster: Olorunda, P. A. and Adetunde, I. A
Dokument5 Seiten
Design of PLC Based Automatic Flat Bottle Label Adjuster: Olorunda, P. A. and Adetunde, I. A
Mani
Noch keine Bewertungen
C++ Summary
Dokument2 Seiten
C++ Summary
ramakantsawant
Noch keine Bewertungen
ETSI TS 138 215: 5G NR Physical Layer Measurements (3GPP TS 38.215 Version 17.3.0 Release 17)
Dokument33 Seiten
ETSI TS 138 215: 5G NR Physical Layer Measurements (3GPP TS 38.215 Version 17.3.0 Release 17)
rkaul2763
Noch keine Bewertungen
Republic of The Philippines Sangguniang Panlungsod City of Baguio SOFAD SESSION, 09 OCTOBER 2017, 2:00 P.M. Session Nr. 1
Dokument3 Seiten
Republic of The Philippines Sangguniang Panlungsod City of Baguio SOFAD SESSION, 09 OCTOBER 2017, 2:00 P.M. Session Nr. 1
rain
Noch keine Bewertungen
Adept Poverty
Dokument21 Seiten
Adept Poverty
Kitchie Hermoso
Noch keine Bewertungen
MANUAL Sbm100whi 00 Dfu Aen
Dokument10 Seiten
MANUAL Sbm100whi 00 Dfu Aen
Mary Carmen Andreu Merino
Noch keine Bewertungen
ZTE UMTS Idle Mode and Channel Behavior Feature Guide U9.2
Dokument102 Seiten
ZTE UMTS Idle Mode and Channel Behavior Feature Guide U9.2
Pushp Sharma
Noch keine Bewertungen
Management Information System
Dokument4 Seiten
Management Information System
Pratiksha Baid Daga
Noch keine Bewertungen
CCNA Exploration 2 - Module 4 Exam Answers Version 4.0
Dokument3 Seiten
CCNA Exploration 2 - Module 4 Exam Answers Version 4.0
fun kolla
Noch keine Bewertungen
Meraki Datasheet mt12 - en 2
Dokument2 Seiten
Meraki Datasheet mt12 - en 2
sipster2020
Noch keine Bewertungen
Cutler-Hammer: Automatic Transfer Switches
Dokument2 Seiten
Cutler-Hammer: Automatic Transfer Switches
danielliram993
Noch keine Bewertungen
Language Learning Material Development
Dokument20 Seiten
Language Learning Material Development
Ellen Grace Baguilat Putac
100% (1)
HiFi ROSE
Dokument1 Seite
HiFi ROSE
jesusrh
Noch keine Bewertungen
Flitedeck Pro X: Release Notes
Dokument28 Seiten
Flitedeck Pro X: Release Notes
Łukasz Barzyk
Noch keine Bewertungen
Imrt Vmat
Dokument29 Seiten
Imrt Vmat
andrea
Noch keine Bewertungen
174872-Report On Voice Enabled Enterprise Chatbot
Dokument61 Seiten
174872-Report On Voice Enabled Enterprise Chatbot
Balaji Grandhi
Noch keine Bewertungen
HTTPS: - WWW - Whois.com - Whois - Tech-Al - Info
Dokument4 Seiten
HTTPS: - WWW - Whois.com - Whois - Tech-Al - Info
Λουκάς
Noch keine Bewertungen
College Brochure Project
Dokument2 Seiten
College Brochure Project
api-275943721
Noch keine Bewertungen
SHA MAN 0052 Ver4 Web
Dokument96 Seiten
SHA MAN 0052 Ver4 Web
Greed Css
Noch keine Bewertungen
Strategic Assignment
Dokument3 Seiten
Strategic Assignment
jukilili3115
50% (4)
1st Quarter Exam in Math9
Dokument3 Seiten
1st Quarter Exam in Math9
Robert Jay Mejorada
100% (4)
Emptech Activity 1
Dokument6 Seiten
Emptech Activity 1
Ashley Agramon
Noch keine Bewertungen