
BIG DATA

Introduction
Big data is a term for very large data sets that can be analyzed computationally to reveal
patterns and trends in seemingly random data. Today the whole IT industry is restructuring the way
it maintains its databases. This data could be anything: email IDs, numbers of employees or
clients, blood groups of patients, or a collection of driving license numbers from around the
world.
In simple words, Big Data is a set of techniques for managing important but scattered data and
analyzing its behavior. It is the latest technology toward which the whole world is moving.
Enormous numbers of jobs, and opportunities to start your own business, will be created in this field.
NADC says: Every day, we create 2.5 quintillion bytes of data, so much that 90% of the data in the
world today has been created in the last two years alone. This data comes from everywhere: sensors
used to gather climate information, posts to social media sites, digital pictures and videos, purchase
transaction records, and cell phone GPS signals, to name a few. This data is Big Data.
Course Content

Introduction to Big Data

Traditional Data Processing Technologies

Apache Hadoop Architecture

Hadoop Architecture

Hadoop and RDBMS

Hadoop Distributions

HDFS Architecture

Hadoop Ecosystem: MapReduce, Hadoop Streaming, Hive, Pig, HBase

Where Hadoop fits in the Enterprise

Hadoop Setup and Installation

HDFS Programming Basics

Hadoop Streaming

Performance Tuning

Debugging Hadoop Programs

MapReduce Architecture

MapReduce Programming Basics

MapReduce Programming Using BigInsights


Accessing Hadoop Data Using Hive

Hive Architecture.

Downloading, Installing and Configuring Hive.

Understand what Apache Hive is and Hive use cases.

Make basic configuration changes in a Hive installation.

Use DDL to create new Hive databases and tables.


Prerequisites
The workshop content is an approximately equal mix of lectures and hands-on labs. This
will be a two-day workshop. All students should have at least moderate knowledge of Java and databases.
Recommendation: It is strongly recommended that you bring your own laptop to the training, so
that you can install and run programs if you would like to do the optional hands-on
experiments/exercises after the workshop.
Certification
1. "Certificate of Appreciation" for the organizing person from ARK Technosolutions & NADC
India, AMALGAM-IIT MADRAS.
2. "Certificate of Association" for the organizing college from ARK Technosolutions & NADC
India, AMALGAM-IIT MADRAS.
3. "Certificate of Participation" to every participant from ARK Technosolutions & NADC
India, AMALGAM-IIT MADRAS.
4. "Certificate of Merit" to the zonal winners from ARK Technosolutions & NADC
India, AMALGAM-IIT MADRAS.
5. "Certificate of Coordination" to the coordinators from ARK Technosolutions & NADC
India, AMALGAM-IIT MADRAS.

Regards

RUTUJ KARANDIKAR
HEAD MANAGER, INDIA
AMALGAM, IIT MADRAS
08425858196, 9769172667
