Steps To Run Wordcount

Uploaded by Praveen kumar

Steps to run wordcount program

Procedure:

Step 1: Create a folder named WordCountTutorial on the Desktop of the hduser account.

Step 2: Inside that folder, create a Java file (WordCount.java) that counts the words and their occurrences.

Step 3: Inside WordCountTutorial, create another folder named input_data. Inside input_data, create input.txt using vi in the terminal.

Step 4: Inside WordCountTutorial, create a folder named tutorial_classes.

Step 5: Type the following commands in the terminal:

hadoop version

Step 6: javac -version

Step 7: export HADOOP_CLASSPATH=$(hadoop classpath)

Step 8: echo $HADOOP_CLASSPATH

Step 9: hadoop fs -mkdir /WordCountTutorial

Step 10: hadoop fs -mkdir /WordCountTutorial/Input

Step 11: hadoop fs -put '/home/hduser/Desktop/WordCountTutorial/input_data/input.txt' /WordCountTutorial/Input

Step 12: cd /home/hduser/Desktop/WordCountTutorial/

Step 13: javac -classpath ${HADOOP_CLASSPATH} -d '/home/hduser/Desktop/WordCountTutorial/tutorial_classes' '/home/hduser/Desktop/WordCountTutorial/WordCount.java'

Step 14: jar -cvf firstTutorial.jar -C tutorial_classes/ .

Step 15: hadoop jar '/home/hduser/Desktop/WordCountTutorial/firstTutorial.jar' WordCount /WordCountTutorial/Input /WordCountTutorial/Output

Step 16: hadoop fs -cat /WordCountTutorial/Output/*
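
Steps 11 to 15 assume the local layout from Steps 1 to 4 is already in place. A minimal sketch of a pre-flight check is shown below, in plain Java with no Hadoop dependency. The class name LayoutCheck and the method missingEntries are hypothetical and not part of the original procedure; the default folder is the path used in the steps above.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the original tutorial.
public class LayoutCheck {

    // Relative paths the steps expect under the WordCountTutorial folder.
    static final String[] EXPECTED = {
        "WordCount.java",
        "input_data/input.txt",
        "tutorial_classes"
    };

    // Returns the expected entries that do not exist under baseDir.
    static List<String> missingEntries(Path baseDir) {
        List<String> missing = new ArrayList<>();
        for (String relative : EXPECTED) {
            if (!Files.exists(baseDir.resolve(relative))) {
                missing.add(relative);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Defaults to the folder from the steps; another folder can be passed as args[0].
        Path baseDir = Paths.get(args.length > 0
                ? args[0]
                : "/home/hduser/Desktop/WordCountTutorial");
        List<String> missing = missingEntries(baseDir);
        if (missing.isEmpty()) {
            System.out.println("Layout looks complete: " + baseDir);
        } else {
            System.out.println("Missing under " + baseDir + ": " + missing);
        }
    }
}
```

If the check reports tutorial_classes as missing, Step 13 is likely to fail, since javac is told to write its class files into that folder.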

WordCount.java

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

    // Mapper: emits (word, 1) for every whitespace-separated token in a line.
    public static class TokenizerMapper
            extends Mapper<Object, Text, Text, IntWritable> {

        private final static IntWritable one = new IntWritable(1);
        private Text word = new Text();

        public void map(Object key, Text value, Context context)
                throws IOException, InterruptedException {
            StringTokenizer itr = new StringTokenizer(value.toString());
            while (itr.hasMoreTokens()) {
                word.set(itr.nextToken());
                context.write(word, one);
            }
        }
    }

    // Reducer (also used as the combiner): sums the counts for each word.
    public static class IntSumReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {

        private IntWritable result = new IntWritable();

        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable val : values) {
                sum += val.get();
            }
            result.set(sum);
            context.write(key, result);
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(TokenizerMapper.class);
        job.setCombinerClass(IntSumReducer.class);
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        // args[0] is the HDFS input path, args[1] the HDFS output path (see Step 15).
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
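
Because the mapper splits on whitespace only, tokens such as "Word", "word" and "word," end up as three separate keys. If that is not wanted, one option is to normalize each token before calling word.set(...). The sketch below is an optional variation, not part of the original program; the class name TokenNormalizer and the method normalize are hypothetical, and the choice to lowercase and strip surrounding punctuation is an assumption about what "the same word" should mean.

```java
import java.util.Locale;

// Hypothetical helper, not part of the original WordCount.java.
public class TokenNormalizer {

    // Lowercases a token and strips leading and trailing characters that are
    // neither letters nor digits. Returns an empty string if nothing is left.
    static String normalize(String token) {
        return token.toLowerCase(Locale.ROOT)
                .replaceAll("^[^\\p{L}\\p{N}]+|[^\\p{L}\\p{N}]+$", "");
    }

    public static void main(String[] args) {
        // Sample tokens chosen for illustration only.
        String[] samples = {"Word", "word,", "(Hello)", "--"};
        for (String s : samples) {
            System.out.println("[" + s + "] -> [" + normalize(s) + "]");
        }
    }
}
```

Inside the mapper, a token whose normalized form is empty would have to be skipped rather than written to the context; otherwise the empty string would be counted as a word.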
Output:
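
The exact output depends on the contents of input.txt. To illustrate the format to expect from Step 16, the sketch below reproduces the map and reduce logic in plain Java without Hadoop. The class name LocalWordCount and the two input lines are hypothetical and are not the original input.txt; a TreeMap stands in for the sorted grouping that Hadoop performs between the map and reduce phases.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Hypothetical local stand-in for the job, for illustration only.
public class LocalWordCount {

    // Mirrors TokenizerMapper and IntSumReducer: split each line on
    // whitespace, then add 1 to the running count of each token.
    static Map<String, Integer> countWords(Iterable<String> lines) {
        // TreeMap keeps the keys sorted, similar to the order in which
        // keys reach the reducer.
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            StringTokenizer itr = new StringTokenizer(line);
            while (itr.hasMoreTokens()) {
                counts.merge(itr.nextToken(), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Assumed sample input, not the author's input.txt.
        List<String> lines = Arrays.asList("hello hadoop", "hello world");
        for (Map.Entry<String, Integer> e : countWords(lines).entrySet()) {
            // Word and count separated by a tab, as in the job's text output.
            System.out.println(e.getKey() + "\t" + e.getValue());
        }
    }
}
```

For the two sample lines this prints hadoop with a count of 1, hello with a count of 2, and world with a count of 1, one tab-separated pair per line.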
