Willkommen bei Scribd!

Bda Lab Assignment

Hochgeladen von

0% fanden dieses Dokument nützlich (0 Abstimmungen)

21 Ansichten4 Seiten

This document contains a Big Data Analytics assignment submitted by Vaibhav Singh. It includes 7 questions to be solved using PySpark commands. The questions involve common list operations like incrementing elements, multiplying elements, finding most frequent words, and filtering even numbers. It also includes questions on joins between two files and the difference between map and flatMap transformations.

Originalbeschreibung:

lab assignment of big data analytics

Copyright

Verfügbare Formate

PDF, TXT oder online auf Scribd lesen

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Dieses Dokument melden

Copyright:

Verfügbare Formate

Als PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

0% fanden dieses Dokument nützlich (0 Abstimmungen)

21 Ansichten4 Seiten

Bda Lab Assignment

Hochgeladen von

Vaibhav Singh

Copyright:

Verfügbare Formate

Als PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

Zu Seite

Sie sind auf Seite 1von 4

Im Dokument suchen

BigDataAnalytics

Assignment

SubmittedBy:-VaibhavSingh
14B00033
CSC

Writeaprograminpysparkthefollowingquestions:-
1. Toincrementeachnumberinalistbyone.

l1=sc.parallalize([1,2,3,4,5])
l1.collect()
l1=rdd1.map(lambda x:x+1)
l1.collect()

Output = [2,3,4,5,6]

2. Tomultiplyeachnumberinalistby10

l1=sc.parallalize([1,2,3,4,5])
l1.collect()
l1=rdd1.map(lambda x:x*10)
l1.collect()

Output = [10,20,30,40,50]

3. To find most commonly occurring words with their associated
frequencies.

from operator import add

s=["a","b","a","c","a"]

s1=sc.parallelize(s)

s2=s1.map(lambda x:((x,1).reduce By key (add).collect())

print s2.collect()

Ouput = [("a",3),("b",1),("c",1)]

4. Findfrequencyofeachstate:-
State=["delhi","HP","HR","HR","UP"]

from operator import add

s=["delhi","HP","HR","HR","UP"]

s1=sc.parallelize(s)

s2=s1.map(lambda x:(x,1)).reduceByKey(add).collect())

print s2.collect()

Output = [("delhi",1),("HP",1),("HR",2),("UP",1)]

5. Toprintevennumbersoutofalistofnumbers.

l1=sc.parallalize([1,2,3,4,5,6])
l1.collect()
l2=l1.filter(lambda x:x%2==0)
print l2.collect()

Output = [2,4,6]

6. Write the spark commands to perform join operations between

twofiles.
Each file contains a persons name, DOB, and age. Group the
personbyage.

l1=sc.textFile(/home/1.txt)
l2=sc.textFile(/home/2.txtt)
l3=l1.map(lambdax:tuple(x.split()))
l4=l3.map(lambda(x,y,z):(x,y))
l5=l2.map(lambdax:tuple(x.split()))
l6=l5.map(lambda(x,y,z):(x,y)))
l7=l6.join(l4)
Printl7.collect()

Output=[(a,(23,25)),(s,(20,24)),(m,(21,20))]

7. Differentiatebetweenmapandflatmap.
Here is an example of the difference:
val textFile = sc.textFile("README.md") // create an RDD of lines of text

// MAP:

textFile.map(_.length) // map over the lines:

res2: Array[Int] = Array(14, 0, 71, 0, 0, ...)

// -> one length per line

// FLATMAP:

textFile.flatMap(_.split(" ")) // split each line into words:

res3: Array[String] = Array(#, Apache, Spark, ...)

// -> multiple words per line, and multiple lines

// - but we end up with a single output array of words

map transforms an RDD of length N into another RDD of length N.

For example, it maps from N lines into N line-lengths.

flatMap (loosely speaking) transforms an RDD of length N into a collection of N collections,

then flattens these into a single RDD of results.
For example, flatMapping from a collection of lines to a collection of words.

["aa bb cc", "", "dd"] => [["aa","bb","cc"],[],["dd"]] =>

["aa","bb","cc","dd"]

The input and output RDDs will therefore typically be of different sizes.

(You may need to call collect() on the RDDs generated in the examples above - I have
omitted this for clarity)

Das könnte Ihnen auch gefallen

Grit: The Power of Passion and Perseverance
Von Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Bewertung: 4 von 5 Sternen
4/5 (588)
The Yellow House: A Memoir (2019 National Book Award Winner)
Von Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Bewertung: 4 von 5 Sternen
4/5 (98)
Distributed System Assignment 3
Dokument2 Seiten
Distributed System Assignment 3
Vaibhav Singh
Noch keine Bewertungen
Introduction To Neural Networks: John Paxton Montana State University Summer 2003
Dokument24 Seiten
Introduction To Neural Networks: John Paxton Montana State University Summer 2003
Vaibhav Singh
Noch keine Bewertungen
Machine Learning Assignment
Dokument2 Seiten
Machine Learning Assignment
Vaibhav Singh
Noch keine Bewertungen
Conclusion
Dokument4 Seiten
Conclusion
Vaibhav Singh
Noch keine Bewertungen
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Von Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Bewertung: 4 von 5 Sternen
4/5 (5795)
Never Split the Difference: Negotiating As If Your Life Depended On It
Von Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Bewertung: 4.5 von 5 Sternen
4.5/5 (838)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Von Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Bewertung: 4 von 5 Sternen
4/5 (895)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Von Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Bewertung: 4.5 von 5 Sternen
4.5/5 (345)
Shoe Dog: A Memoir by the Creator of Nike
Von Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Bewertung: 4.5 von 5 Sternen
4.5/5 (537)
Yes Please
Von Everand
Yes Please
Amy Poehler
Bewertung: 4 von 5 Sternen
4/5 (1891)
The Little Book of Hygge: Danish Secrets to Happy Living
Von Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Bewertung: 3.5 von 5 Sternen
3.5/5 (400)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Von Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Bewertung: 4.5 von 5 Sternen
4.5/5 (474)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Von Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Bewertung: 3.5 von 5 Sternen
3.5/5 (231)
On Fire: The (Burning) Case for a Green New Deal
Von Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Bewertung: 4 von 5 Sternen
4/5 (74)
The Emperor of All Maladies: A Biography of Cancer
Von Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Bewertung: 4.5 von 5 Sternen
4.5/5 (271)
Angela's Ashes: A Memoir
Von Everand
Angela's Ashes: A Memoir
Frank McCourt
Bewertung: 4.5 von 5 Sternen
4.5/5 (440)
Bad Feminist: Essays
Von Everand
Bad Feminist: Essays
Roxane Gay
Bewertung: 4 von 5 Sternen
4/5 (1016)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Von Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Bewertung: 4.5 von 5 Sternen
4.5/5 (266)
The Unwinding: An Inner History of the New America
Von Everand
The Unwinding: An Inner History of the New America
George Packer
Bewertung: 4 von 5 Sternen
4/5 (45)
Team of Rivals: The Political Genius of Abraham Lincoln
Von Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Bewertung: 4.5 von 5 Sternen
4.5/5 (234)
Principles: Life and Work
Von Everand
Principles: Life and Work
Ray Dalio
Bewertung: 4 von 5 Sternen
4/5 (599)
Fear: Trump in the White House
Von Everand
Fear: Trump in the White House
Bob Woodward
Bewertung: 3.5 von 5 Sternen
3.5/5 (738)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Von Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Bewertung: 3.5 von 5 Sternen
3.5/5 (2259)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Von Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Bewertung: 4 von 5 Sternen
4/5 (1091)
Steve Jobs
Von Everand
Steve Jobs
Walter Isaacson
Bewertung: 4.5 von 5 Sternen
4.5/5 (806)
Rise of ISIS: A Threat We Can't Ignore
Von Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Bewertung: 3.5 von 5 Sternen
3.5/5 (137)
John Adams
Von Everand
John Adams
David McCullough
Bewertung: 4.5 von 5 Sternen
4.5/5 (2409)
The Glass Castle: A Memoir
Von Everand
The Glass Castle: A Memoir
Jeannette Walls
Bewertung: 4.5 von 5 Sternen
4.5/5 (1713)
The Outsider: A Novel
Von Everand
The Outsider: A Novel
Stephen King
Bewertung: 4 von 5 Sternen
4/5 (1839)
Brooklyn: A Novel
Von Everand
Brooklyn: A Novel
Colm Toibin
Bewertung: 3.5 von 5 Sternen
3.5/5 (1937)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Von Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Bewertung: 4.5 von 5 Sternen
4.5/5 (121)
The Light Between Oceans: A Novel
Von Everand
The Light Between Oceans: A Novel
M.L. Stedman
Bewertung: 4.5 von 5 Sternen
4.5/5 (789)
The Art of Racing in the Rain: A Novel
Von Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Bewertung: 4 von 5 Sternen
4/5 (4200)
Manhattan Beach: A Novel
Von Everand
Manhattan Beach: A Novel
Jennifer Egan
Bewertung: 3.5 von 5 Sternen
3.5/5 (792)
The Woman in Cabin 10
Von Everand
The Woman in Cabin 10
Ruth Ware
Bewertung: 3.5 von 5 Sternen
3.5/5 (2322)
The Perks of Being a Wallflower
Von Everand
The Perks of Being a Wallflower
Stephen Chbosky
Bewertung: 4.5 von 5 Sternen
4.5/5 (2104)
Wolf Hall: A Novel
Von Everand
Wolf Hall: A Novel
Hilary Mantel
Bewertung: 4 von 5 Sternen
4/5 (3811)
A Man Called Ove: A Novel
Von Everand
A Man Called Ove: A Novel
Fredrik Backman
Bewertung: 4.5 von 5 Sternen
4.5/5 (4610)
Little Women
Von Everand
Little Women
Louisa May Alcott
Bewertung: 4 von 5 Sternen
4/5 (104)
Sing, Unburied, Sing: A Novel
Von Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Bewertung: 4 von 5 Sternen
4/5 (1103)
A Tree Grows in Brooklyn
Von Everand
A Tree Grows in Brooklyn
Betty Smith
Bewertung: 4.5 von 5 Sternen
4.5/5 (1929)
Her Body and Other Parties: Stories
Von Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Bewertung: 4 von 5 Sternen
4/5 (821)
The Constant Gardener: A Novel
Von Everand
The Constant Gardener: A Novel
John le Carré
Bewertung: 3.5 von 5 Sternen
3.5/5 (104)
41 - 200810 Iss PRG Vastech PDF
Dokument16 Seiten
41 - 200810 Iss PRG Vastech PDF
MISPOLJ
Noch keine Bewertungen
Pretrained ResNet-18 Convolutional Neural Network - MATLAB Resnet18
Dokument2 Seiten
Pretrained ResNet-18 Convolutional Neural Network - MATLAB Resnet18
sidharth mahotra
Noch keine Bewertungen
Atmel S51AVR Programmer USERguide
Dokument10 Seiten
Atmel S51AVR Programmer USERguide
maksimad
Noch keine Bewertungen
Project Proposal
Dokument7 Seiten
Project Proposal
Sagar Ghimire
0% (1)
Gnuplot Ja
Dokument17 Seiten
Gnuplot Ja
Santiago Vidal garcia
Noch keine Bewertungen
Use Twitter To Control Arduino Uno Via Visual Basi
Dokument6 Seiten
Use Twitter To Control Arduino Uno Via Visual Basi
Mu'izz Kahar
Noch keine Bewertungen
Siemens TCP Ip Ethernet Manual
Dokument103 Seiten
Siemens TCP Ip Ethernet Manual
Jonathan Cheuquian
Noch keine Bewertungen
ITSY 2401 - Firewalls and Network Security - Network Security Plan Project
Dokument7 Seiten
ITSY 2401 - Firewalls and Network Security - Network Security Plan Project
Kyle LaPato
Noch keine Bewertungen
0610206v3 PDF
Dokument13 Seiten
0610206v3 PDF
ahmed3423
Noch keine Bewertungen
CPU Design HOWTO PDF
Dokument21 Seiten
CPU Design HOWTO PDF
Selvaraj Villy
Noch keine Bewertungen
Tcs Questions
Dokument12 Seiten
Tcs Questions
Kajol Mathuria
Noch keine Bewertungen
Lesson 6
Dokument74 Seiten
Lesson 6
Tek Casonete
Noch keine Bewertungen
Ec2 Auto Scaling
Dokument189 Seiten
Ec2 Auto Scaling
Nimbala Vinodkumar
Noch keine Bewertungen
Event Category Health Rule Violation Events
Dokument8 Seiten
Event Category Health Rule Violation Events
Vegga Firsthya
Noch keine Bewertungen
sm2 130127132631 Phpapp02
Dokument16 Seiten
sm2 130127132631 Phpapp02
Ashraful A. Khan
Noch keine Bewertungen
Failed To Create PDF Context Delegate
Dokument2 Seiten
Failed To Create PDF Context Delegate
Jessica
Noch keine Bewertungen
Marlon E. Calolot - Resume
Dokument5 Seiten
Marlon E. Calolot - Resume
ronaldrey_007
Noch keine Bewertungen
5 Axis Arm Robot Trainer Ed 7255 1390040798
Dokument2 Seiten
5 Axis Arm Robot Trainer Ed 7255 1390040798
Gabriel Aparecido Fonseca
Noch keine Bewertungen
Web Publishing Test Cases
Dokument3 Seiten
Web Publishing Test Cases
Jafar Bhatti
Noch keine Bewertungen
AccuLoad III - ALX - Modbus Communications Manual MN06131L
Dokument456 Seiten
AccuLoad III - ALX - Modbus Communications Manual MN06131L
EitanBoria
Noch keine Bewertungen
LFD259 Labs - V2019 01 14
Dokument86 Seiten
LFD259 Labs - V2019 01 14
Bill Ho
100% (2)
TM Customizing Basic Settings
Dokument17 Seiten
TM Customizing Basic Settings
aasifRockz
100% (1)
Kuhn Tucker Conditions
Dokument15 Seiten
Kuhn Tucker Conditions
Barath
Noch keine Bewertungen
ABAP - Defining A Range in Module Pool Program
Dokument13 Seiten
ABAP - Defining A Range in Module Pool Program
KIRAN
Noch keine Bewertungen
HC900 Hybrid Control Designer: User Guide
Dokument284 Seiten
HC900 Hybrid Control Designer: User Guide
Chiuda Daniel
Noch keine Bewertungen
Mipt 2014 Burunduk1.En
Dokument5 Seiten
Mipt 2014 Burunduk1.En
Vishal Golcha
Noch keine Bewertungen
Vrealize Suite Overview
Dokument44 Seiten
Vrealize Suite Overview
Muhammad Majid Khan
Noch keine Bewertungen
Julia Documentation
Dokument974 Seiten
Julia Documentation
rotmaus
Noch keine Bewertungen
Cyber Crime
Dokument29 Seiten
Cyber Crime
Evangeline Chai
Noch keine Bewertungen
Authentication Protocol (CHAP)
Dokument6 Seiten
Authentication Protocol (CHAP)
FAZIRA
Noch keine Bewertungen