Clustering Algorithm

Uploaded by

Biren Arora

It is based on salary spent

BI Mini Project

Report Should contain:

1. Problem Definition (it should be 1 to 2 paragraphs; describe the dataset that
you have considered):
➢ Suppose you own a supermarket mall and, through membership cards,
you have some basic data about your customers: customer ID, age,
gender (stored in the dataset's 'Genre' column), annual income and
spending score. The spending score is something you assign to each
customer based on parameters you define, such as customer behavior
and purchasing data.
Problem Statement: You own the mall and want to understand which
customers can be converted easily [Target Customers], so that this insight
can be given to the marketing team, which can then plan its strategy accordingly.

2. Identifying which data mining task is needed & Why?

➢ The task is clustering, because the customers carry no predefined
segment labels. We implemented a hierarchical clustering algorithm
because it outputs a hierarchy, i.e., a structure that is more
informative than the unstructured set of flat clusters returned by
k-means. This makes it easier to decide on the number of clusters by
looking at the dendrogram.
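Reading the number of clusters off a dendrogram can also be approximated in code. The sketch below is not part of the original report: it uses synthetic points in place of the mall data, and the helper name `suggest_n_clusters` and the "largest gap between merge heights" rule are illustrative assumptions, one common heuristic among several.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage


def suggest_n_clusters(points, max_clusters=10):
    """Suggest a cluster count from the largest jump between Ward merge heights.

    Hypothetical helper; assumes at least three input points.
    """
    Z = linkage(points, method="ward")
    # Heights of the last merges, i.e. the top of the dendrogram.
    last = Z[-max_clusters:, 2]
    gaps = np.diff(last)
    # A horizontal cut placed in the gap at index t leaves len(last) - t clusters.
    k = len(last) - int(np.argmax(gaps))
    return k, Z


# Synthetic stand-in for (income, spending score): three tight, well-separated blobs.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
points = np.vstack([c + rng.normal(scale=0.5, size=(30, 2)) for c in centers])

k, Z = suggest_n_clusters(points)
labels = fcluster(Z, t=k, criterion="maxclust")  # flat labels numbered from 1
print(k, np.bincount(labels)[1:])
```

The heuristic mirrors the visual rule of cutting the dendrogram through its longest uninterrupted vertical stretch; on real data the suggested value is a starting point to check against the plot, not a final answer.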

3. Implement the data mining algorithm of your choice (Python). Describe it (you can show a flowchart of the process, attach a screenshot of the code):

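➢ Python code:

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Load the mall customer data; columns 3 and 4 (annual income and
# spending score) are the clustering features
dataset = pd.read_csv('Mall_Customers.csv')
x = dataset.iloc[:, [3,4]].values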

# Number of customers per gender
plt.figure(1 , figsize = (15 , 5))
sns.countplot(y = 'Genre' , data = dataset)
plt.show()

# Pairwise regression plots of the numeric columns. The loop variables are
# named x_col and y_col so that they do not overwrite the feature matrix x.
plt.figure(1 , figsize = (15 , 7))
n = 0
for x_col in ['Age' , 'Annual Income (k$)' , 'Spending Score (1-100)']:
    for y_col in ['Age' , 'Annual Income (k$)' , 'Spending Score (1-100)']:
        n += 1
        plt.subplot(3 , 3 , n)
        plt.subplots_adjust(hspace = 0.5 , wspace = 0.5)
        sns.regplot(x = x_col , y = y_col , data = dataset)
        plt.ylabel(y_col.split()[0]+' '+y_col.split()[1] if len(y_col.split()) > 1 else y_col )
plt.show()

# Distribution of each numeric column, split by gender
plt.figure(1 , figsize = (15 , 7))
n = 0
for cols in ['Age' , 'Annual Income (k$)' , 'Spending Score (1-100)']:
    n += 1
    plt.subplot(1 , 3 , n)
    plt.subplots_adjust(hspace = 0.5 , wspace = 0.5)
    sns.violinplot(x = cols , y = 'Genre' , data = dataset , palette = 'vlag')
    # sns.swarmplot(x = cols , y = 'Genre' , data = dataset)
    plt.ylabel('Gender' if n == 1 else '')
    plt.title('Violin plots' if n == 2 else '')
plt.show()

# Using Dendrogram to find optimal no. of clusters

import scipy.cluster.hierarchy as sch
dendrogram = sch.dendrogram(sch.linkage(x, method = 'ward'))
plt.title('Dendrogram')
plt.xlabel('Customers')
plt.ylabel('Euclidean Distances')
plt.show()

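# Fitting hierarchical clustering to dataset
# Ward linkage only works with Euclidean distance, so the distance metric is
# left at its default (the 'affinity' argument was removed in scikit-learn 1.4)

from sklearn.cluster import AgglomerativeClustering
hc = AgglomerativeClustering(n_clusters = 5, linkage = 'ward')
y_hc = hc.fit_predict(x)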

# Visualizing the clusters
# The segment names below are attached to label indices after inspecting the
# plot; the index-to-name mapping is specific to this dataset and fit.

plt.scatter(x[y_hc == 0, 0], x[y_hc == 0, 1], s = 100, c = 'red', label = 'Careful')
plt.scatter(x[y_hc == 1, 0], x[y_hc == 1, 1], s = 100, c = 'blue', label = 'Standard')
plt.scatter(x[y_hc == 2, 0], x[y_hc == 2, 1], s = 100, c = 'green', label = 'Targets')
plt.scatter(x[y_hc == 3, 0], x[y_hc == 3, 1], s = 100, c = 'cyan', label = 'Careless')
plt.scatter(x[y_hc == 4, 0], x[y_hc == 4, 1], s = 100, c = 'magenta', label = 'Sensible')
plt.title('Clusters of Clients')
plt.xlabel('Annual Income (k$)')
plt.ylabel('Spending Score (1-100)')
plt.legend()
plt.show()
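Because the segment names in the scatter plot are tied to label indices, it helps to check what each index actually contains before naming it. The following is a minimal sketch, not taken from the original report: `profile_clusters` is a hypothetical helper, and the six-row table and its labels are made-up stand-ins for the real features and for `y_hc`.

```python
import numpy as np
import pandas as pd


def profile_clusters(features: pd.DataFrame, labels) -> pd.DataFrame:
    """Return the mean of every feature and the member count for each cluster label."""
    grouped = features.assign(cluster=labels).groupby("cluster")
    summary = grouped.mean()
    summary["size"] = grouped.size()
    return summary.round(1)


# Made-up example rows; real input would be the two feature columns and y_hc.
demo = pd.DataFrame({
    "Annual Income (k$)": [15, 17, 80, 90, 85, 88],
    "Spending Score (1-100)": [80, 76, 10, 14, 90, 86],
})
demo_labels = np.array([0, 0, 1, 1, 2, 2])

profile = profile_clusters(demo, demo_labels)
print(profile)
```

A cluster with high mean income and high mean spending score would correspond to the 'Targets' name, one with high income and low spending to 'Careful', and so on; the table makes that mapping explicit instead of relying on the plot colors.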

4. Interpret & visualize the result (different graphs you can show / RapidMiner output you can attach; it is mandatory):

➢
5. Clearly state the BI decision that is to be taken as a result of mining:
➢ We implemented a hierarchical clustering algorithm because it outputs
a hierarchy, i.e., a structure that is more informative than the unstructured set
of flat clusters returned by k-means. This makes it easier to decide on the
number of clusters by looking at the dendrogram.
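One way to turn the clustering into a concrete decision for the marketing team is to pick out the target segment and export its customer IDs. The sketch below is an illustration under stated assumptions rather than the report's own method: the rule "mean income and mean spending score both above the overall mean" is an assumed definition of a target cluster, `pick_target_cluster` is a hypothetical helper, and the rows are invented.

```python
import numpy as np
import pandas as pd

INCOME = "Annual Income (k$)"
SCORE = "Spending Score (1-100)"


def pick_target_cluster(df: pd.DataFrame, labels):
    """Return the label of the cluster with above-average income and spending.

    If several clusters qualify, the one with the highest mean spending score
    wins; returns None when no cluster qualifies. The rule is an assumption.
    """
    means = df.assign(cluster=labels).groupby("cluster")[[INCOME, SCORE]].mean()
    qualifies = (means[INCOME] > df[INCOME].mean()) & (means[SCORE] > df[SCORE].mean())
    candidates = means[qualifies]
    if candidates.empty:
        return None
    return int(candidates[SCORE].idxmax())


# Invented rows standing in for Mall_Customers.csv plus cluster labels.
customers = pd.DataFrame({
    "CustomerID": [1, 2, 3, 4, 5, 6],
    INCOME: [15, 17, 80, 90, 85, 88],
    SCORE: [80, 76, 10, 14, 90, 86],
})
labels = np.array([0, 0, 1, 1, 2, 2])

target = pick_target_cluster(customers, labels)
# IDs that could be handed to the marketing team
target_ids = customers.loc[labels == target, "CustomerID"].tolist()
print(target, target_ids)
```

With the real data, `labels` would be the `y_hc` array from the fitted model, and the resulting ID list is the set of customers a campaign would address first.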
