Analysis of Case Study

This document summarizes a case study analyzing customer account data using unsupervised learning clustering methods to detect patterns of potentially fraudulent activity. Unsupervised learning and clustering were used since there were no predefined labels for fraudulent accounts. The analysis clustered 13,000 customer accounts into 40 groups using SAS Enterprise Miner. One cluster exhibited patterns of weekend, holiday, restaurant and hotel purchases, indicating improper personal expenses. The case study demonstrates using clustering for pattern detection to build a knowledge base to identify future fraudulent transactions. Limitations include potential issues with the sample, assumptions made without predefined labels, and needing to revisit the model if testing finds errors in judgments.

Original description: Analytics

Original title: Group3_DataClustering_CaseStudy

Uploaded by ManU

Analysis of Case Study
An unsupervised learning method is used because there is no target field in this case.
Unsupervised learning is used to uncover meaningful patterns in the data.
Clustering is often called an unsupervised learning task because no class values denoting an a priori grouping of the data instances are given.
Analysis of Case Study (Contd.)
Using SAS Enterprise Miner, a sample of approximately 13,000 accounts is created.
To generate the clusters for analysis, the value of k and the cluster variable names are given in the cluster model parameters option.
The algorithm tries to arrange the data around the k clusters so as to minimize differences within clusters while maximizing differences between clusters.
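The idea of minimizing within-cluster differences can be illustrated with a minimal k-means sketch in Python. This is not the SAS Enterprise Miner implementation; the function names, the seeded random initialization, and the toy data are assumptions made for illustration only.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def k_means(points, k, iterations=100, seed=0):
    """Plain k-means (Lloyd's algorithm) with seeded random initial centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    assignments = [None] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        new_assignments = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the result is stable
        assignments = new_assignments
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assignments


def within_cluster_ss(points, centroids, assignments):
    """Total squared distance of points to their assigned centroid."""
    return sum(squared_distance(p, centroids[a]) for p, a in zip(points, assignments))


# Toy data (hypothetical): two visibly separate groups of accounts.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
centroids, assignments = k_means(points, k=2)
print(assignments, within_cluster_ss(points, centroids, assignments))
```

The within-cluster sum of squares is the quantity the algorithm drives down; comparing it across different values of k is one common way to judge whether a chosen k is reasonable.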
Analysis of Case Study (Contd.)
In this case, the parameters for the cluster analysis were set to 40 clusters.
Account holders in one of those clusters made a high volume of weekend and holiday purchases, restaurant purchases, and hotel purchases.
These accounts are problematic because the patterns they exhibit clearly indicate improper use of purchase cards for personal and unwarranted expenses.
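Spotting such a cluster amounts to comparing each cluster's average behaviour with the overall average. The sketch below shows one way this could be done; the field names, the sample values, and the 1.5x threshold are all hypothetical and not taken from the case study.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical per-account features: share of purchases in each category.
FEATURES = ("weekend_holiday_share", "restaurant_share", "hotel_share")


def cluster_profiles(accounts):
    """Mean of each feature within each cluster."""
    grouped = defaultdict(list)
    for account in accounts:
        grouped[account["cluster"]].append(account)
    return {
        cluster: {f: mean(a[f] for a in members) for f in FEATURES}
        for cluster, members in grouped.items()
    }


def flag_clusters(accounts, ratio=1.5):
    """Clusters whose mean exceeds ratio x the overall mean on every feature."""
    overall = {f: mean(a[f] for a in accounts) for f in FEATURES}
    profiles = cluster_profiles(accounts)
    return sorted(
        cluster
        for cluster, profile in profiles.items()
        if all(profile[f] > ratio * overall[f] for f in FEATURES)
    )


# Made-up accounts, already labelled with a cluster number.
accounts = [
    {"cluster": 0, "weekend_holiday_share": 0.05, "restaurant_share": 0.02, "hotel_share": 0.01},
    {"cluster": 0, "weekend_holiday_share": 0.04, "restaurant_share": 0.03, "hotel_share": 0.02},
    {"cluster": 1, "weekend_holiday_share": 0.06, "restaurant_share": 0.01, "hotel_share": 0.02},
    {"cluster": 1, "weekend_holiday_share": 0.05, "restaurant_share": 0.02, "hotel_share": 0.01},
    {"cluster": 2, "weekend_holiday_share": 0.40, "restaurant_share": 0.30, "hotel_share": 0.25},
    {"cluster": 2, "weekend_holiday_share": 0.35, "restaurant_share": 0.25, "hotel_share": 0.20},
]
print(flag_clusters(accounts))  # -> [2]
```

A flagged cluster is only a lead for investigation; the threshold decides how many leads are produced, not whether misuse occurred.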
Conclusion to the Case
Cluster analysis yields substantive results in the absence of a target field.
Used wisely, cluster analysis can help an organization interested in fraud detection build a knowledge base of fraud.
The ultimate objective would be the creation of a supervised learning model, such as a neural network, that is focused on uncovering fraudulent transactions.
Recommendation
Data clustering should be used for pattern detection rather than mere exploratory research.
Cluster models should be tested before being applied to new data to produce cases for investigation.
Cases produced by cluster models must be investigated to validate the conclusions derived from such patterns.
Cluster models need to be revisited if investigation shows the model's judgment to be erroneous.
Findings from cluster model investigations must be stored in a knowledge base.
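A minimal sketch of such a knowledge base follows, assuming each investigated account is recorded with its cluster and the outcome. The `Finding` record, the account IDs, and the 50% cut-off for revisiting a cluster are hypothetical choices, not part of the case study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """One investigated account stored in the knowledge base (hypothetical shape)."""
    account_id: str
    cluster: int
    confirmed_misuse: bool


def confirmation_rates(findings):
    """Share of investigated accounts per cluster where misuse was confirmed."""
    totals, confirmed = {}, {}
    for finding in findings:
        totals[finding.cluster] = totals.get(finding.cluster, 0) + 1
        confirmed[finding.cluster] = (
            confirmed.get(finding.cluster, 0) + int(finding.confirmed_misuse)
        )
    return {cluster: confirmed[cluster] / totals[cluster] for cluster in totals}


def clusters_to_revisit(findings, min_rate=0.5):
    """Clusters whose flags were mostly not borne out by investigation."""
    rates = confirmation_rates(findings)
    return sorted(cluster for cluster, rate in rates.items() if rate < min_rate)


knowledge_base = [
    Finding("A1", 7, True),
    Finding("A2", 7, True),
    Finding("A3", 7, False),
    Finding("B1", 12, False),
    Finding("B2", 12, False),
]
print(clusters_to_revisit(knowledge_base))  # -> [12]
```

The same stored findings could later serve as labelled examples for a supervised model.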
Strength of the Case Study
Use of cluster analysis for pattern detection of fraudulent behaviour
Exploratory analysis may be content with discovering some interesting cases in the data.
Pattern detection leverages the existing clusters and the general patterns associated with them to assign new cases to clusters.
Use of SAS for data processing
A sample of approximately 13,000 accounts was created; processing it needed a tool that can handle such a large dataset.
Creation of a knowledge base from the analysed account data
This will help detect future incidents without having to repeat the whole analysis.
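Assigning a new case to an existing cluster can be sketched as a nearest-centroid lookup. The cluster labels, centroid values, and feature order below are hypothetical; a real model would use the centroids produced by the fitted cluster analysis.

```python
import math


def nearest_cluster(case, centroids):
    """Return the label of the closest centroid and the distance to it."""
    distances = {label: math.dist(case, centre) for label, centre in centroids.items()}
    label = min(distances, key=distances.get)
    return label, distances[label]


# Hypothetical centroids: (weekend/holiday share, restaurant share, hotel share).
centroids = {
    "routine": (0.05, 0.02, 0.01),
    "leisure_heavy": (0.40, 0.30, 0.25),
}

label, distance = nearest_cluster((0.38, 0.28, 0.20), centroids)
print(label, round(distance, 4))
```

Returning the distance along with the label makes it possible to notice cases that sit far from every cluster and may not fit the existing model at all.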
Limitation of the Case Study
Possibility of an unrepresentative sample being chosen for the analysis
Samples need to be representative of the total population so that models have a chance to see the possible combinations of fields.
Since there is no target field, raw data may be wrongly classified as fraudulent
Cluster analysis is used in the case as a pattern detection technique; therefore, the resulting cluster model would need to be tested before it is applied.
The clusters created from the sample data rest on hypothetical assumptions.
Possibility of error in the proposed model, requiring the cluster analysis to be revisited
The model would still need to be tested on new data to ensure that the clusters developed are consistent with the current model.
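One simple consistency check, offered here only as an assumption about how such testing might look, is to compare how cases spread across clusters in the original sample and in new data. Large shifts suggest the sample was not representative or that behaviour has changed.

```python
from collections import Counter


def cluster_shares(assignments):
    """Fraction of cases that fall in each cluster."""
    counts = Counter(assignments)
    total = len(assignments)
    return {cluster: n / total for cluster, n in counts.items()}


def share_shift(sample_assignments, new_assignments):
    """Change in each cluster's share from the sample to the new data."""
    sample = cluster_shares(sample_assignments)
    new = cluster_shares(new_assignments)
    clusters = sorted(set(sample) | set(new))
    return {c: new.get(c, 0.0) - sample.get(c, 0.0) for c in clusters}


# Made-up cluster labels for the original sample and for newly scored cases.
sample_assignments = [0, 0, 0, 1]
new_assignments = [0, 1, 1, 1]
print(share_shift(sample_assignments, new_assignments))  # -> {0: -0.5, 1: 0.5}
```

What counts as too large a shift is a judgment call that depends on the data volume and the cost of missed cases.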
