0 Bewertungen0% fanden dieses Dokument nützlich (0 Abstimmungen)
40 Ansichten7 Seiten
This document summarizes a case study analyzing customer account data using unsupervised learning clustering methods to detect patterns of potentially fraudulent activity. Unsupervised learning and clustering were used since there were no predefined labels for fraudulent accounts. The analysis clustered 13,000 customer accounts into 40 groups using SAS Enterprise Miner. One cluster exhibited patterns of weekend, holiday, restaurant and hotel purchases, indicating improper personal expenses. The case study demonstrates using clustering for pattern detection to build a knowledge base to identify future fraudulent transactions. Limitations include potential issues with the sample, assumptions made without predefined labels, and needing to revisit the model if testing finds errors in judgments.
This document summarizes a case study analyzing customer account data using unsupervised learning clustering methods to detect patterns of potentially fraudulent activity. Unsupervised learning and clustering were used since there were no predefined labels for fraudulent accounts. The analysis clustered 13,000 customer accounts into 40 groups using SAS Enterprise Miner. One cluster exhibited patterns of weekend, holiday, restaurant and hotel purchases, indicating improper personal expenses. The case study demonstrates using clustering for pattern detection to build a knowledge base to identify future fraudulent transactions. Limitations include potential issues with the sample, assumptions made without predefined labels, and needing to revisit the model if testing finds errors in judgments.
This document summarizes a case study analyzing customer account data using unsupervised learning clustering methods to detect patterns of potentially fraudulent activity. Unsupervised learning and clustering were used since there were no predefined labels for fraudulent accounts. The analysis clustered 13,000 customer accounts into 40 groups using SAS Enterprise Miner. One cluster exhibited patterns of weekend, holiday, restaurant and hotel purchases, indicating improper personal expenses. The case study demonstrates using clustering for pattern detection to build a knowledge base to identify future fraudulent transactions. Limitations include potential issues with the sample, assumptions made without predefined labels, and needing to revisit the model if testing finds errors in judgments.
there is no target field in this case. Unsupervised learning method is used to uncover meaningful patterns in the data. Clustering is often called an unsupervised learning task as no class values denoting an a priori grouping of the data instances are given. Analysis of Case study(Contd..) Using the SAS Enterprise Miner, a sample of approximately 13,000 accounts is created. To generate the clusters for analysis ,the value k and cluster variable names are given in the cluster model parameters option. The algorithm will try to arrange the data around the four clusters in such a way as to minimize differences within clusters at the same time that it maximizes differences between clusters. Analysis of Case study(Contd..) In this case, The parameters for the cluster analysis were set to 40 clusters In one of the clusters ,they found that that account holders in one of those cluster made a high amount of weekend and holiday purchases, restaurant purchases and hotel purchases These accounts are problematic as the patterns exhibited by them clearly indicate improper use of purchase cards for personal and unwarranted expenses. Conclusion to the case Cluster analysis yields substantive results in the absence of a target field. Used wisely, cluster analysis can help an organization interested in fraud detection build a knowledge base of fraud. The ultimate objective would be the creation of supervised learning model such as a neural network that is focused on uncovering fraudulent transactions. Recommendation Data clustering should used for pattern detecting rather than mere exploratory research Cluster models should be tested before applying to new data, producing cases for investigation Investigation to cluster models must be carried out to validate the conclusion derived from such patterns Cluster models needs to be revisited if investigation show the models judgment to be erroneous Findings from cluster model investigation must be stored in a knowledge base Strength of the Case Study Use of Cluster Analysis for pattern detection on fraudulent behaviour Exploratory analysis may be satisfied to discover some interesting cases in the data Pattern discovery will leverage the existing clusters and the general patterns associated with those clusters to assign new cases to clusters Use of SAS for data processing Sample of approximately 13000 accounts created. Data processing needed a tool which can handle such a large dataset Creation of knowledge base on the basis of analysed account data This will help in detection of future incidents without having to perform the whole test again Limitation of the Case Study Possibility of wrong sample being chosen for performing analysis Samples need to be representative of the total population so that models have a chance to see possible combinations of fields Since there is no target field, there is a possibility of wrongful classification of raw data available as fraudulent Cluster analysis is used in the case as a pattern detection technique; therefore, the resulting cluster model would need to be tested were it is to be applied The clusters created from the sample data are on the basis of hypothetical assumptions Possibility of error in the proposed model causing the requirement for a revisit of the employed cluster analysis The model would still need to be tested by using new data to ensure that the clusters developed are consistent with the current model
1edited GROUP 11 - THE EFFICACY LEVEL OF ACCOUNTANCY STUDENTS LEARNING STRATEGY IN ATTAINING THE RETENTION POLICY IN LA CONCEPCION COLLEGE BASIS FOR LEARNERS ENHANCEMENT PROGRAM