Beruflich Dokumente
Kultur Dokumente
PROBLEM STATEMENT
With the enhancement of IT infrastructure and better information collection over the past several decades, organizations have at their disposal a very deep information store. Most of the current organizations are looking for ways to explore this opportunity and mine this information which is available at their disposal to Improve their products and services Enhance Customer Relations Enhance their Top and Bottom lines Currently there are some established software which cater to various requirements of statistical applications in the industry. Some of these software are IBM SPSS, SAS, R, S-Plus etc. However with increasing challenges such as Volume of Data Speed of Analysis required and Complexity of Analysis there is a requirement for finding alternatives to these software / solutions which will be an improvement, advanced solution.
PURPOSE
The goal is to build a fully functioning end user application which would be competitively more efficient and faster when compared to the available softwares especially when it comes to BigData analysis. This application will also include the various data mining tools which would provide the business analyst a whole new interface.
SOLUTION FRAMEWORK
Raw Data
Transformed Data
Analysis
Output
THE 4 VS OF BIGDATA
Data Manipulation
Distributed File System)
Extracting the meta data information. Merging the various datasets. Sorting of various datasets.
Data Processing
functions
MAPREDUCE PARADIGM
CONCLUSION
The main aim of this particular endeavor is to develop this application and do a comparative study with the existing softwares in terms of speed of computation, efficiency and reliability.
This application would be a more generic model that is it should be easily extended to the various business domains to meet their requirements.