Beruflich Dokumente
Kultur Dokumente
Data
Data A necessity for almost any enterprise to carry out its business. Consists of raw facts, and when organized may be transformed into information. Database A collection of data organized to meet users needs
Database Management System (DBMS) A group of programs that manipulate the database and provide an interface between the database and the user of the database or other application programs.
Commercial DBMS based on relational model are now widely accepted in industry
e.g., Microsoft Access, Oracle 9i, Sybase Adaptive Server, >20 billion dollar industry!
The
Traditional Approach Separate Course files are Data created and stored for each Lecturer application Data program.
Student Data
Student Admin
Year List
Scheduler
Timetable
Payroll
Cheques
Student Data
Student Admin
Year List
Course Data
Scheduler
Timetable
Lecturer Data
Payroll
Cheques
Data redundancy Program-Data dependence Lack of flexibility Poor security Lack of data-sharing and availability
The Database Approach A pool of related data is shared by multiple application programs. Rather than having separate data files, each application uses a collection of data that is either joined or related in the database.
Creates and maintains databases Eliminates requirement for data definition statements Acts as interface between application programs and physical data files Separates logical and design views of data
Types of Databases
Centralized database Used by single central processor or multiple processors in client/server network Distributed database Stored in more than one physical location Partitioned database Duplicated database
(Analytical Database)
Multidimensional data analysis Supports manipulation and analysis of large volumes of data from multiple dimensions/perspectives
Operational Databases
Data bases store detailed data needed to support the operations of the entire organizations . Also called Subject area database (SADB),transaction data base,Production databases or personal databases
Characteristics of a Database
Structure data types data behavior Persistence store data on secondary storage Retrieval a declarative query language a procedural database programming language
Designing Databases
Primary key A field or set of fields that uniquely identifies the record.
An Entity-Relationship Diagram
21
Scientific Databases
Integration System
Personal Databases
Collects and combines information Provides integrated view, uniform user interface Supports sharing
Prehistoric Times
End-user Interfaces
1st DW Article
DW Confs.
Vendor DW Frameworks
Subject-Oriented:
The data warehouse is organized around the key subjects (or high-level entities) of the enterprise. Major subjects include
Customers, Patients,Students,Products etc .
Integrated
The data housed in the data warehouse are defined using consistent
Naming conventions Formats Encoding Structures Related Characteristics
Time-variant
The data in the warehouse contain a time dimension so that they may be used as a historical record of the business
Non-volatile
Data in the data warehouse are loaded and refreshed from operational systems, but cannot be updated by end-users
Data Warehouse- A Practitioners Viewpoint A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context. -- Barry Devlin, IBM Consultant
Warehouse is a Specialized DB
Standard DB
Mostly updates Many small transactions Mb - Gb of data Current snapshot Index/hash on p.k. Raw data Thousands of users (e.g., clerical users)
Data Warehouse
Mostly reads Queries are long and complex Gb - Tb of data History Lots of scans Summarized, reconciled data Hundreds of users (e.g., decision-makers, analysts)
Data Mart
Data Mart
Data Mart
Data Warehousing: Two Distinct Issues (1) How to get information into warehouse
Data warehousing
Evaluate trends in benefit program use. Identify the wage and benefits costs to determine company-wide variation. Warehouses have also been applied to areas such as: logistics, inventory, purchasing, detailed transaction analysis and load balancing.
Other Areas
Oracle Financial DW
Marketing DW
Federated Financial DW
Federated Marketing DW