Sie sind auf Seite 1von 3

2511 Chapter 4

Data Source Example: Credit card swipes/RFID tags/Digital video surveillance/Emails/Radiology scans/Blogs Data life cycle: DataInformationKnowledgeWisdom

The difficulties of Managing Data


1. The amount of data increases exponentially with time 2. Data are scattered throughout organizations and collected by many individuals using various methods and devices 3. Data come from multiple sources: internal sources (corporate databases and company documents), personal sources (personal thoughts, opinions, and experiences), and external sources (commercial databases, governments reports, and corporate websites) 4. Data degrade over time 5. Data are also subject to data rot. Data rot: primarily to problems with the media on which the data are stored 6. Data errors information that is out of date, inaccurate, or technically corrupt. 7. Data security, quality, and integrity are critical, yet easily jeopardized and make the process of managing data more difficult.

The Data life cycle in an organization Fig 4.1 P.114 Data SourcesData analysisResultSolutions Legislation can affect data archiving requirements:
PIPEDA January 1, 2004 Sarbanes-Oxley U.S. July 2002

The Database Approach


A database management system (DBMS) is a set of program that provides users with tools to add, delete, access, and analyze data stored in one location. An organization can access the data by using query and reporting tools that are part of the DBMS or by using application program specifically written to access the data Advantages of Database Management system: Disadvantages of Database Management Systems Data redundancy reduced Data independence more achievable More complex (and costly) to set up and maintain Data can be effectively managed centrally Complex structures may be slower for processing Increases data accessibility high volume periodic transaction updates Security over data can be more consistent (Data Using the relational database model: Base on liked inconsistency: Various copies of the data do not two-dimensional tables agree). Comprises : Query language (SQL)/ Data Maximize Data security/Data integrity/Data dictionary/Normalization process independence

Data Hierarchy Fig 4.3 P117 Bit/Byte=character/Field/Record/File(Master, transaction, P128) A logical grouping of characters into a work, a small group of words, or an identification number is called a field. It can also contain data other than text and numbers. A logical grouping of related fields, such as the students name, and the courses taken, the date, and the grade, compose a record. A logical grouping of related records is called a file or tale. A logical grouping of related tables would constitute a database.

Data design for database and file-based systems (P117)


Data diagram is a diagram that represents entities in the database and their relationships. Entity (record) is a person, place, thing, or event such as a customer, an employee, or a product about which information is maintained Each characteristic or quality of a particular entity (field) is called attribute is the context of data modeling Every record in a table must contain at least one attribute/field that uniquely identifies that record so that it can be retrieved, updated, and sorted. This identifier is called the primary keyA student record at a Ca would probably use the students ID number as its primary key Secondary keys are other fields that have some identifying information but typically do not identify the record or entity with complete accuracy. Designers plan and develop the database through a process called entity-relationship modeling, using an entity-relationship diagram. A given type are grouped in entity classes Student, parking permit, class and professor are entity classes An instance of an entity class is the presentation of one particular entity Student (Lihang Zhong, 145-89-7123) is the instance of the Student entity class. Entity-Relationship diagram models also document what is happening with Data(P118) Traditional File Environment: Data is organized functionally (by application), usually sequentially and flat Advantages includes: simplicity of operation/Faster processing speedsMonthly payroll

What is a data warehouse ? It is a repository of historical data organized by subject to support decision makers in the org. A specialized form of database An architectural structure that defines how historical data is stored Linking components of the data warehouse via data communications increases the scope of data available Characteristic of a Data Warehouse 1. Organized by business dimension or subject. 2. Consistent 3. Historical 4. Non-volatile 5. Has ability to use on-line analytical processing (OLAP)is the process of performing complex, multi-dimensional analyses of data stored in a database or data warehouse 6. Multi-dimensional

7. Relationship with relational databases Advantage of data warehouses Data is Organized and consistent/Integrated (and possibly cleansed)/Historical and non-volatile/Optimized for access (for OLAP use, multi-dimensional) Disadvantage/constraints of data warehouses Very costly and complex to establish (hardware, software and people) Requires continual maintenance as supporting applications change Requires high levels of security to ensure access to authorized users Data is Not the system of record

Data Governance is an approach to managing information across an entire organization Master data are set of core data, such as customer, product, employee, vendor, geographic location, and so on, that span the enterprise information system and it is important to distinguish them from transaction data. Master Data management is a process that spans all organizational business processes and applications. Role of Data governance Planned approach to data management Involves the entire organization Creation, collection, handling and protection of information Video on the role of data governance and a data repository (data policy management) Note the difference between transaction and master files Master files (master data) is semi-permanent data, such as employee name, address, customer name, customer credit limit Master data management pertains to this type of data Transaction data represents business activities or event, such as payroll cheque, customer invoice Knowledge management systems (Fig 4.12 P133)

Tacit and Explicit knowledge Explicit knowledge is more objective, rational and technical while Tacit knowledge is cumulative, more subjective or experiential

Das könnte Ihnen auch gefallen