Chapter 1
Database Systems
Learning Objectives
• After completing this chapter, you will be able to:
• Define the difference between data and information
• Describe what a database is, various types, and why they are valuable assets for
decision making
• Explain the importance of database design
• See how modern databases evolved from file systems
• Understand flaws in file system data management
• Outline the main components of the database system
• Describe the main functions of a database management system (DBMS)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 2
protected website for classroom use.
Why Databases?
• Characteristics of data in today’s world
• Ubiquitous (i.e., abundant, global, and everywhere)
• Pervasive (i.e., unescapable, prevalent, and persistent)
• Databases make data persistent and shareable in a secure way
• Specialized structures that allow computer-based systems to store, manage, and
retrieve data very quickly
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 3
protected website for classroom use.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 4
protected website for classroom use.
Data versus Information
• Data consists of raw facts
• Not yet processed to reveal meaning to the end user
• Building blocks of information
• Information results from processing raw data to reveal meaning
• Requires context
• Bedrock of knowledge
• Should be accurate, relevant, and timely
• Accurate, relevant, timely information is the key to good decision making
• Good decision making is the key to organizational survival
• Data management: focuses on proper generation, storage, and retrieval of data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 5
protected website for classroom use.
Transforming Raw Data into Information
6
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Transforming Raw Data into Information (continued)
7
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Transforming Raw Data into Information (continued)
8
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Transforming Raw Data into Information (continued)
9
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Introducing the Database
• Shared, integrated computer structure that stores data
• End-user data: raw facts of interest to end user
• Metadata: data about data, through which the end-user data is integrated and
managed
- Describes data characteristics and relationships
• Database management system (DBMS)
• Collection of programs
• Manages the database structure
• Controls access to data stored in the database
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 10
protected website for classroom use.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 11
protected website for classroom use.
Role and Advantages of the DBMS (1 of 2)
• Database management system (DBMS): intermediary between the user and the
database
• Enables data to be shared
• Presents the end user with an integrated view of data
• Provides more efficient and effective data management
• Improves sharing, security, integration, access, decision-making, productivity, etc.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 12
protected website for classroom use.
Advantages of the DBMS (continued)
• Improved data sharing – end users have better access to more and
better managed data
• Improved data security – provides a better framework for data
privacy and security policies
• Better data integration - promotes integrated view of organization’s
operations
• Minimized data inconsistency - probability of data inconsistency is
greatly reduced
13
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Role and Advantages of the DBMS (2 of 2)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 14
protected website for classroom use.
Role and Advantages of the DBMS (continued)
• Improved data access - possible to produce quick answers to
ad hoc queries
• Improved decision making – better managed and improved
data access make it possible to generate better quality
information
• Increased end user productivity – the availability of data
combined with the tools that transform data into usable
information, empowers end users to make quick informed
decisions
15
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Types of Databases (1 of 5)
• Single-user database: supports one user at a time
• Desktop database: single-user database on a personal computer
• Multiuser database: supports multiple users at the same time
• Workgroup databases: supports a small number of users or a specific department
• Enterprise database: supports many users across many departments
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 16
protected website for classroom use.
Types of Databases (2 of 5)
• Classification by location
• Centralized database: data located at a single site
• Distributed database: data distributed across different sites
• Cloud database: created and maintained using cloud data services that provide
defined performance measures for the database
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 17
protected website for classroom use.
Types of Databases (3 of 5)
• Classification by data type
• General-purpose database: contains a wide variety of data used in multiple disciplines
• Discipline-specific database: contains data focused on specific subject areas
• Operational database: designed to support a company’s day-to-day operations
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 18
protected website for classroom use.
Types of Databases (4 of 5)
• Analytical database: stores historical data and business metrics used exclusively
for tactical or strategic decision making
• Data warehouse: stores data in a format optimized for decision support
• Online analytical processing (OLAP): tools for retrieving, processing, and modeling
data from the data warehouse
• Business intelligence: captures and processes business data to generate information
that support decision making
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 19
protected website for classroom use.
Types of Databases (5 of 5)
• Databases can be classified to reflect the degree to which the data is structured
• Unstructured data exists in its original (raw) state
• Structured data results from formatting
- Structure is applied based on type of processing to be performed
• Semistructured data: processed to some extent
• Extensible Markup Language (XML)
• Represents data elements in textual format
• Social media
• No Sql
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 20
protected website for classroom use.
21
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
22
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Why Database Design Is Important
• Focuses on design of database structure that will be used to store and manage
end-user data
• Well-designed database: facilitates data management and generates accurate and
valuable information
• Poorly designed database: causes difficult-to-trace errors that may lead to poor
decision making
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 23
protected website for classroom use.
24
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
25
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Evolution of File System Data Processing (1 of 3)
• Manual file systems
• Accomplished through a system of file folders and filing cabinets
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 26
protected website for classroom use.
• Computerized file systems
• Data processing (DP) specialist created a computer-based system to track data and
produce required reports
- Wrote software that managed the data
- Designed the application programs
- Became more complex as the files/applications increased
• No link between files, same customer details for sales, HR etc.
• File system redux: modern end-user productivity tools
• Ubiquitous use of personal productivity tools can introduce the same problems as the
old file systems
• Microsoft Excel
- Widely used by business users
- Users have become so adept at working with spreadsheets, they tend to use them to complete
tasks for which spreadsheets are not appropriate – database substitute
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 27
protected website for classroom use.
This table containing same data owned respectively by each dept
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 28
protected website for classroom use.
Evolution of File System Data Processing (2 of 3)
Table 1.2 Basic File Terminology
TERM DEFINITION
Data Raw facts, such as a telephone number, a birth date, a customer name, and a year-to-date
(YTD) sales value. Data has little meaning unless it has been organized in some logical
manner.
Field A character or group of characters (alphabetic or numeric) that has a specific meaning. A field
is used to define and store data.
Record A logically connected set of one or more fields that describes a person, place, or thing. For
example, the fields that constitute a record for a customer might consist of the customer’s
name, address, phone number, date of birth, credit limit, and unpaid balance.
File A collection of related records. For example, a file might contain data about the students
currently enrolled at Gigantic University.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 29
protected website for classroom use.
Evolution of File System Data Processing (3 of 3)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 30
protected website for classroom use.
Problems with File System Data Processing
• Problems with file systems challenge the types of information that can be
created from data as well as information accuracy
• Lengthy development times
• Difficulty of getting quick answers
• Complex system administration
• Lack of security and limited data sharing
• Extensive programming
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 31
protected website for classroom use.
Problems with File System
Data Processing
• Lengthy development times
32
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Problems with File System
Data Processing…
• Difficulty of getting answers – need to write programs to produce even the
simplest report makes adhoc queries impossible
• Complex system administration – each file need it own file management
programs that allow the user to add, modify and delete records
• Lack of security and limited data sharing
33
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Problems with File System
Data Processing…
• Extensive programming
• 1) open the original file
• 2) reads a record form the original file
• 3) Transform the original data to conform to the new structure
• 4) write the transformed data into a new file structure
• 5) Delete the original file
• File system application programs are affected by a change in the file structure
– structural dependence
• Structural independence – make changes to file structure without affecting
the application programs ability to access the data
34
Database Systems: Design, Implementation, & Management, International Edition, Rob, Coronel & Crockett
Structural and Data Dependence (1 of 2)
• Structural dependence
• Access to a file is dependent on its own structure
• All file system programs are modified to conform to a new file structure
• Structural independence
• File structure is changed without affecting the application’s ability to access the data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 35
protected website for classroom use.
Structural and Data Dependence (2 of 2)
• Data dependence
• Data access changes when data storage characteristics change
• Changes in the data storage characteristics affecting the application
program’s ability to access the data
• Logical data format
How the human being views the data
• Physical data format
• Any program that access a file must tell the computer not only what to do but
how to do it – specify the opening of a specific file type, record specification,
field definitions.
• Data independence
• Data storage characteristics are changed without affecting the program’s ability to
access the data
• Practical significance of data dependence is the difference between logical and
physical format
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 36
protected website for classroom use.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 37
protected website for classroom use.
Data Redundancy (1 of 2)
• Unnecessarily storing the same data at different places
• Islands of information (i.e., scattered data locations)
• Increases the probability of having different versions of the same data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 38
protected website for classroom use.
Data Redundancy (2 of 2)
• Possible results of uncontrolled data redundancy
• Poor data security
• Data inconsistency
• Data-entry errors
• Data integrity problems
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 39
protected website for classroom use.
Data Anomalies
• Develop when not all of the required changes in the redundant data are made
successfully
• Update anomalies
• Insertion anomalies
• Deletion anomalies
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 40
protected website for classroom use.
41
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Database Systems (1 of 2)
• Logically related data stored in a single logical data repository
• Physically distributed among multiple storage facilities
• DBMS eliminates most of file system’s data inconsistency, data anomaly, data
dependence, and structural dependence problems
• Current generation DBMS software
- Stores data structures, relationships between structures, and access paths
- Defines, stores, and manages all access paths and components
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 42
protected website for classroom use.
Database Systems (2 of 2)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 43
protected website for classroom use.
The Database System Environment (1 of 2)
• Database system: organization of components that define and regulate the
collection, storage, management, and use of data within a database
environment
• Hardware
• Software
- Operating system software
- DBMS software
- Application programs and utility software
• People
• Procedures
• Data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 44
protected website for classroom use.
The Database System Environment (2 of 2)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 45
protected website for classroom use.
DBMS Functions (1 of 3)
• Data dictionary management
• Data dictionary: stores definitions of data elements and their relationships
• Data storage management
• Performance tuning ensures efficient performance
• Data transformation and presentation
• Data is formatted to conform to logical expectations
• Security management
• Enforces user security and data privacy
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 46
protected website for classroom use.
DBMS Functions (2 of 3)
• Multiuser access control
• Sophisticated algorithms ensure that multiple users can access the database
concurrently without compromising its integrity
• Backup and recovery management
• Enables recovery of the database after a failure
• Data integrity management
• Minimizes redundancy and maximizes consistency
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 47
protected website for classroom use.
DBMS Functions (3 of 3)
• Database access languages and application programming interfaces
• Query language: lets the user specify what must be done without having to specify
how
• Structured Query Language (SQL): de facto query language and data access standard
supported by the majority of DBMS vendors
• Database communication interfaces
• Accept end-user requests via multiple, different network environments
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 48
protected website for classroom use.
Managing the Database System: A Shift in Focus
• Disadvantages of database systems
• Increased costs
• Management complexity
• Maintaining currency
• Vendor dependence
• Frequent upgrade/replacement cycles
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 49
protected website for classroom use.
Preparing for Your Database Professional Career
TABLE 1.3 DATABASE CAREER OPPORTUNITIES
JOB TITLE DESCRIPTION SAMPLE SKILLS REQUIRED
Database Developer Create and maintain database-based Programming, database fundamentals, SQL
applications
Database Designer Design and maintain databases Systems design, database design, SQL
Database Manage and maintain DBMS and Database fundamentals, SQL, vendor courses
Administrator databases
Database Analyst Develop databases for decision support QL, query optimization, data warehouses
reporting
Database Architect Design and implementation of database DBMS fundamentals, data modeling, SQL,
environments (conceptual, logical, and hardware knowledge, etc.
physical)
Database Consultant Help companies leverage database Database fundamentals, data modeling,
technologies to improve business database design, SQL, DBMS, hardware,
processes and achieve specific goals vendor-specific technologies, etc.
Database Security Implement security policies for data DBMS fundamentals, database administration,
Officer administration SQL, data security technologies, etc.
Cloud Computing Design and implement the infrastructure Internet technologies, cloud storage
Data Architect for next-generation cloud database technologies, data security, performance tuning,
systems large databases, etc.
Data Scientist Analyze large amounts of varied data to Data analysis, statistics, advanced mathematics,
generate insights, relationships, and SQL, programming, data mining, machine
predictable behaviors learning, data visualization
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 50
protected website for classroom use.
Summary
• Data consists of raw facts and is usually stored in a database
• Database design defines the database structure
- Can be classified according to the number of users, location, as well as data usage and
structure
• Databases evolved from manual and computerized file systems
- There are some limitations of file system data management
- DBMSs were developed to address the file system’s inherent weaknesses
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a password- 51
protected website for classroom use.