SlideShare uma empresa Scribd logo
1 de 57
 Chapter 1
 Database Systems
ISBN-13: 978-1337627900
ISBN-10: 1337627909
Buy Book Amazon.com URL
Modified by:
Usman Tariq, PhD
Associate Professor, PSAU
Office ☎ 00966 11 588 8386
Learning Objectives
 In this chapter, you will learn:
• The difference between data and information in the
context of Data Science
• What a database is, the various types of databases, and
why they are valuable assets for decision making
• The importance of database design
• How modern databases evolved from file systems
2
Learning Objectives
 In this chapter, you will learn:
• About flaws in file system data management
• The main components of the database system
• The main functions of a database management system
(DBMS)
3
Case Study: e-Commerce, Database & Data Science
external
Understanding data science and Database [1/2]
external
Understanding data science and Database [2/2]
external
 Telecommunications companies, such as Sprint and AT&T (in
USA), are known to have systems that keep data on trillions of
phone calls, with new data being added to the system at speeds
up to 70,000 calls per second!
 Not only do these companies have to store and manage
immense collections of data, they have to be able to find any
given fact in that data quickly
 Google receives over 63,000 searches
per second on any given day. That's the
average figure of how many people
use Google a day, which translates into at
least 2 trillion searches per year, 3.8 million
searches per minute, 228 million searches
per hour, and 5.6 billion searches per day.
How can businesses (retail, service, Govt. etc.) process this much data?
How can they store it all, and then quickly retrieve just the facts that
decision makers want to know, just when they want to know it?
How many relationships are in your data?
What is the level of complexity in your data?
How often do the data change?
How often does your application query the data?
How often does your application query the relationship underlying the
data?
How often do your users update the data?
How often do your users update the logic in the data?
How critical is your Application in a disaster scenario?
Data vs. Information
Data Information
 Raw facts
 Raw data - Not yet been processed
to reveal the meaning
 Building blocks of information
 Data management
 Generation, storage, and retrieval
of data
 Produced by processing data
 Reveals the meaning of data
 Enables knowledge creation
 Should be accurate, relevant, and
timely to enable good decision
making
11
Suppose that a university tracks data on faculty members for
reporting to accrediting bodies.
• To get the data for each faculty member into the database,
 you would provide a screen to allow for
 convenient data entry, complete with
o drop-down lists,
o combo boxes,
o option buttons, and
o other data-entry validation controls.
Data constitutes the building blocks of information.
Information is produced by processing data.
Information is used to reveal the meaning of data.
Accurate, relevant, and timely information is the key to good decision
making.
Good decision making is the key to organizational survival in a global
environment.
Data: Extraction from PSAU website
Keep in mind that raw data must be properly formatted
for storage, processing, and presentation.
Dates might be stored in Julian calendar formats within the database,
but displayed in a variety of formats, such as day-month-year or
month/day/year, for different purposes.
Respondents’ yes/no responses might need to be converted to a Y/N or
0/1 format for data storage.
More complex formatting is required when working with
complex data types, such as sounds, videos, or images.
Key Learnt Points
 Data constitutes the building blocks
of information.
 Information is produced by
processing data.
 Information is used to reveal the
meaning of data.
 Accurate, relevant, and timely
information is the key to good
decision making.
 Good decision making is the key to
organizational survival in a global
environment.
A key characteristic of knowledge is
that “new” knowledge can be
derived from “old” knowledge.
Database
 Shared, integrated computer
structure that stores a collection of:
 End-user data - Raw facts of interest to end
user
 Metadata: Data about data, which the end-
user data are integrated and managed
 Describe data characteristics and
relationships
17
FIGURE 1.11: ILLUSTRATING METADATA WITH
MICROSOFT SQL SERVER EXPRESS
Database management system (DBMS)
 Collection of programs
 Manages the database structure
 Controls access to data stored in the database
Role of the DBMS
 Intermediary between the user and
the database
 Enables data to be shared
 Presents the end user with an
integrated view of the data
 Receives and translates application
requests into operations required to
fulfill the requests
 Hides database’s internal
complexity from the application
programs and users
20
Figure 1.2 - The DBMS Manages the Interaction
between the End User and the Database
21
Advantages of the DBMS
• Better data integration and less data inconsistency
– Data inconsistency: Different versions of the same data appear in different places
– Reducing Data Redundancy
• Increased end-user productivity
• Searching and retrieving of data is very easy in DBMS systems.
• Improved:
 Data sharing
 Data security
 Data access
 Data Concurrency: In DBMS, Data are stored in one or more servers in the
network and that there is some software locking mechanism that prevents the
same set of data from being changed by two people at the same time.
 Database Management System can automatically takes care of backup and
recovery
 Decision making
 Data quality: Promoting accuracy, validity, and timeliness of data
22
Difference b/t DBMS & RDBMS
external
Choosing the Right Database [1/2]
i. First and foremost, you must understand how your
database will be used under the scope of the
requirements of your project.
ii. With one kind of database, you will only
fulfill some of your database needs.
iii. Performance comes only after you’ve successfully
matched all your database needs individually with
the right kind of database.
iv. There’s always a tradeoff between consistency,
availability and partition tolerance.
Choosing the Right Database [2/2]
Types of Databases
 Single-user database: Supports one user at a time
 Desktop database: Runs on PC
 Multiuser database: Supports multiple users at the
same time
 Workgroup databases: Supports a small number of
users or a specific department
 Enterprise database: Supports many users across
many departments
26
Types of Databases
 Centralized database: Data is located at a single
site
 Distributed database: Data is distributed across
different sites
 Cloud database: Created and maintained using
cloud data services that provide defined
performance measures for the database
27
Types of Databases
 General-purpose databases: Contains a wide
variety of data used in multiple disciplines
 Discipline-specific databases: Contains data
focused on specific subject areas
28
Types of Databases
 Operational database: Designed to support a
company’s day-to-day operations
 Analytical database: Stores historical data and
business metrics used exclusively for tactical or
strategic decision making
 Data warehouse: Stores data in a format optimized for
decision support
29
extract, transform, load
Types of Databases
 Online analytical processing (OLAP)
 Enable retrieving, processing, and modeling data from the
data warehouse
 Business intelligence: Captures and processes business
data to generate information that support decision
making 30
Types of Databases
 Unstructured data: It exists in their
original state
 Structured data: It results from
formatting
 Structure is applied based on type of
processing to be performed
 Semistructured data: Processed to some
extent
 Extensible Markup Language (XML)
 Represents data elements in textual format
31
Database Design
 Focuses on the design of the database structure that will be used to store
and manage end-user data
 Well-designed database
 Facilitates data management
 Generates accurate and valuable information
 Poorly designed database causes difficult-to-trace errors
33
Evolution of File System Data Processing
File System Redux: Modern End-User Productivity Tools
Includes spreadsheet programs such as Microsoft Excel
Computerized File Systems
Data processing (DP) specialist: Created a computer-based system that would track data
and produce required reports
Manual File Systems
Accomplished through a system of file folders and filing cabinets
36
Table 1.2 - Basic File Terminology
38
Figure - A Simple File System
39
Problems with File System Data Processing
40
Lengthy development times
Difficulty of getting quick answers
Complex system administration
Lack of security and limited data sharing
Extensive programming
Structural and Data Dependence
 Structural dependence: Access to a file is dependent
on its own structure
 All file system programs are modified to conform to a
new file structure
 Structural independence: File structure is changed
without affecting the application’s ability to access
the data
41
Structural and Data Dependence
 Data dependence
 Data access changes when data storage characteristics
change
 Data independence
 Data storage characteristics is changed without
affecting the program’s ability to access the data
 Practical significance of data dependence is
difference between logical and physical format
42
Data Redundancy
 Unnecessarily storing same data at different places
 Islands of information: Scattered data locations
 Increases the probability of having different versions of
the same data
43
Data Redundancy Implications
 Poor data security
 Data inconsistency
 Increased likelihood of data-entry errors when
complex entries are made in different files
 Data anomaly: Develops when not all of the required
changes in the redundant data are made successfully
44
Types of Data Anomaly
Update Anomalies
Insertion Anomalies
Deletion Anomalies
45
Lack of Design and Data-Modeling Skills
 Evident despite the availability of multiple personal
productivity tools being available
 Data-modeling skills is vital in the data design
process
 Good data modeling facilitates communication
between the designer, user, and the developer
46
Database Systems
 Logically related data stored in a single logical data
repository
 Physically distributed among multiple storage facilities
• DBMS eliminates most of file system’s problems
 Current generation DBMS software:
– Stores data structures, relationships between structures, and
access paths
– Defines, stores, and manages all access paths and
components
47
Figure 1.9 - Contrasting Database and File Systems [1/2]
48
Figure 1.10 - The Database System Environment [1/2]
49
Figure 1.10 - The Database System Environment [2/2]
DBMS Functions
51
Data dictionary management
• Data dictionary: Stores definitions of the data elements and their relationships
Data storage management
• Performance tuning: Ensures efficient performance of the database in terms of
storage and access speed
Data transformation and presentation
• Transforms entered data to conform to required data structures
Security management
• Enforces user security and data privacy
DBMS Functions
53
Multiuser access control
• Sophisticated algorithms ensure that multiple users can access the
database concurrently without compromising its integrity
Backup and recovery management
• Enables recovery of the database after a failure
Data integrity management
• Minimizes redundancy and maximizes consistency
DBMS Functions
54
Database access languages and application programming
interfaces
• Query language: Lets the user specify what must be done without having
to specify how
• Structured Query Language (SQL): De facto query language and data
access standard supported by the majority of DBMS vendors
Database communication interfaces
• Accept end-user requests via multiple, different network
environments
Disadvantages of Database Systems
55
Increased costs
Management complexity
Maintaining currency
Vendor dependence
Frequent upgrade/replacement cycles
56
Career Outcome

Mais conteúdo relacionado

Mais procurados

Fundamentals of Database ppt ch03
Fundamentals of Database ppt ch03Fundamentals of Database ppt ch03
Fundamentals of Database ppt ch03
Jotham Gadot
 
Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01
Jotham Gadot
 
Logical database design and the relational model(database)
Logical database design and the relational model(database)Logical database design and the relational model(database)
Logical database design and the relational model(database)
welcometofacebook
 
Physical database design(database)
Physical database design(database)Physical database design(database)
Physical database design(database)
welcometofacebook
 

Mais procurados (20)

Advance database systems (part 1)
Advance database systems (part 1)Advance database systems (part 1)
Advance database systems (part 1)
 
INTRODUCTION TO DATABASE
INTRODUCTION TO DATABASEINTRODUCTION TO DATABASE
INTRODUCTION TO DATABASE
 
Database
DatabaseDatabase
Database
 
The relational database model
The relational database modelThe relational database model
The relational database model
 
Physical Database Design & Performance
Physical Database Design & PerformancePhysical Database Design & Performance
Physical Database Design & Performance
 
Relational data base management system (Unit 1)
Relational data base management system (Unit 1)Relational data base management system (Unit 1)
Relational data base management system (Unit 1)
 
Database design
Database designDatabase design
Database design
 
Relational Database Design
Relational Database DesignRelational Database Design
Relational Database Design
 
rdbms-notes
rdbms-notesrdbms-notes
rdbms-notes
 
BIS06 Physical Database Models
BIS06 Physical Database ModelsBIS06 Physical Database Models
BIS06 Physical Database Models
 
Fundamentals of Database ppt ch03
Fundamentals of Database ppt ch03Fundamentals of Database ppt ch03
Fundamentals of Database ppt ch03
 
DBMS Bascis
DBMS BascisDBMS Bascis
DBMS Bascis
 
Distributed database system
Distributed database systemDistributed database system
Distributed database system
 
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
 
Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01
 
Logical database design and the relational model(database)
Logical database design and the relational model(database)Logical database design and the relational model(database)
Logical database design and the relational model(database)
 
Tutorial On Database Management System
Tutorial On Database Management SystemTutorial On Database Management System
Tutorial On Database Management System
 
Physical database design(database)
Physical database design(database)Physical database design(database)
Physical database design(database)
 
Introduction to oracle database (basic concepts)
Introduction to oracle database (basic concepts)Introduction to oracle database (basic concepts)
Introduction to oracle database (basic concepts)
 
Unit01 dbms
Unit01 dbmsUnit01 dbms
Unit01 dbms
 

Semelhante a Database Systems

Chap1-Introduction to database systems.ppt
Chap1-Introduction to database systems.pptChap1-Introduction to database systems.ppt
Chap1-Introduction to database systems.ppt
LisaMalar
 
DBS Theory Week 1 including relationships and relational database
DBS Theory Week 1 including relationships and relational databaseDBS Theory Week 1 including relationships and relational database
DBS Theory Week 1 including relationships and relational database
walaahuluu
 
D I T211 Chapter 1
D I T211    Chapter 1D I T211    Chapter 1
D I T211 Chapter 1
askme
 

Semelhante a Database Systems (20)

Database Systems
Database SystemsDatabase Systems
Database Systems
 
Db lecture 1
Db lecture 1Db lecture 1
Db lecture 1
 
Chap1-Introduction to database systems.ppt
Chap1-Introduction to database systems.pptChap1-Introduction to database systems.ppt
Chap1-Introduction to database systems.ppt
 
DBS Theory Week 1 including relationships and relational database
DBS Theory Week 1 including relationships and relational databaseDBS Theory Week 1 including relationships and relational database
DBS Theory Week 1 including relationships and relational database
 
DBMS-1.pptx
DBMS-1.pptxDBMS-1.pptx
DBMS-1.pptx
 
1. Chapter One.pdf
1. Chapter One.pdf1. Chapter One.pdf
1. Chapter One.pdf
 
Dbms Useful PPT
Dbms Useful PPTDbms Useful PPT
Dbms Useful PPT
 
database ppt(2)
database ppt(2)database ppt(2)
database ppt(2)
 
data base unit one on what is databases.pptx
data base unit one on what is databases.pptxdata base unit one on what is databases.pptx
data base unit one on what is databases.pptx
 
MS-CIT Unit 9.pptx
MS-CIT Unit 9.pptxMS-CIT Unit 9.pptx
MS-CIT Unit 9.pptx
 
Database, Lecture-1.ppt
Database, Lecture-1.pptDatabase, Lecture-1.ppt
Database, Lecture-1.ppt
 
Database & dbms
Database & dbmsDatabase & dbms
Database & dbms
 
DBMS and its Models
DBMS and its ModelsDBMS and its Models
DBMS and its Models
 
D I T211 Chapter 1
D I T211    Chapter 1D I T211    Chapter 1
D I T211 Chapter 1
 
Database Management System ( Dbms )
Database Management System ( Dbms )Database Management System ( Dbms )
Database Management System ( Dbms )
 
Chapter 1 Database Systems.pptx
Chapter 1 Database Systems.pptxChapter 1 Database Systems.pptx
Chapter 1 Database Systems.pptx
 
Dbms models
Dbms modelsDbms models
Dbms models
 
Database Systems-Lec 1.ppt
Database Systems-Lec 1.pptDatabase Systems-Lec 1.ppt
Database Systems-Lec 1.ppt
 
Complete dbms notes
Complete dbms notesComplete dbms notes
Complete dbms notes
 
Data Base Management Systems
Data Base Management SystemsData Base Management Systems
Data Base Management Systems
 

Mais de Usman Tariq

Io t in 5g aeon iccais-riyadh
Io t in 5g aeon iccais-riyadhIo t in 5g aeon iccais-riyadh
Io t in 5g aeon iccais-riyadh
Usman Tariq
 

Mais de Usman Tariq (16)

Distributed database management systems
Distributed database management systemsDistributed database management systems
Distributed database management systems
 
Database performance tuning and query optimization
Database performance tuning and query optimizationDatabase performance tuning and query optimization
Database performance tuning and query optimization
 
Io t in 5g aeon iccais-riyadh
Io t in 5g aeon iccais-riyadhIo t in 5g aeon iccais-riyadh
Io t in 5g aeon iccais-riyadh
 
Data science Prospects
Data science Prospects Data science Prospects
Data science Prospects
 
e-Marketing Research
e-Marketing Researche-Marketing Research
e-Marketing Research
 
ERP modules and business software package
ERP modules and business software packageERP modules and business software package
ERP modules and business software package
 
ERP Implementation Challenges and Package Selection
ERP Implementation Challenges and Package SelectionERP Implementation Challenges and Package Selection
ERP Implementation Challenges and Package Selection
 
Global E-Markets 3.0
Global E-Markets 3.0Global E-Markets 3.0
Global E-Markets 3.0
 
ERP Planning and Implementation
ERP Planning and ImplementationERP Planning and Implementation
ERP Planning and Implementation
 
Customer Relationship Management (CRM)
Customer Relationship Management (CRM)Customer Relationship Management (CRM)
Customer Relationship Management (CRM)
 
The e-Marketing Plan
The e-Marketing PlanThe e-Marketing Plan
The e-Marketing Plan
 
Strategic E-Marketing and Performance Metrics
Strategic E-Marketing and Performance MetricsStrategic E-Marketing and Performance Metrics
Strategic E-Marketing and Performance Metrics
 
What is an erp system?
What is an erp system?What is an erp system?
What is an erp system?
 
E-Marketing (past, present, and future)
E-Marketing (past, present, and future)E-Marketing (past, present, and future)
E-Marketing (past, present, and future)
 
ERP and related technology
ERP and related technology ERP and related technology
ERP and related technology
 
An overview - Enterprise
An overview - EnterpriseAn overview - Enterprise
An overview - Enterprise
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Database Systems

  • 1.  Chapter 1  Database Systems ISBN-13: 978-1337627900 ISBN-10: 1337627909 Buy Book Amazon.com URL Modified by: Usman Tariq, PhD Associate Professor, PSAU Office ☎ 00966 11 588 8386
  • 2. Learning Objectives  In this chapter, you will learn: • The difference between data and information in the context of Data Science • What a database is, the various types of databases, and why they are valuable assets for decision making • The importance of database design • How modern databases evolved from file systems 2
  • 3. Learning Objectives  In this chapter, you will learn: • About flaws in file system data management • The main components of the database system • The main functions of a database management system (DBMS) 3
  • 4.
  • 5. Case Study: e-Commerce, Database & Data Science external
  • 6. Understanding data science and Database [1/2] external
  • 7. Understanding data science and Database [2/2] external
  • 8.  Telecommunications companies, such as Sprint and AT&T (in USA), are known to have systems that keep data on trillions of phone calls, with new data being added to the system at speeds up to 70,000 calls per second!  Not only do these companies have to store and manage immense collections of data, they have to be able to find any given fact in that data quickly
  • 9.  Google receives over 63,000 searches per second on any given day. That's the average figure of how many people use Google a day, which translates into at least 2 trillion searches per year, 3.8 million searches per minute, 228 million searches per hour, and 5.6 billion searches per day.
  • 10. How can businesses (retail, service, Govt. etc.) process this much data? How can they store it all, and then quickly retrieve just the facts that decision makers want to know, just when they want to know it? How many relationships are in your data? What is the level of complexity in your data? How often do the data change? How often does your application query the data? How often does your application query the relationship underlying the data? How often do your users update the data? How often do your users update the logic in the data? How critical is your Application in a disaster scenario?
  • 11. Data vs. Information Data Information  Raw facts  Raw data - Not yet been processed to reveal the meaning  Building blocks of information  Data management  Generation, storage, and retrieval of data  Produced by processing data  Reveals the meaning of data  Enables knowledge creation  Should be accurate, relevant, and timely to enable good decision making 11
  • 12. Suppose that a university tracks data on faculty members for reporting to accrediting bodies. • To get the data for each faculty member into the database,  you would provide a screen to allow for  convenient data entry, complete with o drop-down lists, o combo boxes, o option buttons, and o other data-entry validation controls. Data constitutes the building blocks of information. Information is produced by processing data. Information is used to reveal the meaning of data. Accurate, relevant, and timely information is the key to good decision making. Good decision making is the key to organizational survival in a global environment.
  • 13.
  • 14. Data: Extraction from PSAU website
  • 15. Keep in mind that raw data must be properly formatted for storage, processing, and presentation. Dates might be stored in Julian calendar formats within the database, but displayed in a variety of formats, such as day-month-year or month/day/year, for different purposes. Respondents’ yes/no responses might need to be converted to a Y/N or 0/1 format for data storage. More complex formatting is required when working with complex data types, such as sounds, videos, or images.
  • 16. Key Learnt Points  Data constitutes the building blocks of information.  Information is produced by processing data.  Information is used to reveal the meaning of data.  Accurate, relevant, and timely information is the key to good decision making.  Good decision making is the key to organizational survival in a global environment. A key characteristic of knowledge is that “new” knowledge can be derived from “old” knowledge.
  • 17. Database  Shared, integrated computer structure that stores a collection of:  End-user data - Raw facts of interest to end user  Metadata: Data about data, which the end- user data are integrated and managed  Describe data characteristics and relationships 17
  • 18. FIGURE 1.11: ILLUSTRATING METADATA WITH MICROSOFT SQL SERVER EXPRESS
  • 19. Database management system (DBMS)  Collection of programs  Manages the database structure  Controls access to data stored in the database
  • 20. Role of the DBMS  Intermediary between the user and the database  Enables data to be shared  Presents the end user with an integrated view of the data  Receives and translates application requests into operations required to fulfill the requests  Hides database’s internal complexity from the application programs and users 20
  • 21. Figure 1.2 - The DBMS Manages the Interaction between the End User and the Database 21
  • 22. Advantages of the DBMS • Better data integration and less data inconsistency – Data inconsistency: Different versions of the same data appear in different places – Reducing Data Redundancy • Increased end-user productivity • Searching and retrieving of data is very easy in DBMS systems. • Improved:  Data sharing  Data security  Data access  Data Concurrency: In DBMS, Data are stored in one or more servers in the network and that there is some software locking mechanism that prevents the same set of data from being changed by two people at the same time.  Database Management System can automatically takes care of backup and recovery  Decision making  Data quality: Promoting accuracy, validity, and timeliness of data 22
  • 23. Difference b/t DBMS & RDBMS external
  • 24. Choosing the Right Database [1/2] i. First and foremost, you must understand how your database will be used under the scope of the requirements of your project. ii. With one kind of database, you will only fulfill some of your database needs. iii. Performance comes only after you’ve successfully matched all your database needs individually with the right kind of database. iv. There’s always a tradeoff between consistency, availability and partition tolerance.
  • 25. Choosing the Right Database [2/2]
  • 26. Types of Databases  Single-user database: Supports one user at a time  Desktop database: Runs on PC  Multiuser database: Supports multiple users at the same time  Workgroup databases: Supports a small number of users or a specific department  Enterprise database: Supports many users across many departments 26
  • 27. Types of Databases  Centralized database: Data is located at a single site  Distributed database: Data is distributed across different sites  Cloud database: Created and maintained using cloud data services that provide defined performance measures for the database 27
  • 28. Types of Databases  General-purpose databases: Contains a wide variety of data used in multiple disciplines  Discipline-specific databases: Contains data focused on specific subject areas 28
  • 29. Types of Databases  Operational database: Designed to support a company’s day-to-day operations  Analytical database: Stores historical data and business metrics used exclusively for tactical or strategic decision making  Data warehouse: Stores data in a format optimized for decision support 29 extract, transform, load
  • 30. Types of Databases  Online analytical processing (OLAP)  Enable retrieving, processing, and modeling data from the data warehouse  Business intelligence: Captures and processes business data to generate information that support decision making 30
  • 31. Types of Databases  Unstructured data: It exists in their original state  Structured data: It results from formatting  Structure is applied based on type of processing to be performed  Semistructured data: Processed to some extent  Extensible Markup Language (XML)  Represents data elements in textual format 31
  • 32.
  • 33. Database Design  Focuses on the design of the database structure that will be used to store and manage end-user data  Well-designed database  Facilitates data management  Generates accurate and valuable information  Poorly designed database causes difficult-to-trace errors 33
  • 34.
  • 35.
  • 36. Evolution of File System Data Processing File System Redux: Modern End-User Productivity Tools Includes spreadsheet programs such as Microsoft Excel Computerized File Systems Data processing (DP) specialist: Created a computer-based system that would track data and produce required reports Manual File Systems Accomplished through a system of file folders and filing cabinets 36
  • 37.
  • 38. Table 1.2 - Basic File Terminology 38
  • 39. Figure - A Simple File System 39
  • 40. Problems with File System Data Processing 40 Lengthy development times Difficulty of getting quick answers Complex system administration Lack of security and limited data sharing Extensive programming
  • 41. Structural and Data Dependence  Structural dependence: Access to a file is dependent on its own structure  All file system programs are modified to conform to a new file structure  Structural independence: File structure is changed without affecting the application’s ability to access the data 41
  • 42. Structural and Data Dependence  Data dependence  Data access changes when data storage characteristics change  Data independence  Data storage characteristics is changed without affecting the program’s ability to access the data  Practical significance of data dependence is difference between logical and physical format 42
  • 43. Data Redundancy  Unnecessarily storing same data at different places  Islands of information: Scattered data locations  Increases the probability of having different versions of the same data 43
  • 44. Data Redundancy Implications  Poor data security  Data inconsistency  Increased likelihood of data-entry errors when complex entries are made in different files  Data anomaly: Develops when not all of the required changes in the redundant data are made successfully 44
  • 45. Types of Data Anomaly Update Anomalies Insertion Anomalies Deletion Anomalies 45
  • 46. Lack of Design and Data-Modeling Skills  Evident despite the availability of multiple personal productivity tools being available  Data-modeling skills is vital in the data design process  Good data modeling facilitates communication between the designer, user, and the developer 46
  • 47. Database Systems  Logically related data stored in a single logical data repository  Physically distributed among multiple storage facilities • DBMS eliminates most of file system’s problems  Current generation DBMS software: – Stores data structures, relationships between structures, and access paths – Defines, stores, and manages all access paths and components 47
  • 48. Figure 1.9 - Contrasting Database and File Systems [1/2] 48
  • 49. Figure 1.10 - The Database System Environment [1/2] 49
  • 50. Figure 1.10 - The Database System Environment [2/2]
  • 51. DBMS Functions 51 Data dictionary management • Data dictionary: Stores definitions of the data elements and their relationships Data storage management • Performance tuning: Ensures efficient performance of the database in terms of storage and access speed Data transformation and presentation • Transforms entered data to conform to required data structures Security management • Enforces user security and data privacy
  • 52.
  • 53. DBMS Functions 53 Multiuser access control • Sophisticated algorithms ensure that multiple users can access the database concurrently without compromising its integrity Backup and recovery management • Enables recovery of the database after a failure Data integrity management • Minimizes redundancy and maximizes consistency
  • 54. DBMS Functions 54 Database access languages and application programming interfaces • Query language: Lets the user specify what must be done without having to specify how • Structured Query Language (SQL): De facto query language and data access standard supported by the majority of DBMS vendors Database communication interfaces • Accept end-user requests via multiple, different network environments
  • 55. Disadvantages of Database Systems 55 Increased costs Management complexity Maintaining currency Vendor dependence Frequent upgrade/replacement cycles
  • 56. 56