MySQL talk at J and Beyond 2014 about nested data sets, how to use them most efficiently, and other performance tips such as string lookup and indexing options.
Normalization is a logical database design method that minimizes data redundancy and reduces design flaws. It involves applying normal forms like 1NF, 2NF, and 3NF to break large tables into smaller subsets. The normal forms improve data integrity by preventing anomalies like insertion, update, and deletion anomalies. Applying the normal forms can result in relations that are in first, second, and third normal form, but additional steps may be needed to attain Boyce-Codd normal form, which further reduces anomalies from overlapping candidate keys.
The document discusses building a real-time search engine for log data. It describes using Flume to collect streaming log data and write it to HDFS files. Fastcatsearch indexes the HDFS files in real-time by creating index segments, merging segments, and removing outdated segments to make data searchable in real-time. The system aims to provide fast indexing and querying of large and continuous log data streams like Splunk.
Rakuten Technology Conference 2017 A Distributed SQL Database For Data Analy...Rakuten Group, Inc.
Astra is a distributed SQL database for data analysis and prediction. We're aiming to achieve near real-time data analysis and to deliver the components of a Data Lake as a Service that contains it. Another feature of Astra is integration with machine learning to support many kinds of data analysis.
Video and slides synchronized, mp3 and slide download available at URL http://bit.ly/1nyhaC6.
Nathan Marz discusses building NoSQL-based data systems that are scalable and easy to reason about. Filmed at qconlondon.com.
Nathan Marz is the creator of many open source projects which are relied upon by over 50 companies around the world, including Cascalog and Storm. Nathan is also working on a book for Manning publications entitled "Big Data: principles and best practices of scalable realtime data systems". Nathan was previously the lead engineer at BackType before being acquired by Twitter in 2011.
NoSQL Couchbase Lite & BigData HPCC SystemsFujio Turner
Mobile data is becoming the new source for data. Managing data in the mobile devices has become easier with NoSQL Couchbase Lite mobile database. Making sense, analyzing, scaling to exabytes has also become easier with LexisNexis Big Data platform HPCC Systems.
Sasi, cassandra on the full text search ride At Voxxed Day Belgrade 2016Duyhai Doan
The document discusses Apache Cassandra's SASI (SSTable Attached Secondary Index). It provides a 5 minute introduction to Cassandra, introduces SASI and how it follows the SSTable lifecycle, describes how SASI works at the cluster level for distributed queries and indexing, and details the local read/write process including data structures and query planning. Some benchmarks are shown for full table scans on a large dataset using SASI with Spark. The key advantages and use cases for SASI are discussed along with its limitations compared to dedicated search engines.
Simple Nested Sets and some other DB optimizationsEli Aschkenasy
The document discusses nested sets and database design principles for hierarchical data structures. It covers topics such as calculating the total number of nodes, determining if a node is a leaf node, optimizing queries for leaf nodes and subtrees, and finding the path to a node. Examples of SQL queries are provided to demonstrate naive and optimized implementations.
This document discusses using databases in Android apps. It provides an overview of SQLite, the database used for Android, which is a stripped-down version of SQL databases. It describes the basic components of databases, including tables, fields, records, keys and relationships. It also gives examples of how to design a database using data dictionaries, normalization, and data flow diagrams. The next steps involve using SQL to interact with and query the database in an Android app.
This document discusses the concepts of locality of reference and anti-locality of reference in computing. It begins by describing the physical architecture of processors, cores, and caches. It then discusses the logical architecture of processes, threads, and virtual memory spaces. It explains how data moves between the physical and logical layers, and how threads accessing common data can cause cache coherency penalties if not designed carefully. The document emphasizes that locality of reference, where data is accessed from the same cache repeatedly, improves performance, while anti-locality of reference, where data is accessed from different caches, can hurt performance due to cache misses and coherency issues. Careful design is needed to minimize anti-locality and its penalties.
This document discusses index tuning in Microsoft SQL Server. It provides an overview of index types including clustered and nonclustered indexes. It also discusses concepts like covering indexes, unique indexes, filtered indexes and query execution plans. The document aims to help users understand how to think about performance tuning from an index perspective and demystify common index tuning myths. It provides best practices for index tuning and maintaining performance in SQL Server.
This document provides troubleshooting tips for Ex Libris' Primo discovery and delivery system. It begins with general tips such as checking that changes have been saved and deployed correctly. It then addresses specific cases involving issues like records not appearing properly, incorrect labels, availability mismatches, and search/sort problems. The document emphasizes examining the PNX record format, understanding de-duplication and FRBR rules, and checking for potential problems in related systems like the ILS, SFX, or metadata. It concludes by noting the importance of timing with publishing pipes and system performance.
OAP: Optimized Analytics Package for Spark Platform with Daoyuan Wang and Yua...Databricks
Spark SQL is one of the most popular components in big data warehouse for SQL queries in batch mode, and it allows user to process data from various data sources in a highly efficient way. However, Spark SQL is a general purpose SQL engine and not well designed for ad hoc queries. Intel invented an Apache Spark data source plugin called Spinach for fulfilling such requirements, by leveraging user-customized indices and fine-grained data cache mechanisms.
To be more specific, Spinach defines a new Parquet-like data storage format, offering a fine-grained hierarchical cache mechanism in the unit of “Fiber” in memory. Even existing Parquet or ORC data files can be loaded using corresponding adaptors. Data can be cached in off-heap memory to boost data loading. What’s more, Spinach has extended the Spark SQL DDL, to allow users to define the customized indices based on relation. Currently, B+ tree and bloom filter are the first two types of indices supported. Last but not least, since Spinach resides in the process of Spark executor, there’s no extra effort in deployment. All you need to do is to pick Spinach from Spark packages when launching the Spark SQL.
Spinach has been imported in Baidu’s production environment since Q4 2016. It helps several teams migrating their regular data analysis tasks from Hive or MR jobs to ad-hoc queries. In Baidu search ads system FengChao, data engineers analyze advertising effectiveness based on several TBs data of display and click logs every day. Spinach brings a 5x boost compared to original Spark SQL (version 2.1), especially in the scenario of complex search and large data volume. It optimizes the average search cost from minutes to seconds, while brings only 3% data size increase for adding a single index.
Nearly every application uses some sort of data storage. Proper data structure can lead to increased performance, reduced application complexity, and ensure data integrity. Foreign keys, indexes, and correct data types truly are your best friends when you respect them and use them for the correct purposes. Structuring data to be normalized and with the correct data types can lead to significant performance increases. Learn how to structure your tables to achieve normalization, performance, and integrity, by building a database from the ground up during this tutorial.
This document provides an introduction to Elasticsearch, covering the basics, concepts, data structure, inverted index, REST API, bulk API, percolator, Java integration, and topics not covered. It discusses how Elasticsearch is a document-oriented search engine that allows indexing and searching of JSON documents without schemas. Documents are distributed across shards and replicas for horizontal scaling and high availability. The REST API and query DSL allow full-text search and filtering of documents.
Cassandra introduction apache con 2014 budapestDuyhai Doan
This document provides an introduction and summary of Cassandra presented by Duy Hai Doan. It discusses Cassandra's history as a NoSQL database created at Facebook and open sourced in 2008. The key architecture of Cassandra including its data distribution across nodes, replication for failure tolerance, and consistency models for reads and writes is summarized.
Scaling Machine Learning Feature Engineering in Apache Spark at FacebookDatabricks
Machine Learning feature engineering is one of the most critical workloads on Spark at Facebook and serves as a means of improving the quality of each of the prediction models we have in production. Over the last year, we’ve added several features in Spark core/SQL to add first class support for Feature Injection and Feature Reaping in Spark. Feature Injection is an important prerequisite to (offline) ML training where the base features are injected/aligned with new/experimental features, with the goal to improve model performance over time. From a query engine’s perspective, this can be thought of as a LEFT OUTER join between the base training table and the feature table which, if implemented naively, could get extremely expensive. As part of this work, we added native support for writing indexed/aligned tables in Spark, wherein IF the data in the base table and the injected feature can be aligned during writes, the join itself can be performed inexpensively.
The document discusses normalization in databases. It defines normalization as removing redundant data from tables to improve storage efficiency, data integrity, and scalability. The document outlines the various normal forms including 1NF, 2NF, 3NF, BCNF, and higher normal forms. It provides examples to illustrate different normal forms. Advantages of normalization include reduced database size and better performance, while disadvantages are more tables to join and codes instead of real data.
Everyone dreams of being ‘Web Scale’, but we start out small. We — most of us — don’t launch a service and expect it to serve millions of requests from Day 1. This means that we don’t think about the ways in which our stack will blow up when the number of requests does start climbing. This talk lists simple patterns and checks that Development and Operations teams should implement from Day 1 in order to ensure a robust distributed system.
What You Need To Know About The Top Database TrendsDell World
The last 5 years have seen transformative changes in both personal and enterprise technologies. Many of these changes have been driven by or are driving paradigm shifts in database technologies and information systems. These include trends such as engineered systems including Exadata, "Big Data" technologies such as Hadoop ,"NoSQL" databases, SSDs, in-memory and columnar technologies. In this presentation we’ll review these big trends and describe how they are changing the database landscape and influencing the career prospects for database professionals.
External Master Data in Alfresco: Integrating and Keeping Metadata Consistent...ITD Systems
Real life content is always tightly integrated with master data. Reference data to be used for the content is usually stored in a third-party enterprise system (or even several different systems) and should be consumed by Alfresco.
This document discusses how Splunk can be used to analyze organizational data and provide value to IT teams. Some key points made include:
- Splunk allows users to search and analyze large amounts of machine data in real-time to gain insights from audit trails, errors, user behavior, and other sources.
- It provides alerting capabilities to monitor for critical failures and configuration issues.
- As organizational data volumes grow, Splunk is well-suited to handle "big data" using its distributed, scalable architecture based on map-reduce techniques.
Normalization is the process of removing redundant data from your tables to improve storage efficiency, data integrity, and scalability.
Normalization generally involves splitting existing tables into multiple ones, which must be re-joined or linked each time a query is issued.
Why normalization?
The relation derived from the user view or data store will most likely be unnormalized.
The problem usually happens when an existing system uses unstructured files, e.g. in MS Excel.
The document discusses the design of an analytics database to aggregate data from multiple sources, move the aggregated data to frontend databases, and serve queries efficiently through portals. It addresses partitioning the backend warehouse for writes, replicating data to secondaries, moving data to partitioned frontend databases while serving queries, and optimizing queries and indexes at the frontend. Table partitioning is recommended for the frontend to allow efficient data insertion, removal and query serving while mitigating the impact of conflicting I/O operations.
MySQL 8 introduces support for ANSI SQL recursive queries with common table expressions, a powerful method for working with recursive data references. Until now, MySQL application developers have had to use workarounds for hierarchical data relationships. It's time to write SQL queries in a more standardized way, and be compatible with other brands of SQL implementations. But as always, the bottom line is: how does it perform? This presentation will briefly describe how to use recursive queries, and then test the performance and scalability of those queries against other solutions for hierarchical queries.
Getting Started with Test-Driven Development at Longhorn PHP 2023Scott Keck-Warren
Test-driven development (TDD) is a software development process where test cases are written before code to validate requirements. The TDD process involves short cycles of adding a test, making the test fail, writing code to pass the test, and refactoring code. Automated tests provide confidence to refactor and change code without breaking functionality. Unit tests isolate and test individual code units while feature tests simulate how a user interacts with the application. Code coverage metrics help ensure tests cover enough of the codebase, with higher coverage percentages generally indicating better test quality.
Getting Started with Test-Driven Development at Longhorn PHP 2023Scott Keck-Warren
Test-driven development (TDD) is a software development process where test cases are written before code to validate requirements. The TDD process involves short cycles of adding a test, making it fail, making it pass, and refactoring code. Using TDD generates an automated test suite that gives developers confidence to refactor and change code quickly. Unit tests validate individual code units in isolation while feature tests validate code as a user would interact with it. Code coverage metrics help ensure tests cover enough of the codebase.
Getting Started with Test-Driven Development at PHPtek 2023Scott Keck-Warren
Scott Keck-Warren gives a presentation on getting started with test-driven development (TDD). He discusses what TDD is, the five phases of the TDD process, and why it is beneficial. He also covers how to use a testing framework like PHPUnit, what code coverage is, and some common pitfalls to avoid like neglecting to run tests or creating tests that are too large or trivial. The presentation aims to provide developers with the essential information needed to understand and implement TDD.
Getting Started with Test-Driven Development at Midwest PHP 2021Scott Keck-Warren
In this presentation, we discussed what Test-Driven Development(TDD) is, how to get started with TDD, work through an example, and discuss how to get started in your application.
Developing a Culture of Quality Code (Midwest PHP 2020)Scott Keck-Warren
This document discusses developing a culture of quality code. It defines quality code as code that is purposeful, maintainable, reliable, efficient, secure and optimized for size. It recommends that individuals focus on techniques like writing clean code, using automated testing and code reviews. It also recommends teams implement processes like requiring testing, conducting code reviews and adopting coding standards. The goal is to improve code quality and maintainability over time by altering both individual and team practices.
18. Scott’s Rules For Database Design
1. Normalize Your Database For Data Deduplication
2. Use The Database Engine to Keep Data Clean
3. Proactively Add Indexes to Keep Queries Performant
20. Users Table
• Email address
• Password
• Active state
• Hire Date
• Listing of previous passwords
• Office Name
• Office City
• Office Zip
21. Users Table
• Email address (string)
• Password (string)
• Active state (string)
• Hire Date (string)
• Listing of previous passwords (string)
• Office Name (string)
• Office City (string)
• Office Zip (string)
24. Normalize Your Database For Data Deduplication
“[T]he process of structuring a relational database in accordance with a series of so-called normal forms in order to reduce data redundancy and improve data integrity.”
-“Database normalization” on Wikipedia
25. Normalize Your Database For Data Deduplication
• UNF: Unnormalized form
• 1NF: First normal form
• 2NF: Second normal form
• 3NF: Third normal form
• EKNF: Elementary key normal form
• BCNF: Boyce–Codd normal form
• 4NF: Fourth normal form
• ETNF: Essential tuple normal form
• 5NF: Fifth normal form
• DKNF: Domain-key normal form
• 6NF: Sixth normal form
26. Normalize Your Database For Data Deduplication
• UNF: Unnormalized form
• 1NF: First normal form
• 2NF: Second normal form
• 3NF: Third normal form
• EKNF: Elementary key normal form
• BCNF: Boyce–Codd normal form
• 4NF: Fourth normal form
• ETNF: Essential tuple normal form
• 5NF: Fifth normal form
• DKNF: Domain-key normal form
• 6NF: Sixth normal form
27. Normalize Your Database For Data Deduplication
• Boyce–Codd Normal Form:
• X should be a superkey for every functional dependency (FD) X -> Y in a given relation.
31. First Normal Form (1NF)
1. The table contains a unique identifier, also called the primary key, that is used to identify the row.
2. Each column contains atomic values (values that can not be broken down)
32. 1NF - users
email | password | active | hire_date | previous_password | office_name | office_phone | office_city | office_zip
alice@example.com | hash1 | 1 | 1/1/2024 | hash1, hash5, hash6 | Main Office | 555-555-5555 | Saginaw | 48609
avery@example.com | NULL | 1 | 8/11/2024 | hash2, hash7, Hash8 | main office | 5555555555 | Saginaw | 48609
scott@example.com | hash3 | 1 | May 11th, 23 | hash3 | Man office | (555)555-5555 | Saginaw | 48609
scott@example.com | hash4 | 1 | Tuesday | hash4 | Main | 555/555/5555 | Saginaw | 48609
33. 1NF - users
• A unique identifier should be:
• Auto-incrementing int
• UUID
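In MySQL, a minimal sketch of that step might look like the following, assuming the users table from the slides (the exact column definition is illustrative; a CHAR(36) UUID column filled by the application is the other common option):
mysql>
alter table users
-- add a surrogate key so every row can be uniquely identified
add column id int unsigned not null auto_increment primary key first;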
34. 1NF - users
id | email | password | active | hire_date | previous_password | office_name | office_phone | office_city | office_zip
1 | alice@example.com | hash1 | 1 | 1/1/2024 | hash1, hash5, hash6 | Main Office | 555-555-5555 | Saginaw | 48609
2 | avery@example.com | NULL | 1 | 8/11/2024 | hash2, hash7, Hash8 | main office | 5555555555 | Saginaw | 48609
3 | scott@example.com | hash3 | 1 | May 11th, 23 | hash3 | Man office | (555)555-5555 | Saginaw | 48609
4 | scott@example.com | hash4 | 1 | Tuesday | hash4 | Main | 555/555/5555 | Saginaw | 48609
35. 1NF - users
id | email | password | active | hire_date | previous_password | office_name | office_phone | office_city | office_zip
1 | alice@example.com | hash1 | 1 | 1/1/2024 | hash1, hash5, hash6 | Main Office | 555-555-5555 | Saginaw | 48609
2 | avery@example.com | NULL | 1 | 8/11/2024 | hash2, hash7, hash8 | main office | 5555555555 | Saginaw | 48609
3 | scott@example.com | hash3 | 1 | May 11th, 23 | hash3 | Man office | (555)555-5555 | Saginaw | 48609
4 | scott@example.com | hash4 | 1 | Tuesday | hash4 | Main | 555/555/5555 | Saginaw | 48609
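The multi-valued previous_password column is what still breaks atomicity, and per the speaker notes the fix is another table. A sketch of what that child table could look like; the table and column names here are assumptions, not taken from the deck:
mysql>
create table previous_passwords (
  -- one row per old password, keyed back to the owning user
  id int unsigned not null auto_increment primary key,
  user_id int unsigned not null,
  password_hash varchar(255) not null
);
Each old multi-value then becomes its own row, linked by user_id.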
42. Second Normal Form (2NF)
1. Is already in 1NF
2. All the non-key columns are dependent on the primary key of the table
43. Second Normal Form (2NF)
id | email | password | active | hire_date | office_name | office_phone | office_city | office_zip
1 | alice@example.com | hash1 | 1 | 1/1/2024 | Main Office | 555-555-5555 | Saginaw | 48609
2 | avery@example.com | NULL | 1 | 8/11/2024 | main office | 5555555555 | Saginaw | 48609
3 | scott@example.com | hash3 | 1 | May 11th, 23 | Man office | (555)555-5555 | Saginaw | 48609
4 | scott@example.com | hash4 | 1 | Tuesday | Main | 555/555/5555 | Saginaw | 48609
44. 2nd - offices
id | name | phone | city | zip
1 | Main Office | 555-555-5555 | Saginaw | 48609
2 | main office | 5555555555 | Saginaw | 48609
3 | Man office | (555)555-5555 | Saginaw | 48609
4 | Main | 555/555/5555 | Saginaw | 48609
45. 2NF - users
id | email | password | active | hire_date | office_name | office_phone | office_city | office_zip
1 | alice@example.com | hash1 | 1 | 1/1/2024 | Main Office | 555-555-5555 | Saginaw | 48609
2 | avery@example.com | NULL | 1 | 8/11/2024 | main office | 5555555555 | Saginaw | 48609
3 | scott@example.com | hash3 | 1 | May 11th, 23 | Man office | (555)555-5555 | Saginaw | 48609
4 | scott@example.com | hash4 | 1 | Tuesday | Main | 555/555/5555 | Saginaw | 48609
46. 2NF - users
id | email | password | active | hire_date | office_name | office_phone | office_city | office_zip | office_id
1 | alice@example.com | hash1 | 1 | 1/1/2024 | Main Office | 555-555-5555 | Saginaw | 48609 | 1
2 | avery@example.com | NULL | 1 | 8/11/2024 | main office | 5555555555 | Saginaw | 48609 | 2
3 | scott@example.com | hash3 | 1 | May 11th, 23 | Man office | (555)555-5555 | Saginaw | 48609 | 3
4 | scott@example.com | hash4 | 1 | Tuesday | Main | 555/555/5555 | Saginaw | 48609 | 4
47. 2NF - users
id | email | password | active | hire_date | office_id
1 | alice@example.com | hash1 | 1 | 1/1/2024 | 1
2 | avery@example.com | NULL | 1 | 8/11/2024 | 2
3 | scott@example.com | hash3 | 1 | May 11th, 23 | 3
4 | scott@example.com | hash4 | 1 | Tuesday | 4
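A hedged sketch of the DDL behind this 2NF split, using the table and column names from the slides (the column types are guesses):
mysql>
create table offices (
  id int unsigned not null auto_increment primary key,
  name varchar(100) not null,
  phone varchar(20) not null,
  city varchar(100) not null,
  zip varchar(10) not null
);
mysql>
alter table users
  -- point each user at an offices row, then drop the duplicated office columns
  add column office_id int unsigned not null,
  drop column office_name,
  drop column office_phone,
  drop column office_city,
  drop column office_zip;
In practice office_id would be backfilled from the old office columns before those columns are dropped.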
49. Third Normal Form (3NF)
1. Is already in 2NF
2. It contains columns that are non-transitively dependent on the primary key
50. 3NF - offices
id | name | phone | city | zip
1 | Main Office | 555-555-5555 | Saginaw | 48609
2 | main office | 5555555555 | Saginaw | 48609
3 | Man office | (555)555-5555 | Saginaw | 48609
4 | Main | 555/555/5555 | Saginaw | 48609
51. 3NF - zips
id | city
48609 | Saginaw
48640 | Midland
48642 | Midland
48901 | Lansing
52. 3NF - zips
id | city | state
48609 | Saginaw | MI
48640 | Midland | MI
48642 | Midland | MI
48901 | Lansing | MI
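A sketch of the same move in SQL, assuming the zip code itself serves as the key of the new table, as the slides imply (types are illustrative):
mysql>
create table zips (
  id varchar(10) not null primary key,
  city varchar(100) not null,
  state char(2) not null
);
mysql>
alter table offices
  -- the old zip column now holds a reference into zips
  change column zip zip_id varchar(10) not null;
Once zips exists, the duplicated city value on offices becomes redundant and could eventually be dropped as well.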
66. Use Correct Column Types
id | email | password | active | hire_date | office_id
1 | alice@example.com | hash1 | 1 | 1/1/2024 | 1
2 | avery@example.com | NULL | 1 | 8/11/2024 | 2
3 | scott@example.com | hash3 | 1 | May 11th, 23 | 3
4 | scott@example.com | hash4 | 1 | Tuesday | 4
5 |  | Hash12 | 2 | 2024-04-01 | 1000
67. Use Correct Column Types
• Numeric: INT, TINYINT, BIGINT, FLOAT, REAL, etc.
• Date/Time: DATE, TIME, DATETIME, etc.
• String: CHAR, VARCHAR, TEXT, etc.
• Binary data types such as: BLOB, etc.
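A sketch of tightening those types on the users table (the definitions are assumed, and values like "Tuesday" and "May 11th, 23" would have to be cleaned up before the conversion will succeed):
mysql>
alter table users
  -- store real dates and a small integer flag instead of free-form strings
  modify column hire_date date,
  modify column active tinyint(1);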
75. Use NOT NULL for Required Fields
mysql>
insert into users
(password)
values
("just a password?");
ERROR 1364 (HY000): Field 'email' doesn't have a default value
76. Use NOT NULL for Required Fields
mysql>
insert into users
(password)
values
("just a password?");
ERROR 1364 (HY000): Field 'email' doesn't have a default value
77. Use NOT NULL for Required Fields
mysql>
insert into users
(email, password)
values
("s@s", "just a password?");
ERROR 1364 (HY000): Field 'active' doesn't have a default value
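Those errors appear because the columns are declared NOT NULL without a default. A sketch of how such constraints could be declared, with assumed column definitions:
mysql>
alter table users
  -- required fields: the engine now rejects rows that omit them
  modify column email varchar(255) not null,
  modify column active tinyint(1) not null;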
85. Use Foreign Keys For References To Other Tables
id | name | phone | city | zip_id
1 | Main Office | 555-555-5555 | Saginaw | 48609
2 | main office | 5555555555 | Saginaw | 48609
3 | Man office | (555)555-5555 | Saginaw | 48609
4 | Main | 555/555/5555 | Saginaw | 48609
86. Use Foreign Keys For References To Other Tables
id | name | phone | city | zip_id
1 | Main Office | 555-555-5555 | Saginaw | 48609
2 | main office | 5555555555 | Saginaw | 48609
3 | Man office | (555)555-5555 | Saginaw | 48609
87. Use Foreign Keys For References To Other Tables
id | name | phone | city | zip_id
1 | Main Office | 555-555-5555 | Saginaw | 48609
2 | main office | 5555555555 | Saginaw | 48609
88. Use Foreign Keys For References To Other Tables
id | name | phone | city | zip_id
1 | Main Office | 555-555-5555 | Saginaw | 48609
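A sketch of the constraint that would have prevented those orphaned references, assuming the users.office_id and offices.id columns shown earlier:
mysql>
alter table users
  add constraint fk_users_office
  foreign key (office_id) references offices (id);
-- MySQL refuses the ALTER while orphaned office_id values exist, and afterwards
-- refuses to delete an office that users still reference (unless an ON DELETE
-- rule such as CASCADE or SET NULL is chosen).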
143. What You Need to Know
1. Normalize Your Database For Data Deduplication
2. Use The Database Engine to Keep Data Clean
3. Proactively Add Indexes to Keep Queries Performant
144. What You Need to Know
1. The table contains a unique identifier, also called the primary key, that is used to identify the row.
2. Each column contains atomic values (values that can not be broken down)
3. All the non-key columns are dependent on the primary key of the table
4. It contains columns that are non-transitively dependent on the primary key
145. What You Need to Know
• Make the DB Work With You
• Correct Column Types
• NOT NULL for Required Fields
• UNIQUE for Unique Values
• Foreign Keys For References To Other Tables
• Triggers For Complex Requirements
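Two short sketches for the last two bullets. The unique key mirrors the email example from the talk; the trigger name and rule are purely illustrative, showing the kind of cross-column requirement a plain column constraint cannot express:
mysql>
alter table users
  add unique key uq_users_email (email);
mysql>
delimiter //
create trigger users_require_hire_date
before insert on users
for each row
begin
  -- illustrative rule: an active user must have a hire_date
  if new.active = 1 and new.hire_date is null then
    signal sqlstate '45000' set message_text = 'Active users need a hire_date';
  end if;
end //
delimiter ;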
146. What You Need to Know
• Use indexes on commonly searched columns
• Start simple
• See recorded talks about how to add
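For example, if email and active are the columns commonly searched together, a composite index is a minimal sketch of the idea, and EXPLAIN shows whether the optimizer actually uses it:
mysql>
alter table users
  add index idx_users_email_active (email, active);
mysql>
explain select * from users where email = 'alice@example.com' and active = 1;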
Ask people for photos
Good morning all!
2 Shocking facts
Anyone else in this boat
Like to think I’m good at working with DBs
Know that there’s always something to learn
Early in my journey: just threw data into it and it spit it back
Might have been magic
Core piece of technology that I don’t understand
<slide>
Didn’t have a more senior level developer who could mentor
So I had to figure it out
Not necessarily a bad thing because that’s how I work best
Push a new feature
users are initially happy
But as usage grows we start finding problems
Angry people
Customers
My boss
Not ideal
Results in me fixing things under distress
Night/weekends
Once in a bathroom at a holiday inn
My goal is to have you learn from my trauma
For those of you who haven’t met me my name is …
Professional PHP Developer for 16 years
// team lead/CTO role for 11 of those 16
Currently Director of Technology at WeCare Connect
Use PHP and mysql for our backend
Also …
That being said My goal for today give you <slide>
These are the rules I give new hires so they can understand our team’s design
So we have to figure it out ourselves
All of these rules exist to prevent bugs or performance problems
Like examples
Today’s example is <slide> from a project
Initial version of this database as it existed when we took over project
Track everything using a string
Only going to talk about the first four forms today as the others are hard to understand and demo
1 and 2 give us a huge bang for our buck, and we start seeing diminishing returns around 3
A table doesn’t meet any of the conditions of normalization
Essentially a spreadsheet
The table contains a unique identifier, also called the primary key, that is used to identify the row.
Make it an auto-incrementing primary key so the database knows how to handle it
Each column contains atomic values (values that can not be broken down)
To solve this we need to create another table
A lot of normalization is fixed with more tables
Now in 1NF
Still a lot of duplication and mismatched data
Review users table
Three sections
Primary key
Second section - all related to that
Third section - not related
First X columns are dependent
Fix? It’s a new table
Create offices
Link our offices table to the users table
Drop all the office columns
2nf
When columns are transitively dependent, one column's data relies on another column through a third column. For example, our offices' city column is dependent on the zip column, which is dependent on the office's id.
To fix this we'll split out the zip in a new table.
As many validation rules as possible
<slide has a bunch>
Not going to prevent lazy me
Right to the DB
This is just hiding future bugs; we want to prevent that
<slide>
Not the other way around
Let’s start with one of the most basic constraints
Looking back at our users table
Still issues: Blank emails, Date problem, Duplicate emails, Deleted Sites Problem
We can and should enforce rules at application level but …
Next thing: weird dates
Want dates in the “correct” format
Right now if someone asks for all the employees hired in 2023 getting that information will be a challenge
Especially the person who starts on Tuesday
List of all the types in mysql
SQL has a ton of types to best fit our needs
Switch this column to a date
Reformat a little and we get consistent values
Now easy to find everyone who started in 2023
Might have marked the field required in the app, but
Show insert missing email
Embrace NOT NULL for required columns
Example insert 2 users with same email and password
Embrace unique constraints
Allows us to specify this column is unique
Good for things we never ever want to see two of; email is the best option
Can specify multiple columns for uniqueness
Example: multi-tenant database could support email address uniqueness per office
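A sketch of that per-office variant, as an alternative to a single unique key on email (the key name is illustrative):
mysql>
alter table users
  -- the same email may appear once per office, but never twice within one office
  add unique key uq_users_office_email (office_id, email);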
Gave users access to clean up offices
So they started deleting the duplicates
Deleted locations so the values don’t match
This table is using a join which is breaking the results
Users are assigned to locations that no longer exist
Users that belong to non-existent offices
Need some way to say what’s valid
Allow us to define the relationship of one column to another table
Performance
Not enough that I ever worry about, but each FK requires lookups
“Magic” according to some developers
Active column can accept any integer value
I also like this for complex requirement that a standard column doesn’t cover
Ex: if a row is one type, different fields must not be null
Indexes In Life
I love to cook
Love to try new recipes
Leftover food from recipe
Now get a neural network to figure out
But could use cookbooks
Option 1
Go through every page looking for matches
Slow as most don’t meet our criteria
Option 2
Go to back of book to the index and look up ingredients
Use that to look up recipes
Much faster
Same
Database is going to look at every row
Fine when you have 100 users
Slow when you have 10 million
We’re going to use indexes to tell the database common things we’re going to query on
<click>
For example, I’m going to search commonly on email and active so that’s a prime candidate
All of these rules exist to prevent bugs or performance problems