14
Logical Database Design and the Relational Model Zaldy Adrianto [email protected] Objectives Definition of terms List five properties of relations State two properties of candidate keys Define first, second, and third normal form Describe problems from merging relations Transform E-R and EER diagrams to relations Create tables with entity and relational integrity constraints Use normalization to convert anomalous tables to well- structured relations The Physical Design Stage of SDLC Project Identification and Selection Project Initiation and Planning Analysis Physical Design Implementation Maintenance Logical Design Purpose – information requirements structure Deliverable – detailed design specifications Database activity – logical database design Relation / table Definition: A relation is a named, two- dimensional table of data Table consists of rows (records), and columns (attribute or field) Requirements for a table to qualify as a relation: It must have a unique name. Every attribute value must be atomic (not multivalued, not composite) Every row must be unique (can’t have two rows with exactly the same values for all their fields) Attributes (columns) in tables must have unique names The order of the columns must be irrelevant The order of the rows must be irrelevant NOTE: all relations are in 1 st Normal form Correspondence with E-R Model Relations (tables) correspond with entity types and with many-to-many relationship types Rows correspond with entity instances and with many-to-many relationship instances Columns correspond with attributes NOTE: The word relation (in relational database) is NOT the same as the word relationship (in E-R model)

05_HO_Logical Database Design and the Relational Model

Embed Size (px)

DESCRIPTION

DBMS

Citation preview

Page 1: 05_HO_Logical Database Design and the Relational Model

Logical Database Design and the

Relational Model

Zaldy Adrianto

[email protected]

Objectivesn Definition of termsn List five properties of relationsn State two properties of candidate keysn Define first, second, and third normal formn Describe problems from merging relationsn Transform E-R and EER diagrams to relationsn Create tables with entity and relational integrity

constraintsn Use normalization to convert anomalous tables to well-

structured relations

The Physical Design Stage of SDLC

Project Identification and Selection

Project Initiation and Planning

Analysis

Physical Design

Implementation

Maintenance

Logical Design

Purpose – information requirements structure Deliverable – detailed design specifications

Database activity – logical database design

Relation / table

nDefinition: A relation is a named, two-dimensional table of data

nTable consists of rows (records), and columns (attribute or field)

Requirements for a table to qualify as a relation:

✦ It must have a unique name.✦ Every attribute value must be atomic (not

multivalued, not composite)✦ Every row must be unique (can’t have two rows

with exactly the same values for all their fields)✦ Attributes (columns) in tables must have

unique names✦ The order of the columns must be irrelevant✦ The order of the rows must be irrelevant

NOTE: all relations are in 1st Normal form

Correspondence with E-R Model

nRelations (tables) correspond with entity types and with many-to-many relationship types

nRows correspond with entity instances and with many-to-many relationship instances

nColumns correspond with attributes

nNOTE: The word relation (in relational database) is NOT the same as the word relationship (in E-R model)

Page 2: 05_HO_Logical Database Design and the Relational Model

Key Fields

¨Keys are special fields that serve two main purposes:

¨Primary keys are unique identifiers of the relation in question. Examples include employee numbers, social security numbers, etc. This is how we can guarantee that all rows are unique

¨Foreign keys are identifiers that enable a dependent relation (on the many side of a relationship) to refer to its parent relation (on the one side of the relationship)

n Keys can be simple (a single field) or composite (more than one field)

Primary KeyForeign Key (implements

1:N relationship between customer and order)

Combined, these are a composite primary key (uniquely identifies the order line)…individually they are foreign keys (implement M:N relationship between order and product)

Integrity Constraints

n Domain Constraints nAllowable values for an attribute. See Table 5-1

n Entity Integrity nNo primary key attribute may be null. All primary

key fields MUST have data n Action Assertions

nBusiness rules. Recall from Ch. 4

Integrity Constraintsn Referential Integrity – rule that states that any foreign key value

(on the relation of the many side) MUST match a primary key value in the relation of the one side. (Or the foreign key can be null)

nFor example: Delete RulesnRestrict – don’t allow delete of “parent” side if related rows exist in “dependent” side

nCascade – automatically delete “dependent” side rows that correspond with the “parent” side row to be deleted

nSet-to-Null – set the foreign key in the dependent side to null if deleting from the parent side not allowed for weak entities

Referential integrity constraints (Pine Valley Furniture)

Referential integrity constraints are drawn

via arrows from dependent to parent

table

Page 3: 05_HO_Logical Database Design and the Relational Model

Referential integrity constraints are

implemented with foreign key to primary key references

Transforming EER Diagrams into Relations

Mapping Regular Entities to Relations Simple attributes: E-R attributes map directly onto

the relation

Composite attributes: Use only their simple, component attributes

Multivalued Attribute - Becomes a separate relation with a foreign key taken from the superior entity

Figure 5-8: Mapping a regular entity(a)

CUSTOMER entity type with simple attributes

(b) CUSTOMER relation (table)

(a) CUSTOMER entity type with

composite attribute

(b) CUSTOMER relation with address detail

Figure 5-9: Mapping a composite attribute

Figure 5-10: Mapping a multivalued attribute

1–to–many relationship between original entity and new relation

Multivalued attribute becomes a separate relation with foreign key

(b)

(a)

Transforming EER Diagrams into Relations (cont.)

Mapping Weak Entities

Becomes a separate relation with a foreign key taken from the superior entity

Primary key composed of:

Partial identifier of weak entity

Primary key of identifying relation (strong entity)

Page 4: 05_HO_Logical Database Design and the Relational Model

NOTE: the domain constraint for the foreign key should NOT allow null value if

DEPENDENT is a weak entity

Composite primary key

Transforming EER Diagrams into Relations (cont.)

nMapping Binary Relationships

nOne-to-Many - Primary key on the one side becomes a foreign key on the many side

nMany-to-Many - Create a new relation with the primary keys of the two entities as its primary key

nOne-to-One - Primary key on the mandatory side becomes a foreign key on the optional side

Figure 5-12a: Example of mapping a 1:M relationshipRelationship between customers and orders

Note the mandatory one

Figure 5-12b Mapping the relationship

Foreign key

Again, no null value in the foreign key…this is because of

the mandatory minimum cardinality

Figure 5-13a: Example of mapping an M:N relationship

The Supplies relationship will need to become a separate relation

Page 5: 05_HO_Logical Database Design and the Relational Model

Gambar 5-13b hasil tiga tabel

New intersection

relationForeign key

Foreign key

Composite primary key

Figure 5-14a: Mapping a binary 1:1 relationshipIn_charge relationship

Figure 5-14b Resulting relations Transforming EER Diagrams into Relations (cont.)

nMapping Associative Entities

nIdentifier Not Assigned

nDefault primary key for the association relation is composed of the primary keys of the two entities (as in M:N relationship)

nIdentifier Assigned

nIt is natural and familiar to end-users

nDefault identifier may not be unique

Page 6: 05_HO_Logical Database Design and the Relational Model

Figure 5-16a: Mapping an associative entity with an identifier Associative entity

Figure 5-16b Three resulting relations

Transforming EER Diagrams into Relations (cont.)

nMapping Unary Relationships

nOne-to-Many - Recursive foreign key in the same relation

nMany-to-Many - Two relations:

nOne for the entity type

nOne for an associative relation in which the primary key has two attributes, both taken from the primary key of the entity

Figure 5-17: Mapping a unary 1:N relationship

(a) EMPLOYEE entity dengan Manages relationship

(b) EMPLOYEE relation with

recursive foreign key

Figure 5-18: Mapping a unary M:N relationship

(a) Bill-of-materials relationships (M:N)

(b) ITEM and COMPONENT

relations

Transforming EER Diagrams into Relations (cont.)

nMapping Ternary (and n-ary) RelationshipsnOne relation for each entity and one for the associative entity

nAssociative entity has foreign keys to each entity in the relationship

Page 7: 05_HO_Logical Database Design and the Relational Model

Figure 5-19a: Mapping a ternary relationship with associative entity

Figure 5-19b Mapping the ternary relationship

Remember that the primary key MUST

be unique

Transforming EER Diagrams into Relations (cont.)

nMapping Supertype/Subtype RelationshipsnOne relation for supertype and for each subtype

nSupertype attributes (including identifier and subtype discriminator) go into supertype relation

nSubtype attributes go into each subtype; primary key of supertype relation also becomes primary key of subtype relation

n1:1 relationship established between supertype and each subtype, with supertype as primary table

Figure 5-20: Supertype/subtype relationships

Figure 5-21: Mapping Supertype/subtype relationships to relations

These are implemented as one-to-one relationships

Data Normalization

Page 8: 05_HO_Logical Database Design and the Relational Model

Normalisasi Data (Data Normalization)

nPrimarily a tool to validate and improve a logical design so that it satisfies certain constraints that avoid unnecessary duplication of data

nThe process of decomposing relations with anomalies to produce smaller, well-structured relations

Well-Structured Relations

n A relation that contains minimal data redundancy and allows users to insert, delete, and update rows without causing data inconsistencies

n Goal is to avoid anomalies

Tabel tidak boleh berhubungan dengan lebih dari satu entitas (entity)

Goal is to avoid anomalies✦ Insertion Anomaly – adding new rows forces

user to create duplicate data✦ Deletion Anomaly – deleting rows may cause

a loss of data that would be needed for other future rows

✦ Modification Anomaly – changing data in a row forces changes to other rows because of duplication

General rule of thumb: a table should not pertain to more than one entity type

Example – Figure 5.2b

Question – Is this a relation?

Answer – Yes: unique rows and no multivalued attributes

Question – What’s the primary key?

Answer – Composite: Emp_ID, Course_Title

nAnomalies in this Table ¨Insertion – can’t enter a new employee without having

the employee take a class

¨Deletion – if we remove employee 140, we lose information about the existence of a Tax Acc class

¨Modification – giving a salary increase to employee 100 forces us to update multiple records

Why do these anomalies exist?

Because there are two themes (entity types) into one relation. This results in duplication, and an unnecessary dependency between the entities

Page 9: 05_HO_Logical Database Design and the Relational Model

nFunctional Dependency: The value of one attribute (the determinant) determines the value of another attribute

nCandidate Key:nA unique identifier. One of the candidate keys will

become the primary keynE.g. perhaps there is both credit card number and SS# in a table…in this case both are candidate keys

nEach non-key field is functionally dependent on every candidate key

Functional Dependencies and Keys Figure 5.22 -

Steps in normalization

Normalization steps

✦ Multivalued tabel ✦ Removed multivalued attributes (to form 1NF)

✦ First Normal Form - 1NF ✦ Removed Partial Dependencies (to form 2 NF)

✦ Second Normal Form - 2NF✦ Removed Transitive dependencies and Full

Dependencies (if any) (to Form 3NF)

First Normal Form

nNo multivalued attributes

nEvery attribute value is atomic

nFig. 5-25 is not in 1st Normal Form (multivalued attributes) it is not a relation

nFig. 5-26 is in 1st Normal form

nAll relations are in 1st Normal Form

Table with multivalued attributes, not in 1st normal form

Note: this is NOT a table

Table with no multivalued attributes and unique rows, in 1st normal form

Note: this is relation, but not a well-structured one

Page 10: 05_HO_Logical Database Design and the Relational Model

Anomalies in this Table

n Insertion – if new product is ordered for order 1007 of existing customer, customer data must be re-entered, causing duplication

n Deletion – if we delete the Dining Table from Order 1006, we lose information concerning this item's finish and price

n Update – changing the price of product ID 4 requires update in several records

Why do these anomalies exist?

Because there are multiple themes (entity types) into one relation. This results in duplication, and an unnecessary dependency between the entities

Second Normal Form¨1NF PLUS every non-key attribute is fully

functionally dependent on the ENTIRE primary key

¨Every non-key attribute must be defined by the entire key, not by only part of the key

¨No partial functional dependencies

¨Partial functional dependencies is a Functional dependencies in which one or more nonkey attributes are functionally dependent on part (but not all) of the primary key

Order_ID è Order_Date, Customer_ID, Customer_Name, Customer_Address

Therefore, NOT in 2nd Normal Form

Customer_ID è Customer_Name, Customer_AddressProduct_ID è Product_Description, Product_Finish, Unit_PriceOrder_ID, Product_ID è Order_Quantity

Getting it into Second Normal Form

Partial Dependencies are removed, but there are still

transitive dependencies

Third Normal Form

n 2NF PLUS no transitive dependencies (functional dependencies on non-primary-key attributes)

nNote: this is called transitive, because the primary key is a determinant for another attribute, which in turn is a determinant for a third

nSolution: non-key determinant with transitive dependencies go into a new table; non-key determinant becomes primary key in the new table and stays as foreign key in the old table

Page 11: 05_HO_Logical Database Design and the Relational Model

Getting it into Third Normal Form

Transitive dependencies are removed

Merging Relationsn View Integration – Combining entities from multiple ER

models into common relations n Issues to watch out for when merging entities from

different ER models: nSynonyms – two or more attributes with different

names but same meaning nHomonyms – attributes with same name but different

meanings nTransitive dependencies – even if relations are in 3NF

prior to merging, they may not be after merging nSupertype/subtype relationships – may be hidden prior

to merging

Enterprise KeysnPrimary keys that are unique in the whole database, not just within a single relation

nCorresponds with the concept of an object ID in object-oriented systems

Parking ticket table (unormalized form)St_ID L-Name F-Name Phone

Number St_Lic Lic_No Ticket# Date Code Fine

38249 Brown Thomas 1117804 FL BRY123 15634 10/17/04 2 25

16017 11/13/04 1 15

82453 Green Sally 3911689 AL TRE141 14987 10/05/04 3 100

16293 11/18/04 1 15

17892 12/13/04 2 25

St_ID = student_ID

St_Lic = State_licence

Lic_no = License_number

Ticket# = Ticket_Number

Code = violation code

Parking ticket table (1NF)

St_ID L-Name F-Name Phone Number St_Lic Lic_No Ticket# Date Code Fine

38249 Brown Thomas 1117804 FL BRY123 15634 10/17/04 2 25

38249 Brown Thomas 1117804 FL BRY123 16017 11/13/04 1 15

82453 Green Sally 3911689 AL TRE141 14987 10/05/04 3 100

82453 Green Sally 3911689 AL TRE141 16293 11/18/04 1 15

82453 Green Sally 3911689 AL TRE141 17892 12/13/04 2 25

St_ID = student_ID

St_Lic = State_licence

Lic_no = License_number

Ticket# = Ticket_Number

Code = violation code

Page 12: 05_HO_Logical Database Design and the Relational Model

Ilustrasi

Student_ID Student_name

Campus_addres Major Course_I

DCourse

TitleInstructor_n

ameInstructor_l

ocation Grade

168300458 Williams 208 Brooks IS IS 350 Database MGT Codd B 104 A

268300458 WIlliams 208 Brooks IS IS 465 System Analysis Parsons B 317 B

543291073 Baker 104 Phillips Acct IS 350 Database

MGT Codd B 104 C

543291073 Baker 104 Phillips Acct Acct 201

Fund Accountin

gMiller H 310 B

543291073 Baker 104 Phillips Acct Mktg

300Intro

Mrktg Bennet B 212 A

1 NF

Student_ID Student_name

Campus_addres Major Course

_IDCourse

TitleInstructor_

nameInstructor_location Grade

Primary Key

Student_ID Student_name

Campus_addres Major Course

_IDCourse

TitleInstructor_

nameInstructor_location Grade

Partial dependencies

Partial dependencies

Student_ID Student_name

Campus_address Major Course

_IDCourse

TitleInstructor_

nameInstructor_location Grade

Partial dependencies

Partial dependencies

Attributes that functionally dependent on Primary key

Student_ID Student_name

Campus_address Major Course

_IDCourse

TitleInstructor_

nameInstructor_location Grade

Partial dependencies

Partial dependencies

Full dependencies

Page 13: 05_HO_Logical Database Design and the Relational Model

Student_ID Student_name

Campus_address Major Course

_IDCourse

TitleInstructor_

nameInstructor_location Grade

Partial dependencies

Partial dependencies

Full dependencies

Transitive dependencies

Student_ID Student_name

Campus_address Major Course

_IDCourse

TitleInstructor_

nameInstructor_location Grade

Partial dependencies

Partial dependencies

Full dependenciesTransitive

dependenciesAttributes that functionally

dependent on Non- Primary key

Student_ID Student_name

Campus_address Major

Course_ID Student_ID Course Title Instructor_name Instructor_location Grade

Transitive dependencies

3 NF

2 NF

Student_ID Student_name Campus_address Major

Course_ID Student_ID Grade

Course_ID Course Title Instructor_name Instructor_location

2 NF

3 NF

2 NF

3 NF

Course_ID Course Title

Student_ID Student_name

Campus_address Major

Course_ID Student_ID Grade

Course Title

Instructor_name

Instructor_location

3 NF

Course_ID Course Title

Student_ID Student_name

Campus_address Major

Course_ID Student_ID Grade

3 NF

Course Title

Instructor_name

Instructor_location

PK

PK

FK

3 NF

Page 14: 05_HO_Logical Database Design and the Relational Model

Review Schema Database✦ External schema

✦ Enterprise Data Modelling✦ Conceptual Schema

✦ Entity Relationship Diagram ✦ Internal Schema

✦ Logical Schema ✦ Relational Data Model (need

normalization when anomalies occurs)✦ Physical Schema

Questions ???