0% found this document useful (0 votes)

54 views22 pages

Normalization

Uploaded by

Bishnu Chauhan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views22 pages

Normalization

Uploaded by

Bishnu Chauhan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Normalization

One of the most important factors in software development is

database definition. If your tables are not set up properly, it
can cause you a lot of headaches down the road when you
extract the data you want. By understanding data
relationships and the normalization of data, you will be better
prepared to begin developing your applications.
Whether you work with MS-Access, Foxpro, MS-SQL
Server, mySQL or Oracle, you should know the methods of
normalizing the table schema in your relational database
system. They can help make your code easier to understand,
easier to expand upon, and in some cases, actually speed up
your application.

Copyright @ www.bcanotes.com
N o r m a l i z a t i o n…
Basically, the Rules of Normalization are enforced by
eliminating redundancy and inconsistent dependency in your
table designs. Here we will explain what that means by
examining the five progressive steps to normalization you
should be aware of in order to create a functional and efficient
database.
Let's say we want to create a table of user information, and
we want to store each users' Name, Company, Company
Address, and some personal bookmarks, or urls. You might
start by defining a table structure like this:

users

name company company_address url1 url2

Joe ABC 1 Work Lane abc.com xyz.com

Jill XYZ 1 Job Street abc.com xyz.com

We would say this table is in Zero Form because none of our

rules of normalization have been applied yet. Notice the url1
and url2 fields -- what do we do when our application needs to
ask for a third url? Do you want to keep adding columns to
your table and hard-coding that form input field into your
application code? Obviously not, you would want to create a
functional system that could grow with new development
requirements. Let's look at the rules for the First Normal
Form, and then apply them to this table.

1. Eliminate repeating groups in individual tables.

2. Create a separate table for each set of related data.
3. Identify each set of related data with a primary key.

Notice how we're breaking that first rule by repeating the url1
and url2 fields? And what about Rule Three, primary keys? Rule
Three basically means we want to put some form of unique,
auto-incrementing integer value into every one of our records.
Otherwise, what would happen if we had two users named Joe
and we wanted to tell them apart? When we apply the rules of
the First Normal Form we come up with the following table:

userId name company company_address url

1 Joe ABC 1 Work Lane abc.com

1 Joe ABC 1 Work Lane xyz.com

2 Jill XYZ 1 Job Street abc.com

2 Jill XYZ 1 Job Street xyz.com

Now our table is said to be in the First Normal Form. We've solved
the problem of url field limitation, but look at the headache we've
now caused ourselves. Every time we input a new record into
the users table, we've got to duplicate all that company and user
name data. Not only will our database grow much larger than
we'd ever want it to, but we could easily begin corrupting our
data by misspelling some of that redundant information. Let's
apply the rules of Second Normal Form:

Second Normal Form

1. Create separate tables for sets of values that apply to

multiple records.
2. Relate these tables with a foreign key.

We break the url values into a separate table so we can add

more in the future without having to duplicate data. We'll also
want to use our primary key value to relate these fields:

urlId relUserId url

user na comp company_a
Id me any ddress 1 1 abc.com

2 1 xyz.com
1 Joe ABC 1 Work Lane

3 2 abc.com

2 Jill XYZ 1 Job Street

4 2 xyz.com

Ok, we've created separate tables and the primary key in the users
table, userId, is now related to the foreign key in the urls table,
relUserId. We're in much better shape. But what happens when
we want to add another employee of company ABC? Or 200
employees? Now we've got company names and addresses
duplicating themselves all over the place, a situation just rife for
introducing errors into our data. So we'll want to look at
applying the Third Normal Form:

Eliminate fields that do not depend on the key.

Our Company Name and Address have nothing to do with the

User Id, so they should have their own Company Id:

users

userId name relCompId

1 Joe 1

2 Jill 2

compId company company_address

1 ABC 1 Work Lane

2 XYZ 1 Job Street

urls
urlId relUserId url
1 1 abc.com
2 1 xyz.com
3 2 abc.com
4 2 xyz.com

Now we've got the primary key compId in the companies table
related to the foreign key in the users table called relCompId,
and we can add 200 users while still only inserting the name
"ABC" once. Our users and urls tables can grow as large as they
want without unnecessary duplication or corruption of data.
Most developers will say the Third Normal Form is far enough,
and our data schema could easily handle the load of an entire
enterprise, and in most cases they would be correct.

Copyright @ www.bcanotes.com
N o r m a l i z a t i o n…
But look at our url fields - do you notice the duplication of data? This
is perfectly acceptable if we are not pre-defining these fields. If the
input page which our users are filling out to input this data allows
a free-form text input there's nothing we can do about this, and it's
just a coincidence that Joe and Jill both input the same
bookmarks. But what if it's a drop-down menu which we know only
allows those two urls, or maybe 20 or even more. We can take our
database schema to the next level, the Fourth Form, one which
many developers overlook because it depends on a very specific
type of relationship, the many-to-many relationship, which we
have not yet encountered in our application.

Copyright @ www.bcanotes.com
N o r m a l i z a t i o n…
Data Relationships
Before we define the Fourth Normal Form, let's look at the three basic
data relationships: one-to-one, one-to-many, and many-to-many.
Look at the users table in the First Normal Form example above.
For a moment let's imagine we put the url fields in a separate table,
and every time we input one record into the users table we would
input one row into the urls table. We would then have a one-to-one
relationship: each row in the users table would have exactly one
corresponding row in the urls table. For the purposes of our
application this would neither be useful nor normalized.

Copyright @ www.bcanotes.com
N o r m a l i z a t i o n…
Now look at the tables in the Second Normal Form example. Our tables
allow one user to have many urls associated with his user record.
This is a one-to-many relationship, the most common type, and
until we reached the dilemma presented in the Third Normal Form,
the only kind we needed.
The many-to-many relationship, however, is slightly more complex.
Notice in our Third Normal Form example we have one user related
to many urls. As mentioned, we want to change that structure to
allow many users to be related to many urls, and thus we want a
many-to-many relationship. Let's take a look at what that would do
to our table structure before we discuss it:

userId Name relCompId

1 Joe 1

2 Jill 2

companies

compId company company_address

1 ABC 1 Work Lane

2 XYZ 1 Job Street

url_relations

relationId relatedUrlId relatedUserId

1 1 1

2 1 2

3 2 1

4 2 2

In order to decrease the duplication of data (and in the process bring

ourselves to the Fourth Form of Normalization), we've created a
table full of nothing but primary and foriegn keysin url_relations.
We've been able to remove the duplicate entries in the urls table
by creating the url_relations table. We can now accurately express
the relationship that both Joe and Jill are related to each one of ,
and both of, the urls. So let's see exactly what the Fourth Form Of
Normalization entails:

1. In a many-to-many relationship, independent entities can not

be stored in the same table.
Since it only applies to the many-to-many relationship, most
developers can rightfully ignore this rule. But it does come in
handy in certain situations, such as this one. We've successfully
streamlined our urls table to remove duplicate entries and moved
the relationships into their own table.
Just to give you a practical example, now we can select all of Joe's
urls by performing the following SQL call:

Copyright @ www.bcanotes.com
N o r m a l i z a t i o n…
SELECT name, url FROM users, urls, url_relations WHERE
url_relations.relatedUserId = 1 AND users.userId = 1 AND urls.urlId
= url_relations.relatedUrlId

And if we wanted to loop through everybody's User and Url

information, we'd do something like this:

SELECT name, url FROM users, urls, url_relations WHERE

users.userId = url_relations.relatedUserId AND urls.urlId =
url_relations.relatedUrlId

Copyright @ www.bcanotes.com
N o r m a l i z a t i o n…
Fifth Normal Form
There is one more form of normalization which is sometimes
applied, but it is indeed very esoteric and is in most cases
probably not required to get the most functionality out of your data
structure or application. It's tenet suggests:
1. The original table must be reconstructed from the tables into
which it has been broken down.
The benefit of applying this rule ensures you have not created any
extraneous columns in your tables, and that all of the table
structures you have created are only as large as they need to be.
It's good practice to apply this rule, but unless you're dealing with
a very large data schema you probably won't need it.

C++: A Beginner's Guide, Second Edition
From Everand
C++: A Beginner's Guide, Second Edition
Herbert Schildt
No ratings yet
PT120 CT120 Database Schema Essentials: Training Manual
No ratings yet
PT120 CT120 Database Schema Essentials: Training Manual
163 pages
Book Shop Automation - SRS
100% (22)
Book Shop Automation - SRS
41 pages
52 MX Erd
No ratings yet
52 MX Erd
123 pages
Tables Joins
67% (3)
Tables Joins
26 pages
Database Normalization and Design Techniques: Zero Form
100% (1)
Database Normalization and Design Techniques: Zero Form
14 pages
Database Normalization Example
100% (1)
Database Normalization Example
3 pages
Chapter 5
No ratings yet
Chapter 5
24 pages
Research Paper on Normalization in Dbms
No ratings yet
Research Paper on Normalization in Dbms
4 pages
2nd and 3rd Unit
No ratings yet
2nd and 3rd Unit
87 pages
Bcis5420 - Lecture Note - ch5 - Data Normalization
No ratings yet
Bcis5420 - Lecture Note - ch5 - Data Normalization
43 pages
Normalization
No ratings yet
Normalization
19 pages
Dbms Theory Notes Unit IV
No ratings yet
Dbms Theory Notes Unit IV
73 pages
Normalization: Normalization Is A Systematic Way of Ensuring That A Database Structure Is Suitable For
No ratings yet
Normalization: Normalization Is A Systematic Way of Ensuring That A Database Structure Is Suitable For
6 pages
Normalization docx (Autosaved)
No ratings yet
Normalization docx (Autosaved)
33 pages
RDBMS Unit 4
No ratings yet
RDBMS Unit 4
15 pages
Normalizimi
No ratings yet
Normalizimi
26 pages
Normalization in DBMS11
No ratings yet
Normalization in DBMS11
12 pages
MYSQL DAY - 20 (Normalization)
No ratings yet
MYSQL DAY - 20 (Normalization)
13 pages
Q.1 What Is Normalisation? ANSWER:-Normalisation Is The Process of Structuring A Relational Database in Accordance
No ratings yet
Q.1 What Is Normalisation? ANSWER:-Normalisation Is The Process of Structuring A Relational Database in Accordance
9 pages
er diagram
No ratings yet
er diagram
28 pages
Normalizationl
No ratings yet
Normalizationl
17 pages
Normalization
No ratings yet
Normalization
23 pages
Database Normalization
No ratings yet
Database Normalization
7 pages
Database Normalization: Mohua Sarkar, PH.D Software Engineer California Pacific Medical Center 415-600-7003
No ratings yet
Database Normalization: Mohua Sarkar, PH.D Software Engineer California Pacific Medical Center 415-600-7003
23 pages
DBMS_UNIT_3
No ratings yet
DBMS_UNIT_3
11 pages
Normalization Paper
No ratings yet
Normalization Paper
3 pages
Week 6
No ratings yet
Week 6
9 pages
Chapter 5
No ratings yet
Chapter 5
22 pages
Normalization 01
No ratings yet
Normalization 01
34 pages
Normalization Lec4
No ratings yet
Normalization Lec4
29 pages
Description of the database normalization basics
No ratings yet
Description of the database normalization basics
5 pages
Norm LIZATION
No ratings yet
Norm LIZATION
27 pages
What Is Normalizationbykbs
No ratings yet
What Is Normalizationbykbs
17 pages
Normal
No ratings yet
Normal
10 pages
Normalization and Normal Form
No ratings yet
Normalization and Normal Form
11 pages
DBMS Normalization
No ratings yet
DBMS Normalization
53 pages
DB Normalization and Design
No ratings yet
DB Normalization and Design
11 pages
SQ L Normalization
100% (1)
SQ L Normalization
9 pages
Normalization
No ratings yet
Normalization
23 pages
Normalization
No ratings yet
Normalization
57 pages
Normalization-2
No ratings yet
Normalization-2
6 pages
DBMS Normalization
No ratings yet
DBMS Normalization
15 pages
Topic 2 - Normalization Notes
No ratings yet
Topic 2 - Normalization Notes
5 pages
Normalization in Database: First Normal Form
No ratings yet
Normalization in Database: First Normal Form
4 pages
Dbms Chapter 3
No ratings yet
Dbms Chapter 3
47 pages
Normalization: ITM 692 Sanjay Goel
No ratings yet
Normalization: ITM 692 Sanjay Goel
34 pages
DBMS 2: Anomalies + Normalization PDF
No ratings yet
DBMS 2: Anomalies + Normalization PDF
10 pages
DBMS - Lecture - 4 - Normalization
No ratings yet
DBMS - Lecture - 4 - Normalization
37 pages
NORMALIZATION
No ratings yet
NORMALIZATION
6 pages
Database Normalization
No ratings yet
Database Normalization
9 pages
Normalization 2011
No ratings yet
Normalization 2011
9 pages
DB Lecture 7. Data Normalization
No ratings yet
DB Lecture 7. Data Normalization
25 pages
Normalization Notes
No ratings yet
Normalization Notes
10 pages
DB Normalization by Prof - Manikandan
No ratings yet
DB Normalization by Prof - Manikandan
23 pages
Normalization
No ratings yet
Normalization
30 pages
Chapter 5_085706
No ratings yet
Chapter 5_085706
23 pages
Normalization
No ratings yet
Normalization
20 pages
Normalization
No ratings yet
Normalization
17 pages
Database Normalization
No ratings yet
Database Normalization
10 pages
normalization2017bybiplapbhattarai-180211151119
No ratings yet
normalization2017bybiplapbhattarai-180211151119
27 pages
What is Normalization
No ratings yet
What is Normalization
8 pages
Practical SQL
From Everand
Practical SQL
David Perry
3.5/5 (3)
Elements of Android Room
From Everand
Elements of Android Room
Mark Murphy
No ratings yet
Class 12 Python Class Notes
67% (3)
Class 12 Python Class Notes
14 pages
Database Lecture08
No ratings yet
Database Lecture08
40 pages
Dbms Assignment
No ratings yet
Dbms Assignment
31 pages
SQL W3schools
No ratings yet
SQL W3schools
110 pages
Chapter 7: Relational Database Design by ER-to-Relational Mapping
No ratings yet
Chapter 7: Relational Database Design by ER-to-Relational Mapping
18 pages
Dbms Lab Manual
No ratings yet
Dbms Lab Manual
58 pages
DBMS Technical Questions TCS
No ratings yet
DBMS Technical Questions TCS
35 pages
Table 1
No ratings yet
Table 1
30 pages
DBMS Mod 2 ppt
No ratings yet
DBMS Mod 2 ppt
116 pages
PL-SQL Interview Questions
No ratings yet
PL-SQL Interview Questions
191 pages
Bishes Upadhyaya DDD
No ratings yet
Bishes Upadhyaya DDD
64 pages
SQL
100% (1)
SQL
100 pages
Referential Integrity and Relational Database Design: Outline
No ratings yet
Referential Integrity and Relational Database Design: Outline
5 pages
Assignment_No_3_120325
No ratings yet
Assignment_No_3_120325
3 pages
Database Programming: CPC 223 (A)
No ratings yet
Database Programming: CPC 223 (A)
80 pages
Oracle8 Objects: Introduction To Oracle8 Object Technology
No ratings yet
Oracle8 Objects: Introduction To Oracle8 Object Technology
48 pages
High Order Thinking Questions (Hots) For Class 12
No ratings yet
High Order Thinking Questions (Hots) For Class 12
33 pages
Cse CSPC403 DBMS
No ratings yet
Cse CSPC403 DBMS
98 pages
HANA (High Performance Analytic Appliance)
No ratings yet
HANA (High Performance Analytic Appliance)
159 pages
FDB For Exit Exam
No ratings yet
FDB For Exit Exam
284 pages
Create Table Employee
100% (1)
Create Table Employee
2 pages
CBSE Sample Paper (For Website)
No ratings yet
CBSE Sample Paper (For Website)
16 pages
Wa0074
No ratings yet
Wa0074
114 pages
Sap Ddic
No ratings yet
Sap Ddic
15 pages
Practical List of DBMS
No ratings yet
Practical List of DBMS
19 pages
database project group 5
No ratings yet
database project group 5
22 pages