I use the term
"guideline" because, to date, no commercial relational database system
fully conforms to all 12 rules. They do represent the relational ideal,
though. For a few years, scorecards were kept that rated each commercial
product's conformity to Codd's rules. Today, the rules are not talked
about as much but remain a goal for relational database design.
Following is a list of Codd's 12 rules, including his original name for
each rule and a simplified description. I also have included a note where
certain rules are problematic to implement. Don't worry if some of these
items are confusing to you, as we move further through this newsletter
series we will fill in the details.
0. (Yes, there is a Rule 0!) For a system to qualify as a RELATIONAL,
DATABASE, MANAGEMENT system, that system must use its RELATIONAL
facilities (exclusively) to MANAGE the DATABASE.
Rule 1: The Information Rule
All data should be presented to the user in table form.
Rule 2: Guaranteed Access Rule
All data should be accessible without ambiguity. This can be accomplished
through a combination of the table name, primary key, and column name.
Rule 3: Systematic Treatment of Null Values
A field should be allowed to remain empty. This involves the support of a
null value, which is distinct from an empty string or a number with a
value of zero. Of course, this can't apply to primary keys. In addition,
most database implementations support the concept of a nun- null field
constraint that prevents null values in a specific table column.
Rule 4: Dynamic On-Line Catalog Based on the Relational Model A
relational database must provide access to its structure through the same
tools that are used to access the data. This is usually accomplished by
storing the structure definition within special system tables.
Rule 5: Comprehensive Data Sublanguage Rule
The database must support at least one clearly defined language that
includes functionality for data definition, data manipulation, data
integrity, and database transaction control. All commercial relational
databases use forms of the standard SQL (Structured Query Language) as
their supported comprehensive language.
Rule 6: View Updating Rule
Data can be presented to the user in different logical combinations,
called views. Each view should support the same full range of data
manipulation that direct-access to a table has available. In practice,
providing update and delete access to logical views is difficult and is
not fully supported by any current database.
Rule 7: High-level Insert, Update, and Delete
Data can be retrieved from a relational database in sets constructed of
data from multiple rows and/or multiple tables. This rule states that
insert, update, and delete operations should be supported for any
retrievable set rather than just for a single row in a single table.
Rule 8: Physical Data Independence
The user is isolated from the physical method of storing and retrieving
information from the database. Changes can be made to the underlying
architecture ( hardware, disk storage methods ) without affecting how the
user accesses it.
Rule 9: Logical Data Independence
How a user views data should not change when the logical structure (tables
structure) of the database changes. This rule is particularly difficult to
satisfy. Most databases rely on strong ties between the user view of the
data and the actual structure of the underlying tables.
Rule 10: Integrity Independence
The database language (like SQL) should support constraints on user input
that maintain database integrity. This rule is not fully implemented by
most major vendors. At a minimum, all databases do preserve two
constraints through SQL.
- No component of a primary key can have a null value. (see rule 3)
- If a foreign key is defined in one table, any value in it must exist
as a primary key in another table.
Rule 11: Distribution Independence
A user should be totally unaware of whether or not the database is
distributed (whether parts of the database exist in multiple locations). A
variety of reasons make this rule difficult to implement; I will spend
time addressing these reasons when we discuss distributed databases.
Rule 12: Nonsubversion Rule
There should be no way to modify the database structure other than through
the multiple row database language (like SQL). Most databases today
support administrative tools that allow some direct manipulation of the
datastructure.