Professional Documents
Culture Documents
TIM58
Professor Shawfeng Dong
23 February 2018
Homework 7
Chapter 9:
1. Object persistence refers to designing a way in which the objects are stored in the database.
The project team designs the object persistence. The four steps to object persistence design is:
1) Selecting the format of the storage.
2) Mapping the problem domain objects to the object-persistence format.
3) Optimizing the object-persistence format.
4) Designing the data access and manipulation of classes necessary to handle the
communication between the system and an the database.
2. Files are electronic lists of data that have been optimized to perform a particular transaction.
Whereas, databases are made up of a collection of data sets that are related to each other in some
way. Logical groupings of information could include such categories as customer data,
information about an order, product information, etc. A database management system (DBMS) is
software that creates and manipulates these databases. Compared to databases, data in files can
sometimes be difficult to store, search, and retrieve. Files are also more vulnerable to a security
threat while databases are more secured.
6. Relational databases are the most popular today due to their usability and conceptual
simplicity. A relational database is based on collections of tables with each table having a
primary key. The tables are related to one another by placing the primary key from one table into
the related table as a foreign key. Most relational database management systems (RDBMS)
support referential integrity, which is the idea of ensuring that values linking the tables together
through the primary and foreign keys are valid and correctly synchronized. Three examples of
RDBMS on the market includes: Microsoft Access, IBM Informix, and Microsoft SQL Server.
23. Some techniques that can help improve performance of a database include: denormalization,
clustering, indexing, and volumetrics (proper estimation). Denormalization adds selected fields
back to tables in a data model, which improves the data access speed at the price of slight
redundancy. Clustering involves physically placing records together so that similar records are
stored in close proximity. Indexing creates small, quickly searchable tables that contain values
from the table and indicate where those values can be found. Applying volumetrics and properly
estimating the data set size is crucial in assuring that adequate hardware is obtained for the
system.
26. The size of the database is based on the amount of raw data in the tables and the overhead
requirements of the DBMS. To estimate size, you will need to have a good understanding of the
initial size of your database as well as its expected growth rate over time. Raw data refers to all
the data that are stored within the tables of the database, and it is calculated based on a
bottom-up approach. After the amount of raw data has been determined, the overhead for the
table as a percentage of each record must be calculated. Overhead includes the room needed by
the DBMS to support such functions as administrative actions and indexes, and it should be
assigned based on past experience, recommendations from technology vendors, or parameters
that are built into software that was written to calculate volumetrics. Lastly, record the number of
initial records that will be loaded into the table, as well as the expected growth per month.
Exercise D
Minicase 2
Volumetrics Analysis: