You are on page 1of 3

Chia Wei Chen

TIM58
Professor Shawfeng Dong
23 February 2018
Homework 7
Chapter 9:
1. Object persistence refers to designing a way in which the objects are stored in the database.
The project team designs the object persistence. The four steps to object persistence design is:
1) Selecting the format of the storage.
2) Mapping the problem domain objects to the object-persistence format.
3) Optimizing the object-persistence format.
4) Designing the data access and manipulation of classes necessary to handle the
communication between the system and an the database.

2. Files are electronic lists of data that have been optimized to perform a particular transaction.
Whereas, databases are made up of a collection of data sets that are related to each other in some
way. Logical groupings of information could include such categories as customer data,
information about an order, product information, etc. A database management system (DBMS) is
software that creates and manipulates these databases. Compared to databases, data in files can
sometimes be difficult to store, search, and retrieve. Files are also more vulnerable to a security
threat while databases are more secured.

6. Relational databases are the most popular today due to their usability and conceptual
simplicity. A relational database is based on collections of tables with each table having a
primary key. The tables are related to one another by placing the primary key from one table into
the related table as a foreign key. Most relational database management systems (RDBMS)
support referential integrity, which is the idea of ensuring that values linking the tables together
through the primary and foreign keys are valid and correctly synchronized. Three examples of
RDBMS on the market includes: Microsoft Access, IBM Informix, and Microsoft SQL Server.
23. Some techniques that can help improve performance of a database include: denormalization,
clustering, indexing, and volumetrics (proper estimation). Denormalization adds selected fields
back to tables in a data model, which improves the data access speed at the price of slight
redundancy. Clustering involves physically placing records together so that similar records are
stored in close proximity. Indexing creates small, quickly searchable tables that contain values
from the table and indicate where those values can be found. Applying volumetrics and properly
estimating the data set size is crucial in assuring that adequate hardware is obtained for the
system.

26. The size of the database is based on the amount of ​raw data​ in the tables and the ​overhead
requirements of the DBMS. To estimate size, you will need to have a good understanding of the
initial size of your database as well as its expected growth rate over time. Raw data refers to all
the data that are stored within the tables of the database, and it is calculated based on a
bottom-up approach. After the amount of raw data has been determined, the overhead for the
table as a percentage of each record must be calculated. Overhead includes the room needed by
the DBMS to support such functions as administrative actions and indexes, and it should be
assigned based on past experience, recommendations from technology vendors, or parameters
that are built into software that was written to calculate volumetrics. Lastly, record the number of
initial records that will be loaded into the table, as well as the expected growth per month.

Exercise D

In the above data model,


The three data entities are: Musician, Group, and CD.
“Musician” attribute(s): Msc_Musician name, Msc_Group name, and Msc_Instrument
“Group” attributes(s): Grp_Date format, Grp_Agent name
“CD” attribute(s): cd_CD length
Grp_Group Name and cd_CD Title are the primary key attributes which is used to uniquely
identify particular CD name in a table. The musician can be in a single group only, and so it
represents the “one-to-one relationship” between Musician and Group entity. A group contains
many musicians, and so it represents the “one-to-many relationship” between Group and
Musician entity. Musician group can have more than one CD, and so it represents the
“one-to-many relationship” between Group and CD entity. A CD can contain only the single
musician group, and so it represents the “one-to-one relationship” between CD and Group entity.

Minicase 2
Volumetrics Analysis:

Table Name Average Overhead Total Record Initial Initial Table


Recorded Size Factor Size Table Size Volume

New Vehicle 65 35% 88 10,000 880,000

Trade-in Vehicle 48 35% 65 7,500 487,500

Sales Invoice 76 35% 103 16,000 1,648,000

Customer 61 35% 83 13,000 1,079,000

Salesperson 34 35% 46 100 4,600

Installed Option 16 35% 22 25,000 550,000

Option 28 35% 38 500 19,000

Total initial volume: 4,668,100


Growth Rate: 10%
Total volume in three years is 6,213,241
Database requires allowing 6,213,241 characters, so hardware is required to allow that much size
of character data.

You might also like