Professional Documents
Culture Documents
Subject oriented
Integrited
Time Varient
Time Varient
Architecture of Data
WareHouse
Architecture of a Data
Warehouse with a Staging Area
Architecture of a Data
Warehouse with a Staging Area
and Data Marts
How to correct
To correct the offending an architect can do
three things.
Example: suppose on july 1 Rs 500 is made in
to operational system on july 2 a snapshot
taken in data warehouse and on july 15 it
discovered that it was a entry of 250 rather
than 500 on july 1.
Then
choice 1. go back to july 2 and update 250
inspite of 500. but it can create problem if any
report has been taken between july 2 to july 15.
How to correct
choice 2.
Enter offsetting entry i.e make two
entry first debit 500 then credit 250.
some time it also can create problem.
Choice 3.
Reset the account to the proper value.
but it will not correct the error.
So depending on the situation you can
make any decision.
Granularity
Refers to the level of details of the Data
Dual level of Granularity: 1. Low Level of Detail(More details)
2. High Level of detail( less details i.e
Summary)
Mostly Data in Data warehouse is in High level
But it has Low Level of Detail also for atomic
query.
Data Granularity
Data Granularity
A significant difference between an operational system
and a data warehouse is the granularity of the data
stored.
An operational system typically stores data at the
lowest level of granularity: the maximum level of detail.
However, because the data warehouse contains data
representing a long period in time, simply storing all
detail data from an operational system can result in an
overworked system that takes too long to query.
A data warehouse typically stores data in different
levels of granularity or summarization, depending on
the data requirements of the business. If an enterprise
needs data to assist strategic planning, then only highly
summarized data is required.
Granularity
The lower the level of granularity of data required by
the enterprise, the higher the number of resources
(specifically data storage) required to build the data
warehouse. The different levels of summarization in
order of increasing granularity are:
Current operational data
Historical operational data
Aggregated data
Metadata
Current and historical operational data are taken,
unmodified, directly from operational systems.
Historical data is operational level data no longer
queried on a regular basis, and is often archived
onto secondary storage.