Professional Documents
Culture Documents
Agenda
Introduction
Best Practices Over View Deep Dive Conclusion Q&A
Introduction
Speaker
Holds Bachelor degree in Physics and Electrical and Electronics
Engineering.
Over 26 years of Experience in Power System and Information
Technology field.
Has built several large scale applications including online CRM for
multinationals.
Has managed several Data Warehousing and BI projects for Direct
reporting projects.
ISO auditor Certified Bullet Proof Manager from CrestComm USA.
of time. The framework should have certain characteristics in that they should be repeatable.
and The Process Managers ( Load / Warehouse / Query) That make information available , enabling the user to make informed decisions.
Deep Dive
In order to respond to todays requirement for instant access to corporate information , the data warehouse should be designed to respond to this need in a optimal way. Business itself probably not aware of what information is required in the future. This requires a fundamentally different approach than the traditional waterfall method of software development for the Data warehouse.
Experience so far
Most Enterprise Data Warehousing projects tend to have
development cycle of between 18 24 months from start to finish. Justification of this investment is substantial. Businesses would prefer a better approach to justify the investment.
and long term requirement of the data warehouse. An Architecture design that would evolve. Identification of quick win that delivers business benefit in the first build.
Horizontal Scalability
High Speed Network
CPU CPU CPU CPU CPU CPU CPU CPU
RAM
RAM
RAM
RAM
RAM
RAM
RAM
RAM
DB
DB
DB
DB
DB
DB
DB
DB
Multiple servers are connected thru a network and use the data partitioning feature of the Database to tie the CPUs together.
Data warehouse (System of Record) Full History in 3rd Normal Form No User Access
Data Mart
Source Systems
Strategy
Metadata Mgmt Architecture Integration Control Delivery Data Quality Spec Analysis Measurement Improvement Data Architecture Entp. DM Value Chain Data Development Analysis Data Modeling DB Design Implementation
Document / Content Mgmt Acquisition & Storage Backup & Recovery Content Retrieval Retention
Data Governance
Reference and MDM External & Internal Code Customer Data Product data
Environment
Organization & Culture
Technology
Activities
Deliverables
Architecture Requirements
Must recognize change as a constant
Take incremental development approach Existing applications must continue to work
added
High Level
Remember the different worlds
On-line transaction processing (OLTP) Business intelligence systems (BIS)
different
Best Practice #1
Use a Data model that is optimized for Information retrieval
dimensional model
denormalized
hybrid approach
September 8, 2013
24
Best Practice #2
Carefully design the data acquisition and cleansing processes for your DW
Ensure the data is processed efficiently and
accurately Consider acquiring ETL and Data Cleansing tools Use them well!
September 8, 2013
25
Best Practice #3
Design a metadata architecture that allows sharing of metadata between components of your DW
consider metadata standards such as OMGs
September 8, 2013
26
Best Practice #4
Take an approach that consolidates data into a single version of the truth
Data Warehouse Bus conformed dimensions & facts OR?
September 8, 2013
27
Best Practice #5
Consider implementing an ODS only when information retrieval requirements are near the bottom of the data abstraction pyramid and/or when there are multiple operational sources that need to be accessed
Must ensure that the data model is integrated, not just
consolidated May consider 3NF data model Avoid at all costs a data dumping ground
September 8, 2013
28
Pitfalls to be Avoided
Engagement of Non-BI Manger in a BI delivery Project.
Trying to please the client and the user community. Expecting the Service Provider to own the Project completely. Bringing the Solution Architect half way into the project.
deployed.
Thank You