traditional data warehousing
Data warehouse
* Donald Feinberg, Mark Beyer, Merv Adrian, Roxane Edjlali (Gartner), The State of Data Warehousing in 2012 (Stamford, CT.: Gartner, 2012)
Big Data definition
*Gartner: Survey Analysis – Hadoop Adoption Drivers and Challenges (Stamford, CT.: Gartner, 2015)
But, Microsoft has done it before
We needed to better leverage data and analytics to do more experimentation
So we:
• Designed a data lake for everyone to put their data into
• Built tools approachable by any developer
• Created machine learning tools for collaborating across large experiment models
Result:
• Across Microsoft, ten thousand developers doing experimentation leading to better insights
• Leading to growth in our Microsoft businesses
[Chart: Growth of data @ Microsoft. Scale labels: Petabytes, Exabytes. Data sources shown: Yammer, Xbox Live, Office365, LCA, Live, Bing, SMSG, CRM/Dynamics, Skype, Exchange, Windows, Malware Protection, Microsoft Stores, Commerce Risk]
[Diagram labels: Apps; Sensors and devices; Event Hubs; HDInsight (Hadoop and Spark); YARN; HDFS; Cortana; Mobile Apps; Dashboards & Visualizations; Power BI; Automated Systems]
*IDC study “The Business Value and TCO Advantage of Apache Hadoop in the Cloud with Microsoft Azure HDInsight”
Azure Data Lake Store
A No limits Data Lake that powers Big Data Analytics
• Petabyte size files and Trillions of objects
• Scalable throughput for massively parallel analytics
• HDFS for the cloud
• Always encrypted, role-based security & auditing
• Enterprise-grade support
Azure Data Lake Analytics
A No limits Analytics Job Service to power intelligent action
• Start in seconds, scale instantly, pay per job
• Develop massively parallel programs with simplicity
• Debug and optimize your big data programs with ease
• Virtualize your analytics
• Enterprise-grade security, auditing and support
Azure Data Lake
Big Data made easy
[Diagram labels: YARN; HDFS; Store. Region labels: SE Asia; Singapore; Australia East]
[Axis label: User Adoption]
BIG DATA ANALYTICS
• Azure Data Lake Analytics: Specific apps in a multi-tenant form factor
• Azure HDInsight: Workload optimized, managed clusters
• Azure Marketplace (HDP | CDH | MapR): Any Hadoop technology
BIG DATA STORAGE
• Azure Storage
Big Data has new data characteristics
Petabytes
Technology advances:
• Low cost memory
• High speed networking
• Flash and SSD storage
• CPU processing doubling
From data to decisions and actions
Action
Value