Professional Documents
Culture Documents
Sophistication
THOUGHT LEADERSHIP
Become Analytics-Driven W H I T E P A P E R
APR. 2017
Data production will be
R Systems Inc.
44 times greater in 2020
than it was in 2009.
Source: IDC
Table of
Contents
1. Introduction 1
2. Analytics-Driven Organization 1
3. Analytics Maturity 4
By 2020, data-driven
organizations are set to
4. Analytics Sophistication 6 achieve additional $65Bn.
in productivity benefits.
5. Analytics Skill Gap 8
6. References 10
7. About R Systems 11
aspects under “data science1 .” 2 As you have heard, data is the new Oil.
1 of 11
As is typical, such interests started out of At the same time, this has brought in
specific industries that realized the need big opportunities for many players as
which were then led by the innovators well. In early years, data warehousing
(technology enthusiasts) and early and data marts were of a significant
adopters (visionaries). focus with IT-led reporting and BI. On
the advanced analytics end, the focus
In Figure 1, the technologies involved was on finding rewarding use cases
have been constantly changing at a very and right algorithms for commercial
fast pace. E.g., due to the exponential use. By mid ‘2000, the focus shifted to
growth in volume/ velocity of data and cost-effective data infrastructure, to
data variety due to internet connectivity cope up with the exponential growth
and proliferation of web products, data of data and business-led BI, with an
technologies have been under both emphasis on visualization and self-
evolutionary and revolutionary changes. service. On the advanced analytics
Analytics tools and related technologies front, with significant breakthroughs
have also been continuously improving in the processing speed, open-source
at a great pace with the most significant tools and easy availability of all kinds
change being the abundance and easy of data, it became possible to deliver
availability of open-source analytics more confidently on the promises of
tools/ software. In parallel, for classic Machine Learning and AI.
and new companies, the possibilities to
innovate/ improve their products and
services by leveraging analytics has This whitepaper focuses on the
significantly increased. This has created Two main breakthrough changes
amazing new market opportunities. The in the recent years:
use of analytics for product innovation
Big Data Revolution and ML/ AI
and improvement is the norm today. In
short, the general theme of evolution
from detailed/ granular data of all kinds
to intelligent business decisions has
stayed unchanged in the past 30 years,
while the technologies, processes and By 2018, more than half of
implementation details on achieving it large organizations globally
have been in constant change. will compete using
advanced analytics and
For many classic organizations, coping- proprietary algorithms
with constant changes in technologies
Source: Gartner
and applications has become their main
challenge towards becoming a data and
analytics-driven organization.
Thought Leadership Whitepaper | analytics.rsystems.com 2 of 11
2.1 Big Data Revolution standpoint, BI had now gone ahead of
ETL which is an indication of putting
For a moment, we first consider one of data to analytics use. BI is the fastest
these significant technology changes way to show the value of data and
that are the big data revolution and its justify investments in any new data
drastic impact on organizational data technology. Focusing on BI doesn’t
infrastructure. In early to mid ‘2010, big diminish advanced analytics’ value,
data was at its hype. So many companies that typically brings higher business
were experimenting or planning to value (ROI) although it has a longer
explore Hadoop ecosystem. At the time, turnaround time and higher risks.
top technical challenges with Hadoop
were the speed of processing, ETL and
data preparation, useage of advanced 2.2 Resurgence of Machine
analytics and real-time embedding of Learning and AI
advanced analytics in operational flows[4,5]
In 2015, per Gartner, investment in big Earlier we mentioned a few successful
commercial uses of ML in early ‘90s, in
data going strong, where 75% of all com-
financial (real-time fraud detection
panies were either investing or were
using neural networks[10]) and retail
planning to invest in big data in the next (recommendation systems) verticals
two years[7]. By 2016, big data maturity that are still in use. Since then, there
survey[6] indicated that 75% of the have been many applications of ML
respondents already had at least one used everyday, but still go unnoticed.
Hadoop cluster in some shape or form, In recent years, one very interesting
whereas 73% had it in production. The development is the resurgence of AI,
survey also established that big data was which comprise of ML. We are no
still growing fast, data governance was a longer concerned whether computers
growing concern ahead of security and (or robots) can ever display intelligent
access. It also revealed big data cloud behavior, but rather how intelligent
option was a preferred choice for partial/ they may get as witnessed by recent
full data storage or development for AI successes like Watson in Jeopardy!
many organizations3 . It also indicated and Google DeepMind AlphaGo4 are
that 95% of respondents have achieved the evidence of the possibilities in AI.
positive value from their investments in
There have been different approaches
big data, or are expecting that they will.
to AI. The most successful approach,
Given that all the investments in data
that typically outperforms others,
infrastructure, were from a workload’s involves ML applied to large datasets.
3 In 2014, there was fear of Cloud among many enterprise companies.
That mindset has quickly changed. Today, every enterprise is either 4 AlphaGo uses Deep Learning via convolutional neural networks
using the cloud or in process of moving there. This trend is witnessed trained to predict moves that expert players would make, using a
across many industries like financial, retail, insurance, healthcare, dataset of 30 million different positions from real games. The real
retail, and even government. world applications are much harder than games but ML/ AI have
recently proven their success on many new real-world use cases.
3 of 11
For some of these applications, the results Many enterprises remain in the early
are so human-like that many companies stages of their analytics initiatives and
are looking to hire more data scientists, strategically consider it as an engine
collecting more data and applying latest of innovation. Almost every classic
ML & AI technologies to improve their organization is realizing the journey
existing decision-making processes or through analytics maturity requires a
innovate new data products. lot of hard work and commitment due
to constant changes in technologies
New analytical tools for making decisions, involved and the pressures of new
like Watson, are creating entirely new competition. This commitment does
opportunities. With the digitization of touch many aspects of organizational
world commerce, the emergence of big behavior, from revamping the data
data and the advancement of analytical management and analytics skills/
technologies, organizations now have tools, to changing cultural norms[3].
amazing opportunities to differentiate
themselves through analytics. Per the
study, “Analytics: The Widening Divide,” by Decisions
Advantage
Data Infrastructure
For more than 20 years, organizations that
embraced analytics enjoyed a clear-cut
competitive advantage over their peers. Data Types (Structured, Semi-structured, Unstructured)
Prescriptive
ANALYTICS SOPHISTICATION = COMPETITIVE ADVANTAGE
Advanced Analytics:
Predictive & Proactive
OPTIMIZATION
STATISTICAL
ANALYSIS What is happening?
AD-HOC
What happened specifically? REPORTS
What exactly is the problem?
OLAP
What happened? DRILL DOWN
Figure 3: Competitive advantage has direct correlation with analytics sophistication. Here we
measure sophistication only by examples of analytics deployed from less complex
basic analytics to more sophisticatedalgorithmic techniques (advanced analytics) 7 of 11
As an enterprise recognizes the value of 5. Analytics Skill Gap
its data through the analytics processes,
it will continue to improve upon them to Data Science is a multi-disciplinary
maximize their business value. However, field that ideally requires a strong
there is a saturation point beyond which, technical knowledge of ML, pattern
the marginal contribution achieved from recognition, programming, statistics,
the basic analytics practices quickly mathematics, probability, modeling,
diminishes[1]. Precisely at this point, a databases, and computing. Given the
disruptive change has to take place in an existing silos in our higher education
organization’s vision and culture in order system, companies had to hire and
for it to transition into a higher form of train data scientists. In recent years,
analytics. In Figure 3, this transition point there have been many data science
is illustrated using a yellow lightning bolt programs (degreed or non-degreed)
while Figure 4, outlines some key factors which are introduced in different
required for this disruptive transition. At settings (academic/ non-academic),
this side of the spectrum, a company but still data science can be best
becomes more proactive in employing a learned on the job, assuming that the
data-driven predictive and prescriptive companies have experienced data
processes. Statistical Analysis, Machine scientists onboard who can mentor
Learning (ML), Predictive Analytics (PL), the new hires. However, due to lack of
Forecasting, Artificial Intelligence (AI) data science talent, high emphasis has
and mathematical optimization (under been put on the some of the data
real-world constraints) are examples of science techniques and other stand-
Advanced Analytics techniques. Data alone/ cloud-based tools like Python
scientists typically operate in this range. (Pandas, Scikit-learn, etc.), R, SAS,
It is worthy to mention that when an SPSS, SQL, Azure ML, H2O, Hadoop
organization transitions from large data ecosystem (Hive, Spark, HBase, etc.).
to big data, there would be disruptive
changes required as well. The enterprise Data scientists always deal with large
will clearly need different mindset and granular data, in all shapes and forms
thought process in its approach along and from a variety of sources. They
with various tools and techniques to have to deal with a huge number of
cope up with big data. The traditional BI variables and employ advanced sets
and advanced analytics skillset also of algorithms. A data scientist always
needs to improve and evolve to address demands more data. “All data in raw
the challenges of the big data. [Figure 5] form” is always preferred, but it may
not always be practical to store and
analyze all that data[1].
(BDA)
to be able to easily explain it to others
9 of 11
REFERENCES:
FFC483
[1] Hassibi Khosrow - 2014 High Performance Data Mining and Big Data
Analytics: The Story of Insight from Big Data (Available on Amazon)
[2] MIT Sloan Management Review Research Report: 2011 Analytics The
Widening Divide
[3] MIT Sloan Management Review Research Report: 2016 Beyond The
Hype: The Hard Work Behind Analytics Success
[4] Rogers Shawn John Myers and Barry Devlin: 2013 Operationalizing the
Buzz: Big Data 2013 Research Enterprise Management Research (EMA)
[5] Graham Bradley and M R Rangaswami 2013 "Do you Hadoop? A
Survey of Big Data Practitioners" Sand Hill Group - October 29 (Accessed
March 31, 2017) sandhill.com
[6] Hortonworks & AtScale 2016: 2016 Big Data Maturity Survey
[7]Gartner.com 2015 "Survey Analysis: Practical Challenges Mount as Big
Data Moves to Mainstream
[8] Ernst & Young (EY) 2015: Becoming an Analytics Driven Organization
to Create Value
[9] Analytics Maturity Assessment 2017 (Accessed April 30, 2017)
http:⁄⁄analytics.rsystems.com⁄data-analytics-consulting⁄analytics-maturity-assessment⁄
[10] Hassibi Khosrow 2001 "Detecting Payment Card Fraud with Neural
Networks" In Business Applications of Neural Networks: The
State-of-the-Art of Real-World Applications by Paulo J G Lisboa Alfredo
Vellido and Bill Edisbury, 141-158. World Scientific
Insights-driven businesses
http:⁄⁄bit.ly⁄IDC-estimates-big-data
will earn over $1.2 Trillion
http:⁄⁄bit.ly⁄IDC-press-release annually, by 2020
http:⁄⁄bit.ly⁄Gartner-press-release Source: Forrester
http:⁄⁄bit.ly⁄Forrester-predictions