Professional Documents
Culture Documents
MTBF: Understanding
Its Role in Reliability
By
David C. Wilson
Founder / CEO
March 24, 2012
Wilson Consulting Services, LLC 2012 Page 1 of 47 MTBF: Understanding Its Role in Reliability
Table of Contents
Section Page
1.0: Introduction---------------------------------------------------------2-47
2.0: Reliability Mathematics-------------------------------------------7-47
3.0: Bathtub Curve-----------------------------------------------------21-47
4.0: Failure Rate--------------------------------------------------------23-47
5.0: MTBF----------------------------------------------------------------30-47
6.0: MTTF----------------------------------------------------------------38-47
7.0: MTTR---------------------------------------------------------------4047
8.0: Relationships Summary & Reliability Models--------------41-47
9.0: Summary------------------------------------------------------------46-47
Wilson Consulting Services, LLC 2012 Page 2 of 47 MTBF: Understanding Its Role in Reliability
Introduction
1.0: Introduction
The objective is to enable an individual who is unfamiliar with the use of the Mean
Time Between Failures (MTBF) reliability parameter to understand its relationship
in product predictions, failure rates, field performance, etc. After completing this
tutorial, the participant will know and understand how MTBF relates to product
performance over time.
In order for the practitioner to speak intelligently and authoritatively on the
parameter, MTBF, it important at a minimum that a cursory understanding of the
mathematical concepts involved with Reliability be mastered. Therefore,
mathematical and practical treatments relative to MTBF are included in this tutorial
using the exponential distribution model.
Additionally, to achieve a thorough understanding of statistics and reliability, many
statistical experts list four kinds of understanding as shown below.*
1. Computational/Numerical
2 Visual/Graphical
3. Verbal/Interpretive
*For additional information on the items above, please contact the author
of this tutorial.
Wilson Consulting Services, LLC 2012 Page 3 of 47 MTBF: Understanding Its Role in Reliability
Introduction contd
What is MTBF?
Measure of rate of failure within the design life.
Reliability?
Failure?
Failure Rate?
Hazard Rate?
MTBF/MTTF?
Wilson Consulting Services, LLC 2012 Page 4 of 47 MTBF: Understanding Its Role in Reliability
Introduction contd
Wilson Consulting Services, LLC 2012 Page 5 of 47 MTBF: Understanding Its Role in Reliability
Introduction contd
Reliability Definitions
RELIABILITY [R(t)] - The probability that an item will
perform its intended function without failure under stated
conditions for a specified period of time.
Wilson Consulting Services, LLC 2012 Page 7 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
P.D.F. contd
A positive continuous random variable follows an exponential
distribution if the probability density function is as shown:
ae ax ( For x0)
f ( x)
0 ( For x0)
Wilson Consulting Services, LLC 2012 Page 8 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
P.D.F. contd
It is a probability density function (P.D.F.)
f (t ) e t
Lambda is a constant and is called the failure rate
f(t)
0 t
Figure 1
Wilson Consulting Services, LLC 2012 Page 9 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
Figure 1A t
Wilson Consulting Services, LLC 2012 Page 10 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
F (t ) 1 e t
F(t) 1 R(t)
QED
The probability that a component fails in the interval
t
0t is F (t ) 1 e
Wilson Consulting Services, LLC 2012 Page 12 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
1. Hazard function
f (t ) f (t ) e t
h (t ) t
1 F (t ) R (t ) e
2. Cumulative hazard function:
(Integrating the hazard function to obtain the cum hazard function. Use dummy variable of
integration )
t t f ( ) t e t t
H ( ) h( )d d d d | t
0 0 R( ) 0 e 0 0
H (t ) ln R(t )
Taking the natural log of both sides : R (t ) e t
ln R (t ) ln e t t
t can be exp ressed as ln( e t ) ln R (t ) QED
Wilson Consulting Services, LLC 2012 Page 13 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
Hazard Rate
The exponential P.D.F. is a valid useful life time to failure model
for many debugged electronic components.
f (t ) e
t
P.D.F.
0t
Where represents a constant failure rate that does not vary
with time. For reliability purposes, the C.D.F. is designated F(t)
rather than F(x) and F(t)t2 t1 is defined as the probability of
failure in the interval t1 < T < t2. t
f (t ) e
h(t ) t
h(t)
R(t ) e
U seful L ife
(t)
Figure 2 t time
Wilson Consulting Services, LLC 2012 Page 14 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
1.0 Illustrations: Exponential
R(t) Reliability Curve
R(t ) e t
0 time t
Figure 3
f(t)
Probability Density Function Curve
Density
f (t ) e t
0 time Figure 4 t
Wilson Consulting Services, LLC 2012 Page 15 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
Reliability Areas of Probabilities Illustration
U nreliability area F (t)
t
F (t ) f ( )d P(T t )
0
1.0
R eliability exponential distribution plot
R (t)
R eliability area R (t)
t
R(t ) 1 f ( )d
0
0 1 2 3 4 t
Figure 5
R (t ) 1 F (t )
t
F (t ) e | 0 t
t
R (t ) e
F (t ) 1 e
Wilson Consulting Services, LLC 2012 Page 16 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
Example 1:
For exponential distribution, take the integral of f(t), where
f(t) = e-t , where t =1
1
F (t )
0
f ( t ) dt 0 t 1
1
F (t ) e t dt 0 t 1
0
Solution
t t
F (t ) e t dt e
t 1
1 e 1 0.3678 0.6322
0 0
Wilson Consulting Services, LLC 2012 Page 17 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
Example 2:
An electronic device contains discrete transistors. Each transistor has
a constant failure rate of = 1105 failure rate/hour. What is the
probability that a single transistor will survive a mission of 104 hours?
R (t ) 1 F (t )
Solution
R (t ) e
t
R (t ) e
5
4
10 10
R (t ) e 0 . 905
F ( t ) 1 R ( t ) 1 0 . 906 0 . 095
Wilson Consulting Services, LLC 2012 Page 18 of 47 MTBF: Understanding Its Role in Reliability
Reliability Mathematics contd
Example 3:
What is the probability that it will survive a mission of 103 hours?
5
3
e 10 10
R(t )
Solution
R(t ) e 0.01
R(t ) 0.990
Wilson Consulting Services, LLC 2012 Page 19 of 47 MTBF: Understanding Its Role in Reliability
Bathtub Curve
3.0 Bathtub Curve
Reliability Bathtub curve for constant failure rate
Failures
Figure 6
Time
Infant mortality- often due to manufacturing defects
In electronics systems, there are models to predict MTBF for the
constant failure rate period (Bellcore Model, MIL-HDBK-217F, others)
Understanding wearout requires data on the particular device
- Semiconductor devices should not show wearout except at long times
- Electrical devices which wearout: relays, EL caps, fans, connectors, solder
Wilson Consulting Services, LLC 2012 Page 20 of 47 MTBF: Understanding Its Role in Reliability
Bathtub Curve contd
Infant M ortality Useful Life W earout
External
Stress Failures
M anufacturing W earout
Defects Failures
Figure 7
Sensors & Firewalls Time
Reliability Prediction Wearout Mechanism
Analyzing short term warranty/RMA
& Validation Analysis
data
Materials characterization
Tying designs to mfg. capabilities Beginning use of:
Long-term data mining
Instituting process CTQ checkpoints Field & industry data
ALT (test to failure)
Improving environment knowledge Prediction tools
System Life Modeling
Reliability Metrics on Dashboards Accelerated Life Testing
Wilson Consulting Services, LLC 2012 Page 21 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate
4.0 Failure Rate
Graph: Cumulative distribution function (c.d.f.) for the exponential distribution
function. Failure Rate Hazard Rate
- Constant with respect to time - A function of time
- An average - Instantaneous
100%
Probability of Failure
0.000100
Instantaneous
Hazard Rate
Average
0.000095
Failure Rate
0.000090
If H ( t ) ln R ( t )
Then :
AFR
ln R ( t 2 ) ln R ( t 1 )
t 2 t1
ln R ( t 1 ) ln R ( t 2 )
AFR
t 2 t1
QED
Wilson Consulting Services, LLC 2012 Page 23 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate - contd
Example 1. Exponential Distribution
R(t)
0.98
Te
xt
0.95
0
t1 t2 t
8760 17520
Figure 9
tim e in hours
ln R (t 1) ln R (t 2 )
AFR
t 2
t1
ln( 0 . 98 ) ln( 0 . 95 ) 0 . 0202 ( 0 . 0513 )
AFR
17520 8760 8760
0 . 0513 0 . 0202 0 . 0311
AFR 0 . 00000355 3 . 55 E 06
8760 8760
Wilson Consulting Services, LLC 2012 Page 24 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate - contd
Example 1 continued
Illustration - Exponential Distribution
AFR can be represented by lambda ()
h(t)
3.55E-6
0 10000 20000
hours
Figure 10
Wilson Consulting Services, LLC 2012 Page 25 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate - contd
Estimating Failure Rate
Lambda () can be obtained as an estimate ( -hat) of the true population
for all operating hours for all units tested including failed and those that
completed the test without failing.
number of failures
total unit test time
The denominator is obtained by adding up all operating hours on test of
all units tested, including those that failed and those that completing the
test without failing.
Wilson Consulting Services, LLC 2012 Page 26 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate - contd
Example 2: Five electronics Sub-systems failed from a sample of 1200 which
were used constantly for 90 days. What is the Failure Rate?
Estimate of failure rate for
= 1.93E-06 failures/hour
Other expressions
Failures per million hours (Fpmh)
Fpmh * 10 E 06 (1.93E 06)(10 E 06) 1.93
Percent per thousand hours
This rate rate can be expressed by multiplying (1E+05) resulting in the
average failure rate = AFR = 0.193%/1000 hours
Wilson Consulting Services, LLC 2012 Page 27 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate - contd
s
i 1
i
Wilson Consulting Services, LLC 2012 Page 28 of 47 MTBF: Understanding Its Role in Reliability
Failure Rate - contd
Wilson Consulting Services, LLC 2012 Page 29 of 47 MTBF: Understanding Its Role in Reliability
MTBF
5.0 MTBF
MTBF is not Life
Design Life: Intended period of use which is expected to be failure free.
EXAMPLES:
Wilson Consulting Services, LLC 2012 Page 30 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
1 Mathematical Proof
t e
t
MTBF dt Integrating by parts solution:
0
By substituti on
let : y t udv uv vdu
y let
t
u y & du dy
1 y y
dt dy v e & dv e dv
y y y
ye ye e dy
1 dv
y e
y
MTBF dy
y y
ye
0
e
Integratin g by parts
( y 1) e by factoring
1 y
MTBF
0 ( y 1) e y
1
MTBF QED Recall: e- = 0
Wilson Consulting Services, LLC 2012 Page 31 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
0 10000 20000
Figure 11 hours
1
MTBF te t dt
0
1
MTBF 281,757 hours
3.55 E 6
Wilson Consulting Services, LLC 2012 Page 32 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
Mean Time Between Failures (MTBF)
Example 2:
If a motor is repaired and returned to service six times during its
life and provides 45,000 hours of service.
failures 6
0 .00013333
time 45 ,000
1 1
MTBF 7 ,500 hours
0 .0001333
Also
Wilson Consulting Services, LLC 2012 Page 33 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
Example 3
Using Chi-square model to find MTBF
MTBF Upper and lower bound is calculated such as 50%, 80% 90%, 95%, etc.
2T
MTBF lower
2
,( 2 n )
2T
MTBF upper
2
1 ,( 2 n )
n Number of defects
T Total operating time
Degrees 0f freedom = 2n
Significance level
Wilson Consulting Services, LLC 2012 Page 34 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
, ( 2 n )
1 , ( 2 n )
Wilson Consulting Services, LLC 2012 Page 35 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
Example 4
An electronics assembly has a goal of 0.99 reliability for one year.
What is the MTBF that the designer should work towards to meet the
goal?
Reliability equation: R(t) = e -t and MTBF = 1/
t
Solution: -t
ln R(t) =
MTBF
t = M T B F * ln R (t)
t 8760
M TBF = = = 8 7 2 ,0 0 0 h o u r s
ln R (t) -ln 0 .9 9
Wilson Consulting Services, LLC 2012 Page 36 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
Mission Length, t
(miles) Reliability*
1,000 99.0%
10,000 90.5%
50,000 60.7%
100,000 36.8%
200,000 13.5%
*Constant hazard rate
Wilson Consulting Services, LLC 2012 Page 37 of 47 MTBF: Understanding Its Role in Reliability
MTTF
6.0 MTTF
Mean Time To Failure [MTTF] - For non-repairable items, the
ratio of the cumulative operating time to the number of failures
for a group of items.
Example 1: 1200 monitoring devices are operated for 90 days.
During that time, five failures occur.
failures 5 5
19.29E 07
time 1200* 24*90 2,592,000
1 1
MTTF 518,403 hours
19.29E 07
Also,
Total Operating Time 1200* 24* 90
MTTF 518,403 hours
Total Failures 5
Wilson Consulting Services, LLC 2012 Page 38 of 47 MTBF: Understanding Its Role in Reliability
MTBF contd
Example 2
An electronics assembly has a goal of 0.99 reliability for one year.
What is the MTTF that the designer should work towards to meet the goal?
t
MTTF
ln R(t )
8760
MTTF
ln R(0.99)
8760
MTTF 871,642 hours
(0.01005)
Wilson Consulting Services, LLC 2012 Page 39 of 47 MTBF: Understanding Its Role in Reliability
MTTR
7.0: MTTR
Mean Time To Repair (MTTR) - this is corrective maintenance, which includes
all actions to return a system from a failed to an operating or available state. It is
difficult to plan.
It entails, for example:
1. Preparation time: finding a person for the job, travel, obtaining tools and
test equipment, etc.
2. Active maintenance time, i.e., doing the job
3. Delay time ( logistic time: waiting for spare parts., once the job has been
started.
Summation of expected times of
MTTR
( t r
) individual failures modes
MTBF
Availability
MTBF MTTR mean preventive ma int enance time
Wilson Consulting Services, LLC 2012 Page 40 of 47 MTBF: Understanding Its Role in Reliability
Relationships Summary
Wilson Consulting Services, LLC 2012 Page 41 of 47 MTBF: Understanding Its Role in Reliability
Relationships Summary - contd
Relationships contd
. Definitions:
Failure rate
The ratio of the number of failures within a sample to the cumulative operating time.
Example: PPM / K hrs.
MTBF
Mean time between failure, which means that 63.2% of the product would have failed by this time.
te
t
Example:
MTBF tf (t)dt dt
o o
Reliability R(t)
The probability that an item will perform its intended function without failure under stated conditions for a specified period of time.
t
Example:
R(t ) 1 f (t )dt
0
Unreliability F(t) - Commonly referred to as cumulative distribution function (CDF)
The probability that an item will not perform its intended function without failure under stated conditions for a specified period of
time. Also, commonly referred to as cumulative density function.
t
Example: F (t ) o
f ( t ) dt
Hazard rate h(t)
The "instantaneous" probability of failure of an item given that it has survived up until that time. Sometimes called the
instantaneous failure rate. It is the failure rate per unit time.
f (t )
Example: 2.8E-6 / hour, h (t )
R (t )
Wilson Consulting Services, LLC 2012 Page 42 of 47 MTBF: Understanding Its Role in Reliability
Relationships Summary - contd
Relationships contd
Hazard rate h(t)
The "instantaneous" probability of failure of an item given that it has survived up until that time. Sometimes called the
instantaneous failure rate. It is the failure rate per unit time.
f (t )
Example: 2.8E-6 / hour, h (t )
R (t )
Probability Density Function (PDF) Commonly referred to as f(t)
It is denoted by f(t) where t is a the variable of interested where f(t)dt is the fraction of failure times of the population
occurring in the interval dt. Basically, it assumes a mathematical formula that provides a theoretical model
describing the way the population values are distributed. The definite integral of its domain must equal 1.
e
Example: t
f (t ) , 0t
Examples
1. (1% / 1000 hrs.)
One percent per thousand hours would mean an expected rate of 1 fail for each 100 units operating 1000 hours.
Example: 0.1280%/1000 hours means that 1280 failures each 1 million units operating for 1000 hours.
Wilson Consulting Services, LLC 2012 Page 43 of 47 MTBF: Understanding Its Role in Reliability
Reliability Models
H a z a rd fu n c tio n
T yp e o f
P ro b a b ility d e n s ity R e lia b ility fu n c tio n , (in s ta n ta n e o u s fa ilu re ra te ),
D is trib u tio n P a ra m e te rs
fu n c tio n , f(t) R (t) = 1 - F (t) h (t) = f(t) / R (t)
1
N o rm a l 4
M ean, 0 .4
3
S ta n d a rd f( t ) R( t ) 0 .5
d e v ia tio n , 0 .2
h( t ) 2
N u m e ro u s a p p lic a tio n s. 1
U se fu l w h e n it is e q u a lly 0
3 2 1 0 1 2 3 0
lik e ly th a t re a d in gs w ill 0
3 2 1 0 1 2 3 t
3 2 1 0 1 2 3
t
fa ll a b o ve o r b e lo w th e t
1 t 2
h (t )
f (t )
a ve ra ge . R (t )
f (t ) e 2
2
f ( t ) dt R (t )
2 t
E x p o n e n tia l 1 4
F a ilu re ra te ,
M TBF,
= -1 R( t )
f( t ) h( t )
D e sc rib e s c on sta n t 0 .3 6 8
F a ilu re ra te c o n d ition s.
A p p lie s for th e u se fu l
0 0 0
L ife c yc le of m a n y 0 1 2 3 0 1 2 3 0 1 2 3
P rod u c ts. F re q u e n tly, tim e (t) t t t
Is u se d for x . f (t)
t t 1
e R (t ) e h (t )
W e ib u ll 1
Shape, 1
=3
U se d for
S c a le (c h a ra c te ris tic =3 =3 1 =1
m any
life ), f ( t)
re lia b ility 0 .5
R (t)
a p p lic a tion s. 0 .5
= 0 .5 =1 h (t )
L o c a tio n (m in im u m = 0 .5
C a n te st for th e e n d =1
life ),
in fa n t m o rta lity p e riod . = 0 .5
C u rv e s s h o w n fo r 00 0
C a n a lso d e sc rib e th e 1 2 3 0
= 0 t 0 1 2 3 0 1 2 3
n orm a l a n d t t
e x p on e n tia l t 1
f( t ) (t )
1
exp t ( t )
d istrib u tion s. R( t ) exp h( t)
Wilson Consulting Services, LLC 2012 Page 44 of 47 MTBF: Understanding Its Role in Reliability
Reliability Models - contd
Hazard function
Type of Reliability function,
Probability density (instantaneous failure rate),
Distribution Parameters R(t) = 1- F(t)
function, f(t) h(t) = f(t) / R(t)
Gam m a Failure rate, = 0.5 1 2
1 a=2 a = 0.5
Describes a situation
Events per failure,
when partial failures a=1
or Time to ath R ( t)
can exist. Used to describe f (t ) a = 1 a = 0.5
h ( t) a=1
failure 0.5 1
random variables bounded 0.5 a=2
at one end. The partial
failures can be described a=2
1
0
as sub failures. Is an
2 0
0 2 4
0
t
2 4 0
0
t
2 4
appropriate model for t
the time required for
a total of exactly ? f( t )
t
h( t )
1 t
t t 1et)dt
Note: when a is
independent events f (t ) e ) 1 R( t )
( ) t
to take place if events
an integer ( ) R(t )
I
occur at a constant rate
1 2
0.8
Lognorm al M ean,
0.6
The Lognormal Standard R( t) 0.5 h( t ) 1
distribution is often deviation, f( t ) 0.4
Wilson Consulting Services, LLC 2012 Page 45 of 47 MTBF: Understanding Its Role in Reliability
Summary
9.0 Summary
This paper did elaborate on the value of using the MTBF parameter. However, there have been tremendous
improvements in solid-state devices over the years. In earlier times, electronic components were fragile, used
glass tubes, filaments, etc., had inherent wear out mechanisms. By the same token, earlier solid-state devices had
mechanisms that would cause failures in time, such as chemical contamination, metallization defects, and
packaging defects, which resulted in corrosion and delaminating. Many of these defects were accelerated by
high temperature, which resulted in successful use of the burn-in process to weed out infant mortality."
Statistical prediction during that period was valid and accepted because designs at that time consist mostly of
discrete components; therefore, reliability statistical estimates of the life of a new design had for the most part, a
reasonable correlation to the actual MTBF. Today hundreds of new components are introduced to the market
almost every week and hundreds are probably taken off the market every week; therefore, it is impossible to
make an accurate prediction based on a summation of parts reliability. Example: Mil-Std-217
Todays components do not have wear-out modes that are within most electronics technologically useful life.
Therefore, the vast majority of failures is due to defects in design or introduced in manufacturing.
Unplanned events in manufacturing such as ECN, change in machine operators, or change in vendors
capabilities of design, introduction of cost reduced parts, etc.; any of these or combinations can introduce a
decrease in design margins. Hence: this affects reliability and increase field returns. Many experts feel that it is
best to spend time, not on statistical predictions rather on discovering the real capabilities and identifying the
weak links in the design or manufacturing process, and improving them. This approach will help realize
significant improvement in reliability. The end user environment is even more uncontrolled. The end-user will
always push the limits; therefore, a robust design will have a higher survivor rate for these extremes.
Wilson Consulting Services, LLC 2012 Page 46 of 47 MTBF: Understanding Its Role in Reliability
References
References
1. Patrick D.T. OConnor, Practical reliability Engineering, third edition
John Wiley & Sons 1991
Wilson Consulting Services, LLC 2012 Page 47 of 47 MTBF: Understanding Its Role in Reliability