R-42_WangJ.pdf

COMPUTATIONAL METHODS IN ENGINEERING AND SCIENCE
EPMESC X, Aug. 21-23, 2006, Sanya, Hainan, China
©2006 Tsinghua University Press & Springer
Reliability Analysis using Saddlepoint Approximation
Jia Wang1*, Ka-Veng Yuen1, Siu-Kui Au2
1
2
Faculty of Science and Technology, University of Macau, Macao, China
Department of Building and Construction, City University of Hong Kong, Hong Kong, China
Email: ma46616@umac.mo
Abstract Saddlepoint approximation is a powerful tool for accurate estimation of the cumulative distribution function
(CDF) of a random variable, if its cumulant generating function (CGF) exists. In this paper, the saddlepoint
approximation is introduced for reliability analysis which is one of the most important tasks in engineering design. The
efficiency of the proposed method, is demonstrated by a 10-story shear building subjected to white noise excitation.
Key words: Failure probability, Monte Carlo simulation, reliability analysis, saddlepoint approximation
INTRODUCTION
The assurance of the performance of engineering structures within the design life is one of the principal responsibilities
of engineers. However, due to inherent uncertainties such as material properties and loading conditions to which they
are exposed to during their daily operation, the assurance of performance can seldom be perfect and risk generally
exists in engineered structures. Consequently, reliability analysis of the structures, concerned with the probability that
the structure will not reach some specified state of failure, arises as a probabilistic measure of the assurance of the
structural performance.
PROBLEM STATEMENT
The determination of the reliability of a structure usually requires calculating its complement, the probability of failure,
which is essentially the integration of the probability density function (PDF) in the failure domain. For an engineering
application, a random variable Y is used to specify the performance of the structure, and the failure event is defined as
F = {Y > b} where b is a corresponding bound. If the PDF f (y) of the performance variable Y can be obtained, the
failure probability will be given by the following integration:
In reliability analysis, all the uncertainties are parameterized and denoted by a random vector Z = [z1, …, zn], where n is
the number of random variables. The performance variable Y is a function of Z, but the relationship between them is
often complex such that it is not feasible to compute f (y) from the joint PDF of the uncertain parameters, q(Z) .
Consider an ns-story building under random excitation as an example. The time duration of interest T is discretized into
nt time instants {t1, …, tnt}, and the failure event is defined as the exceedance of a threshold b by any of the inter-story
drifts during T. Let Xj(tk; Z) denote the jth inter-story drift response at the kth time instant, then the failure event can be
defined as
where the performance variable Y is
The failure probability can be computed from the distribution of Z as:
⎯ 421 ⎯
where F is the failure domain within the parameter space; IF is an indicator function: IF(Z) = 1 if Z∈F and IF(Z) = 0
otherwise. Although the evaluation of this integral is straight-forward in theory, it is computationally prohibitive when
the integration dimension n is large and the failure domain is complex.
MONTE CARLO SIMULATION METHOD
Monte Carlo simulations (MCS) offer a robust methodology well suited for solving such high-dimensional reliability
problems [1- 2]. In this method, the failure probability PF = IF(Z)q(Z) dZ is viewed as the mathematical expectation of
IF(Z) with Z distributed according to q(Z). This implies that the failure probability can be estimated as a sample average
of the indicator function IF(.) over the sequence of independent and identically distributed (i.i.d.) samples {Z1, …, ZN}
generated according to their PDF q(Z) :
where N is the number of samples. The MCS estimator
in Eq. (5) converges to the failure probability PF with
probability 1 (Strong Law of Large Numbers), and is asymptotically normal distributed as N→∞ (Central Limit
Theorem). Therefore, it is unbiased, i.e., E[ ] = PF. The efficiency of the MCS method, and in general the efficiency
of a simulation-based reliability method, can be measured by the coefficient of variation (C.O.V.) of the estimator,
which is defined as the ratio of its standard deviation to its expectation, i.e.,
Note that the expression depends only on the failure probability PF and the number of samples N, regardless of the
number of uncertain parameters n. The applicability of MCS is independent of the specifications of the problem. The
specifications of the problem, such as the type of excitation models and the type of parameter PDFs as well as the
number of uncertain parameters, enter the MCS procedure through only the indicator function IF(.), whose value is
determined by structural analysis. The main drawback of MCS, however, is that it is not efficient to estimate small
failure probabilities. For small PF, Eq. (6) can be approximated as
In practice, the failure probability is expected to be small, because one does not anticipate to construct a fragile
engineering structure. In this case, Eq. (7) implies that the computational effort required by MCS to obtain an
acceptable accuracy is prohibitively large. For example, if PF = 10−6, one needs N = 108 samples (structural analyses) to
achieve the accuracy level of 10% C.O.V.
SADDLEPOINT APPROXIMATION METHOD
The saddlepoint approximation technique was introduced to statistics for approximating the PDF by Daniels (1954) [3].
Since then, the research and applications of saddlepoint approximations have vastly increased [4-6]. A detailed
account of saddlepoint approximations and related techniques can be found in [7].
Let Y be a random variable with PDF f (y). The moment generating function of Y is defined as
and the cumulant generating function (CGF) of Y is defined as the natural logarithm of M(ξ):
To restore f (y) from K(ξ), one can apply the inverse Fourier transformation:
⎯ 422 ⎯
Using the exponential power series and Hermite polynomials, Daniels (1954) [3] developed the saddlepoint
approximation formula for f (y):
(.) is the second derivative of the CGF, and ξs is called the saddlepoint, which satisfies the following
where
relationship for a given value of y:
Based on the saddlepoint approximation for f (y) in Eq. (11), Lugannani and Rice (1980) [8] derived the following
formula to approximate the CDF of Y:
where Φ(.) and φ(.) are the CDF and PDF of the standard normal random variable, with w and v given by
where sgn(.) is the signum function.
To obtain the CDF value of Y at a point y, it requires only to find its corresponding saddlepoint ξs instead of integration.
Consequently, the existence and properties of the real roots for Eq. (12) is the key to the application of the saddlepoint
approximation. According to Eq. (9), Eq. (12) can also be rewritten as
The slope of the ξ-y curve is
By applying the Cauchy-Schwarz inequality, one can show that
. Thus,
which ensures that the curve y = (ξ) is monotonically increasing and so the root ξs is unique if it exists. The property
(ξ) > 0 also guarantees that the square root in Eq. (14) gives real values. Define a function g(ξs) ≡ ξs y −K(ξs) =
ξs
(ξs) −K(ξs), then (ξs) = ξs (ξs). Since
(ξs) > 0, the function g(ξs) is monotonically decreasing when ξs < 0,
and monotonically increasing when ξs > 0. Thus, we can obtain
to give a real value for w.
CONSTRUCTION OF THE CGF OF Y
As discussed in the previous section, the key to the application of the saddlepoint method is to estimate the CGF of the
system performance Y = g(Z). In engineering applications, various sampling techniques such as MCS are available for
generating the sample values of Y provided that the distribution of Z and the analysis model Y = g(Z) are given.
According to the construction way of the CGF based on the sample values {Y1, …, YN}, the saddlepoint method is
divided into three classes in this paper.
1. Cumulant method The cumulant generating function (CGF) K(ξ) can be expanded in Taylor series at the point
ξ = 0:
⎯ 423 ⎯
Since K(0) = 0, the above equation can be simplified as
where κj ≡ K( j)(0) is called the jth cumulant of the random variable Y. For example, κ1 = E[Y] and κ2 = var[Y]. In practice,
the first four cumulants are used in the series expansion [9], i.e.,
The unbiased estimators for the cumulants of a population are generally provided by k statistics [10]. Based on the
sample values of the random variable Y, the unbiased estimators for the first four cumulants are given by
. For the application of the cumulant method, we
where Sr (r = 1, …, 4) is the rth moment estimate and Sr =
(ξ) > 0 and thus the curve y = (ξ) is monotonically increasing in the domain of ξ concerned.
have to ensure that
Noting that
(0) = κ2 > 0, according to the continuity of the function
(ξ), we can find a region of ξ in the
neighborhood of ξ = 0 such that the Lugannani-Rice formula can be used to approximate the CDF P(Y ≤ y). By
examining the curve of
(ξ), this region can be found in four different cases:
Case 1 κ4 < 0
> 0. Typical curves of y = (ξ) and its slope function
Then, Δ =
can be given by (ξl ;ξu) where ξl and ξu can be computed from
(ξ) are shown in Fig. 1, and the region
(23)
Case 2 κ4 ≥ 0 and Δ =
≤0
(ξ) ≥ 0 and y = (ξ) is monotonically increasing for all ξ, as shown in Fig. 2. A special subcase occurs
Then =
≤ 0, κ3 = 0 is required. Thus
(ξ) = κ2 is a positive
when κ4 = 0. In order to satisfy the condition Δ =
constant for all ξ.
Figure 1: The typical curves of
(ξ) and its slope function
⎯ 424 ⎯
(ξ) in the first case with the cumulant method
Figure 2: The typical curves of
Case 3 κ4 ≥ 0 and Δ =
(ξ) and its slope function
(ξ) in the second case with the cumulant method
> 0 as well as κ3 > 0
Then the region of ξ which is applicable for the cumulant method is (ξl ,∞) with ξl given by
(24)
The curves of y = (ξ) and its slope function
the expression of ξl given by
(ξ) are shown in Fig. 3. A special subcase occurs when κ4 = 0, with
(25)
Case 4 κ4 ≥ 0 and Δ =
> 0 as well as κ3 < 0
Then the region of ξ which is applicable for the cumulant method is (−∞, ξu) with ξu given by
(26)
Figure 3: The typical curves of
(ξ) and its slope function
(ξ) in the third case with the cumulant method
Figure 4: The typical curves of
(ξ) and its slope function
(ξ) in the fourth case with the cumulant method
⎯ 425 ⎯
The curves of y = (ξ) and its slope function
the expression of ξu given by
(ξ) are shown in Fig. 4. A special subcase occurs when κ4 = 0, with
(27)
From the bounds of ξ, we can compute the bounds yu (corresponding to ξu) and yl (corresponding to ξl ). If the specified
y ∈ (yl ; yu), then the corresponding CDF value can be computed using the Lugannani-Rice formula.
2. Expectation method The main idea of the expectation method is to use the sample average to substitute for the
expectation,
where E0(ξ) ≡ (1/N)
where E1(ξ) ≡
. Then the functions
= (1/N)
and E2(ξ) ≡
(ξ) and
(ξ) can be obtained through differentiation:
= (1/N)
. According to the Cauchy-Schwarz
2
inequality, we can get
, which ensures
(ξ) > 0 and
monotonically increasing for every ξ as shown in Fig. 5. Another important feature of the curve y =
Figure 5: The typical curves of
(ξ) and its slope function
(ξ) with the expectation method
Figure 6: The typical curves of (ξ) and its slope function
(ξ) with a combination of
the expectation method and the cumulant method
⎯ 426 ⎯
(ξ) is
(ξ) is
and
which implies that the expectation method is applicable for estimating P(Y ≤ y) if y ∈ [Ymin, Ymax].
3. Proposed method In practical applications, the performance variable Y is always distributed over [0,∞). A
combination of the cumulant method and the expectation method is proposed to estimate the CGF as follows:
Then,
(ξ) and
(ξ) are given by
Table 1 Selected threshold levels
Table 2 Results for the sample mean of the estimated P(Y > b)
and
with the typical curves shown in Fig. 6. The saddlepoint corresponding to a specified threshold y is obtained from the
relationship y = (ξ).
The continuity of the function
(ξ) is examined at the point ξ = 0, i.e.,
⎯ 427 ⎯
From the discussion of the cumulant method and the expectation method, the proposed method is applicable in the
interval [Ymin, yu).
EXAMPLE
A 10-story building model is used to examine the proposed method. The inter-story stiffness and floor mass is taken to
be identical for all stories. The stiffness-to-mass ratio is taken to be 1762.2s−2 so that the building has fundamental
frequency 1.0Hz. Furthermore, Rayleigh damping is used such that the damping ratios of the first two modes are 2.0%.
The building is subjected to Gaussian white noise ground accelerations with spectral intensity 0.001m2s−3. The
response of the system is computed at the discrete time instants, where the sampling interval is assumed to be Δt =
0.01s and the duration of study is T = 10s, so that the number of the time instants is nt = T/Δt +1 = 1001. The failure
event is defined as the exceedance of a threshold b by any of the inter-story drifts during the time duration. Let Xj(tk)
denote the inter-story drift response for the jth story at time tk, j = 1, …, 10, k = 1, …, 1001, then the failure event of
.
interest can also be defined as F = {Y > b}, where Y =
Table 3 Results for the sample C.O.V. (×10−3) of the estimated P(Y > b)
Table 1 shows some reference threshold levels b corresponding to typical failure probabilities (for example 10−3) using
the MCS method. In order to ensure the accuracy of the results, 10 million samples are generated so that the C.O.V. of
the estimated P(Y > b) are very small. Note that the largest C.O.V. which occurs at the failure probability of 10−4 is
316×10−4, i.e., 3.16%.
Table 2 shows the sample mean of the estimates of PF over 1000 independent simulation runs with 10000 samples for
each run at the selected b using the MCS method ( ), the cumulant method ( ), the expectation method ( ) and the
are actually the estimated PF using 10 million samples.
proposed method ( ). Note that the sample mean
are considered as ‘exact’ PF and used to measure the accuracy
Considering their small C.O.V. as shown in Table 1,
are very close to
with the largest relative error of
of other results. It is clearly seen that the sample mean and
−3
at PF = 0.90,0.80
4% and 8% (both for PF = 1.0×10 ). Note that although the deviation of the sample mean from
and , it can be seen that the
is large, its relative error is still small (1.8% and 2.1%). By comparing the results of
(than ) when PF is small, and the results
results of the sample mean using the cumulant method ( ) are closer to
of the sample mean using the expectation method ( ) are preferable when PF is large. Combining the advantages of
the cumulant method and the expectation method, the results of the sample mean using the proposed method ( ) are
shown in the last column.
Table 3 shows the sample C.O.V. of the estimates of PF over the same 1000 independent simulation runs as those for
Table 2 using the MCS method (α0), the cumulant method (α1), the expectation method (α2) and the proposed method
(α3). In order to judge the efficiency of the cumulant method, the expectation method and the proposed method, the
number of required samples using MCS to obtain the same accuracy as α1, α2 and α3 (i.e., N1, N2 and N3), is compared
with the present sample number N0. According to Eq. (6), Ni can be measured by (1−PF)=(PF ), i=0, 1, 2, 3. Thus the
ratio Ni=N0 = (α0=αi)2, i=1, 2, 3, is used to compare the required samples to obtain the same accuracy between MCS
and the saddlepoint methods, as shown in the fourth, sixth and eighth columns. It is clearly seen that the cumulant
⎯ 428 ⎯
method is efficient to estimate PF when PF is not too large (≤ 0.60), with the observation that more samples are required
for MCS to achieve the same accuracy as α1 (N1/N0=1.2 to 20); whereas the required samples using MCS to obtain the
accuracy α1 are less for most of large PF, with N1/N0 = 0.21, 0.37, 0.62 for PF = 0.99, 0.8, 0.7, respectively. It is also
noted that the efficiency of the expectation method is obvious for large PF (≥ 0.40), with more required samples for
MCS to obtain the same accuracy as α2 (N2/N0=1.2 to 1.5); while the efficiency is not always maintained for small PF,
with the ratio N2/N0 = 0.93, 0.84, 0.91 for PF = 0.2, 0.1, 1.0×10−4, respectively. The seventh and eighth columns show
the results using the proposed method, a combination of the cumulant method and the expectation method for the
estimation of small PF and large PF respectively. Note that the results using the proposed method are consistent with
those using the cumulant method for PF ≤ 0.40, and with those using the expectation method for PF ≥ 0.50, i.e., the
efficiency of the proposed method can be maintained for all PF (N3/ N0 = 1.2 to 20).
CONCLUSIONS
The saddlepoint approximation technique is presented for the reliability assessment. In practice, the key of the
application of the saddlepoint method is to estimate the cumulant generating function (CGF) of the performance
variable based on its samples. As shown in the illustrative example, the saddlepoint method, with the CGF constructed
by a combination of the cumulant method and the expectation method, can provide good estimations of the failure
probabilities, and is significantly more efficient than MCS.
Acknowledgement
The financial support from the research committee of the University of Macau under grant RG068/04-05S/YKV/FST
UMAC is gratefully acknowledged.
REFERENCES
1. Rubinstein RY. Simulation and the Monte-Carlo method. Wiley, NewYork, 1981.
2. Fishman GS. Monte Carlo: Concepts, Algorithms, and Applications. Springer-Verlag, NewYork, 1996.
3. Daniels HE. Saddlepoint approximations in statistics. Annals of Mathematical Statistics, 1954; 25: 631-650.
4. Reid N. Saddlepoint methods and statistical inference. Statistical Science, 1988; 3: 213-227.
5. Huzurbazar S. Practical saddlepoint approximations. American Statistician, 1999; 53(3): 225-232.
6. Goutis C, Casella G. Explaining the saddlepoint approximation. American Statistician, 1999; 53(3): 216-224.
7. Jensen JL. Saddlepoint Approximations. Clarendon Press, Oxford, England, 1995.
8. Lugannani R, Rice SO. Saddlpoint approximation for the distribution of the sum of independent random
variables. Advances in Applied Probability, 1980; 12: 475-490.
9. Sudjianto A, Du X, Chen W. Probabilistic sensitivity analysis in engineering design using uniform sampling and
saddlepoint approximation. SAE 2005 Transactions Journal of Passenger Cars: Mechanical Systems, Paper No.
2005-01-0344.
10. Fisher RA. Moments and product moments of sampling distributions. Proceedings of the London Mathematical
Society, 1928; 2(30): 199-238.
⎯ 429 ⎯