Nonlife Actuarial Models: B Uhlmann Credibility
Nonlife Actuarial Models: B Uhlmann Credibility
Chapter 7
Bühlmann Credibility
Learning Objectives
2. Variance decomposition
5. Bühlmann credibility
6. Bühlmann-Straub credibility
2
7.1 Framework and Notations
and
2
Var(X | θ) = σX (θ). (7.2)
3
• Assume that the insurance company has similar blocks of policies
with different risk profiles.
4
30% of the risk groups are low risk and 70% are high risk. What are the
conditional mean and variance of the claim frequency?
so that
⎧
⎪
⎨ 20, with probability 0.30,
μX (Λ) = E(X | Λ) = ⎪
⎩ 50, with probability 0.70.
5
Likewise, we have
⎧
⎪
⎨ 20, with probability 0.30,
2
σX (Λ) = Var(X | Λ) = ⎪
⎩ 50, with probability 0.70.
6
uted in [100, 200] with pdf
⎧
⎪
⎨ 0.01, for θ ∈ [100, 200],
fΘ (θ) = ⎪
⎩ 0, otherwise.
• The Bühlmann model assumes that there are n observations of losses,
denoted by {X1 , · · · , Xn }.
• The task is to update the prediction of X for the next period, i.e.,
Xn+1 , based on {X1 , · · · , Xn }.
7
7.2 Variance Components
• The variation of the loss measure X consists of two components: the
variation between risk groups and the variation within risk
groups.
• We first consider the calculation of the overall mean of the loss mea-
sure X. The unconditional mean (or overall mean) of X is
E(X) = E[E(X | Θ)] = E[μX (Θ)]. (7.3)
8
• For the unconditional variance (or total variance), we have
9
• Verbally, equation (7.4) can be written as
or
Total variance = EPV + VHM. (7.6)
2
E[Var(X | Θ)] = E[σX (Θ)] = μPV , (7.8)
10
and
2
Var[E(X | Θ)] = Var[μX (Θ)] = σHM , (7.9)
so that equation (7.4) can be written as
2
Var(X) = μPV + σHM . (7.10)
Example 7.3: For Examples 7.1 and 7.2, calculate the unconditional
mean, the expected value of the process variance, the variance of the
hypothetical means and the total variance.
11
The expected value of the process variance, EPV, is
E[Var(X | Λ)] = Pr(Λ = 20)Var(X | Λ = 20) + Pr(Λ = 50)Var(X | Λ = 50)
= (0.3)(20) + (0.7)(50)
= 41.
As the mean of the hypothetical means (i.e., the unconditional mean) is
41, the variance of the hypothetical means, VHM, is
Var[E(X | Λ)] = (0.3)(20 − 41)2 + (0.7)(50 − 41)2 = 189.
Thus, the total variance of X is
Var(X) = E[Var(X | Λ)] + Var[E(X | Λ)] = 41 + 189 = 230.
For Example 7.2, as Θ is uniformly distributed in [100, 200], the uncondi-
tional mean of X is
E(X) = E[E(X | Θ)] = E(Θ) = 150.
12
As X has a constant variance of 10, the expected value of the process
variance is
E[Var(X | Θ)] = E(10) = 10.
The variance of the hypothetical means is
(200 − 100)2
Var[E(X | Θ)] = Var(Θ) = = 833.33,
12
and the total variance of X is
13
α and β. Claim frequency and claim severity are independently distrib-
uted given a risk group, and the aggregate loss is S. The data of the risk
groups are given in Table 7.2.
Table 7.2: Data for Example 7.5
14
terized by the parameter Λ. As N is Poisson, the mean and variance are
equal to Λ, so that we have the results in Table 7.3.
which is also equal to the unconditional mean E[μN (Λ)]. For VHM, we
first calculate
15
so that
2
σHM = Var[μN (Λ)] = E{[μN (Λ)]2 } − {E[μN (Λ)]}2 = 1,080 − (32)2 = 56.
2
Var(N) = μPV + σHM = 32 + 56 = 88.
16
Table 7.4: Results for Example 7.5 (b)
17
E(X) = E[E(X | Γ)] = (0.125)(10) + (0.375)(12) + (0.5)(6) = 8.75,
and
μPV = (0.125)(20) + (0.375)(36) + (0.5)(12) = 22.
To calculate VHM, we first compute the raw second moment of the con-
ditional mean of X, which is
Hence,
2
σHM = Var[μX (Γ)] = E{[μX (Γ)]2 }−{E[μX (Γ)]}2 = 84.50−(8.75)2 = 7.9375.
18
(c) Aggregate loss The distribution of the aggregate loss S is deter-
mined jointly by Λ and Γ, which we shall denote as Θ. For the conditional
mean of S, we have
2
Var(S | Θ) = λ[σX (Γ) + μ2X (Γ)] = λ(αβ 2 + α2 β 2 ).
19
Table 7.5: Results for Example 7.5 (c)
20
= E{[μS (Θ)]2 } − {E[μS (Θ)]}2
h i
2 2 2
= (0.2)(200) + (0.4)(360) + (0.4)(240) − (280)2
= 4,480.
• EPV and VHM measure two different aspects of the total variance.
• When a risk group is homogeneous so that the loss claims are similar
within the group, the conditional variance is small. If all risk groups
have similar loss claims within the group, the expected value of the
process variance is small.
21
• On the other hand, if the risk groups have very different risk profiles
across groups, their hypothetical means will differ more and thus the
variance of the hypothetical means will be large.
• A small EPV or large VHM will gives rise to a small k. The risk
groups will be more distinguishable in the mean when k is smaller,
in which case we may put more weight on the data in updating our
revised prediction for future losses.
22
Example 7.6: Frequency of claim per year, N, is distributed as a Bi-
nomial random variable BN (10, θ), and claim severity, X, is distributed
as an exponential random variable with mean cθ, where c is a known
constant. Given θ, claim frequency and claim severity are independently
distributed. Derive an expression of k for the aggregate loss per year, S,
in terms of c and the moments of Θ, and show that it does not depend on
c. If Θ is 0.3 or 0.7 with equal probabilities, calculate k.
23
= (10Θ)(cΘ)2 + [10Θ(1 − Θ)](cΘ)2
= 10c2 Θ3 + 10c2 Θ3 (1 − Θ)
= 10c2 Θ3 (2 − Θ).
Hence, the unconditional mean of S is
E(S) = E[E(S | Θ)] = E(10cΘ2 ) = 10cE(Θ2 )
and the variance of the hypothetical means is
2
σHM = Var[E(S | Θ)]
= Var(10cΘ2 )
= 100c2 Var(Θ2 )
= 100c2 {E(Θ4 ) − [E(Θ2 )]2 }.
The expected value of the process variance is
μPV = E[Var(S | Θ)]
24
= E[10c2 Θ3 (2 − Θ)]
= 10c2 [2E(Θ3 ) − E(Θ4 )].
θ Pr(Θ = θ) θ2 θ3 θ4
0.3 0.5 0.09 0.027 0.0081
0.7 0.5 0.49 0.343 0.2401
25
E(Θ2 ) = (0.5)(0.09) + (0.5)(0.49) = 0.29,
and
so that
2(0.185) − 0.1241
k= = 0.6148.
10 [0.1241 − (0.29)2 ] 2
In this example, note that both EPV and VHM depend on c. However,
as the effects of c on these components are the same, the ratio of EPV to
26
VHM is invariant to c. Also, though X and N are independent given θ,
they are correlated unconditionally due to their common dependence on
Θ.
27
7.3 Bühlmann Credibility
U = ZD + (1 − Z)M. (7.12)
28
• The Bühlmann credibility factor Z depends on the sample size n
and the EPV to VHM ratio k. In particular, Z varies with n and k
as follows:
29
2. The parameter θ is a realization of a random variable Θ. Given θ,
the conditional mean and variance of X are
and
2
Var(X | θ) = σX (θ). (7.14)
2
E[Var(X | Θ)] = E[σX (Θ)]
= μPV
= Expected value of process variance
= EPV, (7.15)
30
and the variance of the conditional mean is
31
same distribution as X. The predictor minimizes the mean squared
error in predicting Xn+1 over the joint distribution of Θ, Xn+1 and
{X1 , · · · , Xn }. Specifically, the predictor is given by
X̂n+1 = β0 + β1 X1 + · · · + βn Xn , (7.18)
where
n
Z= . (7.35)
n+k
32
• Z defined in equation (7.35) is called the Bühlmann credibility
factor or simply the Bühlmann credibility. It depends on the
EPV to VHM ratio k, which is called the Bühlmann credibility
parameter.
33
Example 7.7: Refer to Example 7.5. Suppose the claim experience last
year was 26 claims with an average claim size of 12. Calculate the updated
prediction of (a) the claim frequency, (b) the average claim size, and (c)
the aggregate loss, for next year.
34
n = 26 and D = 12. Thus,
26
Z= = 0.9037,
26 + 2.7717
so that the updated prediction of the claim severity of this group is
35
7.4 Bühlmann-Straub Credibility
36
although it may often be the case. We then assume the following for
the conditional variance of Xi
2
σX (Θ)
Var(Xi | Θ) = , (7.36)
mi
2
for a suitably defined σX (Θ).
2. Xi is the average aggregate loss per month of the ith block of policies,
2
σX (Θ) is the variance of the aggregate loss of the block in a month,
and the exposure mi is the number of months of insurance claims
for the ith block of policies.
37
2
3. Xi is the average loss per unit premium in year i, σX (Θ) is the
variance of the claim amount of an insured per year divided by the
premium per insured, and the exposure mi is the amount of premi-
ums received in year i.
and
2
σX (θ)
Var(Xi | θ) = , (7.39)
mi
2
for i ∈ {1, · · · , n}, where σX (θ) is suitably defined as in the examples
above.
38
• The unconditional mean of Xi is E(Xi ) = E[E(Xi | Θ)] = E[μX (Θ)] =
μX . The mean of the conditional variance of Xi is
" #
2
σX (Θ)
E[Var(Xi | Θ)] = E
mi
μPV
= , (7.40)
mi
2
for i ∈ {1, · · · , n}, where μPV = E[σX (Θ)], and the variance of its
conditional mean is
• Now we define n
X
m= mi , (7.48)
i=1
39
n
1 X
X̄ = mi Xi (7.51)
m i=1
and
μPV
k= 2 . (7.52)
σHM
• Denoting
m
Z= . (7.54)
m+k
we have
X̂n+1 = Z X̄ + (1 − Z)μX . (7.57)
Example 7.9: The number of accident claims incurred per year for each
insured is distributed as a binomial random variable BN (2, θ), and the
claim incidences are independent across insureds. The probability θ of the
binomial has a beta distribution with parameters α = 1 and β = 10. The
data in Table 7.7 are given for a block of policies.
40
Table 7.7: Data for Example 7.9
41
and
2
σHM = Var[E(Xi | Θ)] = Var(2Θ) = 4Var(Θ).
As Θ has a beta distribution with parameters α = 1 and β = 10, we have
αβ 10
Var(Θ) = = = 0.006887.
(α + β)2 (α + β + 1) (11)2 (12)
For the conditional variance of Xi , we have
2Θ(1 − Θ)
Var(Xi | Θ) = .
mi
Thus,
μPV = 2E[Θ(1 − Θ)].
As
α
E(Θ) = = 0.0909,
α+β
42
we have
Thus,
μPV 0.1515
k= 2 = = 5.5.
σHM (4)(0.006887)
As m = 100 + 200 + 250 = 550, we have
550
Z= = 0.9901.
550 + 5.5
Now
43
and
7 + 13 + 18
X̄ = = 0.0691.
550
Thus, the predicted number of claims per insured is
(280)(0.0702) = 19.66.
2
44