Abstract
Hidden Markov models (HMMs) are flexible, well-established models useful in a diverse range of applications. However, one potential limitation of such models lies in their inability to explicitly structure the holding times of each hidden state. Hidden semi-Markov models (HSMMs) address this limitation by incorporating additional temporal structure through explicit modelling of the holding times. However, HSMMs have generally received less attention in the literature, mainly due to their intensive computational requirements. Here a Bayesian implementation of HSMMs is presented. Recursive algorithms are proposed in conjunction with Metropolis-Hastings in such a way as to avoid sampling from the distribution of the hidden state sequence in the MCMC sampler. This provides a computationally tractable estimation framework for HSMMs, avoiding the limitations of the conventional EM algorithm regarding model flexibility. Performance of the proposed implementation is demonstrated through simulation experiments as well as an illustrative application relating to recurrent failures in a network of underground water pipes, where random effects are also incorporated into the HSMM to allow for pipe heterogeneity.
Acknowledgements
The pipe dataset used in this paper was provided by Dr. Yehuda Kleiner whom the authors gratefully acknowledge.
Appendices
Appendix A: Pseudo code for simulation experiment with Gaussian observations
Simulate data from the 2-state Gaussian HSMM, Eq. (10):

1. Set values for μ_S, σ, ϕ_S and π_S for S = 1, 2.
2. Set i = 1 and sample the initial state S_i from π = {π_S}.
3. Sample the duration τ_i of S_i from a zero-truncated Poisson with parameter ϕ_{S_i}.
4. Sample the next state from the distribution given by the S_i-th row of P, and set i = i + 1.
5. Repeat steps 3 and 4 until ∑_i τ_i ≥ 250.
6. If ∑_i τ_i > 250, truncate so that ∑_i τ_i = 250.
7. For each T = 1, …, 250, sample from N(μ_S, σ²) according to the state S the chain is in at time step T.
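The paper's supplementary code is in R; purely as an illustrative sketch, and with made-up parameter values, steps 1–7 above could be written as follows (for two states the transition matrix P has no self-transitions, so the state switch is deterministic):

```python
import numpy as np

rng = np.random.default_rng(1)

# Assumed illustrative parameter values (not those used in the paper).
mu = {1: 0.0, 2: 3.0}      # state means mu_S
sigma = 1.0                # common observation standard deviation
phi = {1: 8.0, 2: 4.0}     # zero-truncated Poisson duration parameters phi_S
pi0 = [0.5, 0.5]           # initial state distribution pi_S
T_total = 250

def rztpois(lam):
    """Sample from a zero-truncated Poisson by rejection."""
    while True:
        d = rng.poisson(lam)
        if d > 0:
            return d

# Steps 2-6: build the hidden state path by alternating duration draws
# and transitions (deterministic here, since a 2-state HSMM excludes
# self-transitions).
state = int(rng.choice([1, 2], p=pi0))
path = []
while len(path) < T_total:
    path.extend([state] * rztpois(phi[state]))
    state = 3 - state          # switch 1 <-> 2
path = path[:T_total]          # step 6: truncate to exactly 250 steps

# Step 7: Gaussian observation at each time step, given the state there.
y = np.array([rng.normal(mu[s], sigma) for s in path])
```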
Metropolis-Hastings:
- Set i = 0 and initialise parameters μ_S^(0), σ^(0), ϕ_S^(0) and π_S^(0).
- Calculate ℓ^(0), the log-likelihood, using the forward algorithm in Sect. 3.2.
- Calculate P^(0), the log-posterior, from
  P^(i) = ℓ^(i) + ∑_θ log p(θ), θ ∈ {μ_S, σ, ϕ_S, π_S},  (15)
  where p(⋅) is the prior for each parameter (Table 1).
Do for i=1,…,M where M is the required number of MCMC iterations:
- μ_S:
  Propose μ_S* = μ_S^(i−1) + ε, where ε ∼ N(0, σ²_μ).
  Calculate ℓ* and hence P* using Eq. (15).
  Calculate the acceptance probability η = min(1, Ω), where Ω = exp{P* − P^(i−1)}.
  Sample U from U(0, 1). If U ≤ η, set μ_S^(i) = μ_S*, ℓ^(i) = ℓ* and P^(i) = P*.
- σ:
  Propose σ* = exp{log(σ^(i−1)) + ε}, where ε ∼ N(0, σ²_σ).
  Calculate ℓ* and hence P* using Eq. (15).
  Calculate the acceptance probability η = min(1, Ω), where Ω = exp{P* + log(σ*) − [P^(i−1) + log(σ^(i−1))]} (the additional log terms account for the log transformation of σ).
  Sample U from U(0, 1). If U ≤ η, set σ^(i) = σ*, ℓ^(i) = ℓ* and P^(i) = P*.
- ϕ_S:
  Updated as for σ, with ϕ_S in place of σ (random-walk proposal on log ϕ_S with variance σ²_ϕ).
- π_S:
  Propose π* ∼ Dir(απ^(i−1)).
  Calculate ℓ* and hence P* using Eq. (15).
  Calculate the acceptance probability η = min(1, Ω), where
  Ω = exp{P* − P^(i−1)} d(π^(i−1); απ*) / d(π*; απ^(i−1)),
  and d(π; θ) is a Dirichlet density with parameter θ.
  Sample U from U(0, 1). If U ≤ η, set π_S^(i) = π_S*, ℓ^(i) = ℓ* and P^(i) = P*.
Adjust σ²_μ, σ²_σ, σ²_ϕ and α to achieve the desired acceptance rates.
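The random-walk Metropolis update for μ_S can be sketched as below. The log-posterior here is a stand-in toy target (a Gaussian), since the real P^(i) of Eq. (15) requires the forward-algorithm log-likelihood of Sect. 3.2; the proposal standard deviation sd_mu is an assumed tuning value.

```python
import numpy as np

rng = np.random.default_rng(2)

def log_post(mu):
    # Stand-in log-posterior: N(1, 1) target. In the paper this would be
    # ell (forward algorithm) plus the log-priors, Eq. (15).
    return -0.5 * (mu - 1.0) ** 2

mu_cur, lp_cur = 0.0, log_post(0.0)
sd_mu = 0.8        # sigma_mu: random-walk proposal sd, tuned for acceptance
draws = []
for _ in range(5000):
    mu_prop = mu_cur + rng.normal(0.0, sd_mu)     # mu* = mu^(i-1) + eps
    lp_prop = log_post(mu_prop)
    # eta = min(1, Omega), Omega = exp{P* - P^(i-1)}
    if rng.uniform() <= min(1.0, np.exp(lp_prop - lp_cur)):
        mu_cur, lp_cur = mu_prop, lp_prop
    draws.append(mu_cur)

posterior_mean = np.mean(draws[1000:])   # discard burn-in
```

The σ and ϕ_S updates differ only by the log transform and its Jacobian correction in Ω.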
Note that R code for implementing the HSMMs and HMMs from both simulation experiments is available as supplementary material to the paper and at http://empslocal.ex.ac.uk/people/staff/te201/HSMM/.
Appendix B: Pseudo code for simulation experiment with NHPP observations
Simulate data from the 3-state NHPP-HSMM with intensity function λ(t; x | S) as in Eq. (12), for 50 hypothetical objects:

1. Set values for θ_S, β_0, β_1, ϕ_S, π_S and P for S = 1, 2, 3.
2. Sample B_j ∼ U(80, 150), for j = 1, …, 50, to set the observation period (in discrete time steps) for each object.
3. Sample x_j ∼ U(50, 150) to set values for the covariate x.
4. Set j = 1, i = 1 and sample the initial state S_i from π = {π_S}.
5. Sample the duration τ_i of S_i from a zero-truncated Poisson with parameter ϕ_{S_i}.
6. Sample the next state from the distribution given by the S_i-th row of P, and set i = i + 1.
7. Repeat steps 5 and 6 until ∑_i τ_i ≥ B_j.
8. If ∑_i τ_i > B_j, truncate so that ∑_i τ_i = B_j.
9. For each T = 1, …, B_j, sample from Pois(Λ([T−1, T] | S)) according to the state S the chain is in at time step T, where Λ([T−1, T] | S) = ∫_{T−1}^{T} λ(t; x | S) dt.
10. Repeat steps 4–9 for j = 2, …, 50.
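Step 9 can be sketched as follows for a single object. The intensity form here is purely a hypothetical stand-in for Eq. (12) — λ(t; x | S) = θ_S exp(β_0 + β_1 x) t, chosen so that Λ([T−1, T] | S) has a closed form — and all parameter values and the state path are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(3)

# Hypothetical parameters (not those of the paper).
theta = {1: 0.02, 2: 0.05, 3: 0.10}   # state-specific scale theta_S
beta0, beta1 = -1.0, 0.005            # covariate coefficients
x = 100.0                             # covariate value for one object
path = [1] * 30 + [2] * 30 + [3] * 20 # a fixed hidden state path (B_j = 80)

def Lambda(T, S):
    # Integral of theta_S * exp(beta0 + beta1 * x) * t over [T-1, T]:
    # the integral of t gives (T^2 - (T-1)^2) / 2.
    return theta[S] * np.exp(beta0 + beta1 * x) * (T**2 - (T - 1) ** 2) / 2.0

# Step 9: one Poisson count per discrete time step, given the state there.
counts = np.array([rng.poisson(Lambda(T, S))
                   for T, S in enumerate(path, start=1)])
```

For an intensity without a closed-form integral, Λ([T−1, T] | S) would instead be evaluated by numerical quadrature over each unit interval.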
Metropolis-Hastings:
- Very similar to that in Appendix A.
- The forward algorithm needs to be run for each of the j = 1, …, 50 objects, so that the log-likelihood is ℓ^(i) = ∑_{j=1}^{50} ℓ_j^(i) (assuming independence of the objects).
- The log-posterior is obtained essentially as in Eq. (15), except that it is set to zero outside the region θ_1 < θ_2 < θ_3 of the parameter space, as a measure against label switching.
- Strictly positive parameters such as θ_S and ϕ_S are updated in the same way as σ and ϕ_S in Appendix A.
- The vector π, and the vectors formed from the rows of the transition matrix P (excluding the zero elements), are updated exactly like π in Appendix A.
Economou, T., Bailey, T.C. & Kapelan, Z. MCMC implementation for Bayesian hidden semi-Markov models with illustrative applications. Stat Comput 24, 739–752 (2014). https://doi.org/10.1007/s11222-013-9399-z