Markov decision chains with unbounded costs and applications to the control of queues

D. R. Robinson

doi:10.2307/1426027

Abstract

A discrete-time Markov decision model with a denumerable set of states and unbounded costs is considered. It is shown that the optimality equation of dynamic programming along with some additional, easily checked, conditions may be used to establish the optimality or ∊ -optimality of policies with respect to the average expected cost criterion. The results are used to derive optimal policies in two queueing examples.

References

[1] Bather, J. A. (1973) Optimal decision procedures for finite Markov chains. Part I: Examples. Adv. Appl. Prob. 5, 328–339. Part II: Communicating systems. Adv. Appl. Prob. 5, 521–540. Part III: General convex systems. Adv. Appl. Prob. 5, 541–553.Google Scholar

[2] Derman, C. (1966) Denumerable state Markovian decision processes-average cost criterion. Ann. Math. Statist. 37, 1545–1554.Google Scholar

[3] Derman, C. and Veinott, A. F. Jr. (1967) A solution to a countable system of equations arising in Markovian decision processes. Ann. Math. Statist. 38, 582–584.CrossRef Google Scholar

[4] Hordijk, A. (1974) Dynamic Programming and Markov Potential Theory. Mathematical Centre Tracts, No. 51, Amsterdam.Google Scholar

[5] Howard, R. A. (1960) Dynamic Programming and Markov Processes. M.I.T. Press, Cambridge, Mass. Google Scholar

[6] Jaiswal, N. K. (1968) Priority Queues. Academic Press, New York.Google Scholar

[7] Lippman, S. A. (1973) Semi-Markov decision processes with unbounded rewards. Management Sci. 7, 717–731.CrossRef Google Scholar

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Bather, John 1976. Optimal stationary policies for denumerable Markov chains in continuous time. Advances in Applied Probability, Vol. 8, Issue. 01, p. 144.

Federgruen, A. and Tijms, H. C. 1978. The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms. Journal of Applied Probability, Vol. 15, Issue. 2, p. 356.

Federgruen, A. Hordijk, A. and Tijms, H.C. 1979. Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion. Stochastic Processes and their Applications, Vol. 9, Issue. 2, p. 223.

Whittle, P. 1979. A simple condition for regularity in negative programming. Journal of Applied Probability, Vol. 16, Issue. 02, p. 305.

Serfozo, Richard 1981. Optimal control of random walks, birth and death processes, and queues. Advances in Applied Probability, Vol. 13, Issue. 01, p. 61.

1982. The Single Server Queue. Vol. 8, Issue. , p. 676.

Kitayev, M. Yu. 1986. Semi-Markov and Jump Markov Controlled Models: Average Cost Criterion. Theory of Probability & Its Applications, Vol. 30, Issue. 2, p. 272.

Piunovskii, A. B. and Khametov, V. M. 1991. New exactly solvable examples for controlled discrete-time Markov chains. Cybernetics, Vol. 27, Issue. 3, p. 420.

Arapostathis, Aristotle Borkar, Vivek S. Fernández-Gaucherand, Emmanuel Ghosh, Mrinal K. and Marcus, Steven I. 1993. Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey. SIAM Journal on Control and Optimization, Vol. 31, Issue. 2, p. 282.

1994. Markov Decision Processes. p. 613.

Narayan, Prakash 1994. Jointly optimal admission and routing controls at a network node. Communications in Statistics. Stochastic Models, Vol. 10, Issue. 1, p. 223.

Makowski, Armand M. and Shwartz, Adam 2002. Handbook of Markov Decision Processes. Vol. 40, Issue. , p. 269.

Guo, Xianping and Zhu, Quanxin 2006. Average optimality for Markov decision processes in borel spaces: a new condition and approach. Journal of Applied Probability, Vol. 43, Issue. 02, p. 318.

Li, Quan-Lin Ma, Jing-Yu Fan, Rui-Na and Xia, Li 2019. Stochastic Models in Reliability, Network Security and System Safety. Vol. 1102, Issue. , p. 44.

Article contents

Markov decision chains with unbounded costs and applications to the control of queues

Abstract

Keywords

Access options

References

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

Markov decision chains with unbounded costs and applications to the control of queues

Abstract

Keywords

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests