Crossref Citations
This article has been cited by the following publications. This list is generated based on data provided by Crossref.
Bharath, B
and
Borkar, V S
1999.
Stochastic approximation algorithms: Overview and recent trends.
Sadhana,
Vol. 24,
Issue. 4-5,
p.
425.
Abounadi, J.
Bertsekas, D.
and
Borkar, V. S.
2001.
Learning Algorithms for Markov Decision Processes with Average Cost.
SIAM Journal on Control and Optimization,
Vol. 40,
Issue. 3,
p.
681.
Ormoneit, D.
and
Glynn, P.
2002.
Kernel-based reinforcement learning in average-cost problems.
IEEE Transactions on Automatic Control,
Vol. 47,
Issue. 10,
p.
1624.
Borkar, V. S.
2002.
Q-Learning for Risk-Sensitive Control.
Mathematics of Operations Research,
Vol. 27,
Issue. 2,
p.
294.
Van Roy, Benjamin
2002.
Handbook of Markov Decision Processes.
Vol. 40,
Issue. ,
p.
431.
Melo, Francisco S.
and
Ribeiro, M. Isabel
2007.
Learning Theory.
Vol. 4539,
Issue. ,
p.
308.
Malikopoulos, Andreas A.
2009.
Convergence Properties of a Computational Learning Model for Unknown Markov Chains.
Journal of Dynamic Systems, Measurement, and Control,
Vol. 131,
Issue. 4,
Malikopoulos, Andreas A.
Papalambros, Panos Y.
and
Assanis, Dennis N.
2009.
A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty.
Journal of Dynamic Systems, Measurement, and Control,
Vol. 131,
Issue. 4,
Malikopoulos, Andreas A.
Assanis, Dennis N.
and
Papalambros, Panos Y.
2009.
Real-Time Self-Learning Optimization of Diesel Engine Calibration.
Journal of Engineering for Gas Turbines and Power,
Vol. 131,
Issue. 2,