Reinforcement Learning for Call Admission Control and Routing under Quality of Service Constraints in Multimedia Networks Hui TongTimothy X Brown OriginalPaper Pages: 111 - 139
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts Amy McGovernEliot MossAndrew G. Barto OriginalPaper Pages: 141 - 160
On Average Versus Discounted Reward Temporal-Difference Learning John N. TsitsiklisBenjamin Van Roy OriginalPaper Pages: 179 - 191
A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes Michael KearnsYishay MansourAndrew Y. Ng OriginalPaper Pages: 193 - 208
Near-Optimal Reinforcement Learning in Polynomial Time Michael KearnsSatinder Singh OriginalPaper Pages: 209 - 232
Technical Update: Least-Squares Temporal Difference Learning Justin A. Boyan OriginalPaper Pages: 233 - 246
Continuous-Action Q-Learning José del R. MillánDaniele PosenatoEric Dedieu OriginalPaper Pages: 247 - 265
Variable Resolution Discretization in Optimal Control Rémi MunosAndrew Moore OriginalPaper Pages: 291 - 323