<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<item>
  <id>06058606</id>
  <dt>a</dt>
  <an>06058606</an>
  <augroup>
    <au>Runarsson, Thomas Philip</au>
  </augroup>
  <ti>Learning heuristic policies -- a reinforcement learning problem.</ti>
  <so>Coello Coello, Carlos A. (ed.), Learning and intelligent optimization. 5th international conference, LION 5, Rome, Italy, January 17--21, 2011. Selected papers. Berlin: Springer (ISBN 978-3-642-25565-6/pbk). Lecture Notes in Computer Science 6683, 423-432 (2011).</so>
  <py>2011</py>
  <pu>Berlin: Springer</pu>
  <lagroup>
    <la>EN</la>
  </lagroup>
  <ccgroup>
  </ccgroup>
  <utgroup>
  </utgroup>
  <cigroup>
  </cigroup>
  <ligroup>
    <li>doi:10.1007/978-3-642-25566-3_31</li>
  </ligroup>
  <abgroup>
    <ab>Summary: How learning heuristic policies may be formulated as a reinforcement learning problem is discussed. Reinforcement learning algorithms are commonly centred around estimating value functions. Here a value function represents the average performance of the learned heuristic algorithm over a problem domain. Heuristics correspond to actions and states to solution instances. The problem of bin packing is used to illustrate the key concepts. Experimental studies show that the reinforcement learning approach is compatible with the current techniques used for learning heuristics. The framework opens up further possibilities for learning heuristics by exploring the numerous techniques available in the reinforcement learning literature.</ab>
    <rv></rv>
  </abgroup>
</item>