Result 1 to 20 of 859 total
Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems. (English)
Eur. J. Oper. Res. 221, No. 1, 99-109 (2012).
1
Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning. (English)
Comput. Oper. Res. 39, No. 7, 1315-1324 (2012).
2
Monte Carlo hyper-heuristics for examination timetabling. (English)
Ann. Oper. Res. 196, 73-90 (2012).
3
Induced states in a decision tree constructed by Q-learning. (English)
Inf. Sci. 213, 39-49 (2012).
4
Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs. (English)
Artif. Intell. 187-188, 115-132 (2012).
5
Automatic algorithm development using new reinforcement programming techniques. (English)
Comput. Intell. 28, No. 2, 176-208 (2012).
6
An adaptive witness selection method for reputation-based trust models. (English)
Rahwan, Iyad (ed.) et al., PRIMA 2012: Principles and practice of multi-agent systems. 15th international conference, Kuching, Sarawak, Malaysia, September 3‒7, 2012. Proceedings. Berlin: Springer (ISBN 978-3-642-32728-5/pbk). Lecture Notes in Computer Science 7455. Lecture Notes in Artificial Intelligence, 184-198 (2012).
7
A modular hierarchical reinforcement learning algorithm. (English)
Huang, De-Shuang (ed.) et al., Intelligent computing theories and applications. 8th international conference, ICIC 2012, Huangshan, China, July 25‒29, 2012. Proceedings. Berlin: Springer (ISBN 978-3-642-31575-6/pbk). Lecture Notes in Computer Science 7390. Lecture Notes in Artificial Intelligence, 375-382 (2012).
8
Integrating relational reinforcement learning with reasoning about actions and change. (English)
Muggleton, Stephen H. (ed.) et al., Inductive logic programming. 21st international conference, ILP 2011, Windsor Great Park, UK, July 31 ‒ August 3, 2011. Revised selected papers. Berlin: Springer (ISBN 978-3-642-31950-1/pbk). Lecture Notes in Computer Science 7207. Lecture Notes in Artificial Intelligence, 255-269 (2012).
9
Distributed self-organizing bandwidth allocation for priority-based bus communication. (English)
Concurrency Comput. Pract. Exp. 24, No. 16, 1903-1917 (2012).
10
Reinforcement learning as heuristic for action-rule preferences. (English)
Collier, Rem (ed.) et al., Programming multi-agent systems. 8th international workshop, ProMAS 2010, Toronto, ON, Canada, May 11, 2010. Revised selected papers. Berlin: Springer (ISBN 978-3-642-28938-5/pbk). Lecture Notes in Computer Science 6599. Lecture Notes in Artificial Intelligence, 25-40 (2012).
11
Reinforcement learning for Golog programs with first-order state-abstraction. (English)
Log. J. IGPL 20, No. 5, 909-942 (2012).
12
Genetic programming for generalised helicopter hovering control. (English)
Moraglio, Alberto (ed.) et al., Genetic programming. 15th European conference, EuroGP 2012, Málaga, Spain, April 11‒13, 2012. Proceedings. Berlin: Springer (ISBN 978-3-642-29138-8/pbk). Lecture Notes in Computer Science 7244, 25-36 (2012).
13
Reinforcement learning and the creative, automated music improviser. (English)
Machado, Penousal (ed.) et al., Evolutionary and biologically inspired music, sound, art and design. First international conference, EvoMUSART 2012, Málaga, Spain, April 11‒13, 2012. Proceedings. Berlin: Springer (ISBN 978-3-642-29141-8/pbk). Lecture Notes in Computer Science 7247, 223-234 (2012).
14
PAC bounds for discounted MDPs. (English)
Bshouty, Nader H. (ed.) et al., Algorithmic learning theory. 23rd international conference, ALT 2012, Lyon, France, October 29‒31, 2012. Proceedings. Berlin: Springer (ISBN 978-3-642-34105-2/pbk). Lecture Notes in Computer Science 7568. Lecture Notes in Artificial Intelligence, 320-334 (2012).
15
Decentralized multi-tasks distribution in heterogeneous robot teams by means of ant colony optimization and learning automata. (English)
Corchado, Emilio (ed.) et al., Hybrid artificial intelligent systems. 7th international conference, HAIS 2012, Salamanca, Spain, March 28‒30, 2012. Proceedings, Part I. Berlin: Springer (ISBN 978-3-642-28941-5/pbk). Lecture Notes in Computer Science 7208. Lecture Notes in Artificial Intelligence, 103-114 (2012).
16
Evaluation of the improved penalty avoiding rational policy making algorithm in real world environment. (English)
Pan, Jeng-Shyang (ed.) et al., Intelligent information and database systems. 4th Asian conference, ACIIDS 2012, Kaohsiung, Taiwan, March 19‒21, 2012. Proceedings, Part I. Berlin: Springer (ISBN 978-3-642-28486-1/pbk). Lecture Notes in Computer Science 7196. Lecture Notes in Artificial Intelligence, 270-280 (2012).
17
Self-organizing reinforcement learning model. (English)
Pan, Jeng-Shyang (ed.) et al., Intelligent information and database systems. 4th Asian conference, ACIIDS 2012, Kaohsiung, Taiwan, March 19‒21, 2012. Proceedings, Part I. Berlin: Springer (ISBN 978-3-642-28486-1/pbk). Lecture Notes in Computer Science 7196. Lecture Notes in Artificial Intelligence, 218-227 (2012).
18
A system for the use of answer set programming in reinforcement learning. (English)
Fariñas del Cerro, Luis (ed.) et al., Logics in artificial intelligence. 13th European conference, JELIA 2012, Toulouse, France, September 26‒28, 2012. Proceeding. Berlin: Springer (ISBN 978-3-642-33352-1/pbk). Lecture Notes in Computer Science 7519. Lecture Notes in Artificial Intelligence, 488-491 (2012).
19
Task allocation in mesh structure: 2Side LeapFrog algorithm and Q-learning based algorithm. (English)
Murgante, Beniamino (ed.) et al., Computational science and its applications ‒ ICCSA 2012. 12th international conference, Salvador de Bahia, Brazil, June 18‒21, 2012. Proceedings, Part IV. Berlin: Springer (ISBN 978-3-642-31127-7/pbk). Lecture Notes in Computer Science 7336, 576-587 (2012).
20
Result 1 to 20 of 859 total