Results 1 to 20 of 78 total
Beyond reward: The problem of knowledge and data. (English)
Muggleton, Stephen H. (ed.) et al., Inductive logic programming. 21st international conference, ILP 2011, Windsor Great Park, UK, July 31 ‒ August 3, 2011. Revised selected papers. Berlin: Springer (ISBN 978-3-642-31950-1/pbk). Lecture Notes in Computer Science 7207. Lecture Notes in Artificial Intelligence, 2-6 (2012).
1
Temporal-difference search in computer Go. (English)
Mach. Learn. 87, No. 2, 183-219 (2012).
2
What to do with 500M location requests a day? (English)
COM.Geo, 67 (2011).
3
Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction (English)
AAMAS, 761-768 (2011).
4
Automated flexion crease identification using internal image seams. (English)
Pattern Recognition 43, No. 3, 630-635 (2010).
5
Toward off-policy learning control with function approximation (English)
ICML, 719-726 (2010).
6
Natural actor-critic algorithms. (English)
Automatica 45, No. 11, 2471-2482 (2009).
7
Natural actor-critic algorithms (English)
Automatica 45, No. 11, 2471-2482 (2009).
8
How and why theories matter: A comment on Felin and Foss (2009) (English)
Organization Science 20, No. 3, 669-675 (2009).
9
Fast gradient-descent methods for temporal-difference learning with linear function approximation (English)
ICML, 125 (2009).
10
Convergent temporal-difference learning with arbitrary smooth function approximation (English)
NIPS, 1204-1212 (2009).
11
Multi-step Dyna planning for policy evaluation and control (English)
NIPS, 2187-2195 (2009).
12
Simulation-assisted saddlepoint approximation. (English)
J. Stat. Comput. Simulation 78, No. 8, 731-745 (2008).
13
Agent learning using action-dependent learning rates in computer role-playing games (English)
AIIDE (2008).
14
Sample-based learning and search with permanent and transient memories (English)
ICML, 968-975 (2008).
15
Dyna-style planning with linear function approximation and prioritized sweeping (English)
UAI, 528-536 (2008).
16
A computational model of hippocampal function in trace conditioning (English)
NIPS, 993-1000 (2008).
17
A convergent $O(n)$ temporal-difference algorithm for off-policy learning with linear function approximation (English)
NIPS, 1609-1616 (2008).
18
An integrated collision avoidance system for autonomous underwater vehicles. (English)
Int. J. Control 80, No. 7, 1027-1049 (2007).
19
Soft computing techniques in the design of a navigation, guidance and control system for an autonomous underwater vehicle. (English)
Int. J. Adapt. Control Signal Process. 21, No. 2-3, 205-236 (2007).
20