2009 |
18 | EE | Istvan Szita,
András Lörincz:
Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version
CoRR abs/0904.3352: (2009) |
2008 |
17 | | Guillaume Chaslot,
Sander Bakkes,
Istvan Szita,
Pieter Spronck:
Monte-Carlo Tree Search: A New Framework for Game AI.
AIIDE 2008 |
16 | EE | Istvan Szita,
András Lörincz:
The many faces of optimism: a unifying approach.
ICML 2008: 1048-1055 |
15 | EE | Istvan Szita,
András Lörincz:
Online variants of the cross-entropy method
CoRR abs/0801.1988: (2008) |
14 | EE | Istvan Szita,
András Lörincz:
Factored Value Iteration Converges
CoRR abs/0801.2069: (2008) |
13 | EE | Istvan Szita,
András Lörincz:
The many faces of optimism - Extended version
CoRR abs/0810.3451: (2008) |
2007 |
12 | EE | Istvan Szita,
András Lörincz:
Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man.
J. Artif. Intell. Res. (JAIR) 30: 659-684 (2007) |
2006 |
11 | EE | Istvan Szita,
Viktor Gyenes,
András Lörincz:
Reinforcement Learning with Echo State Networks.
ICANN (1) 2006: 830-839 |
10 | EE | Istvan Szita,
András Lörincz:
Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs
CoRR abs/cs/0610170: (2006) |
9 | EE | Istvan Szita,
András Lörincz:
Learning Tetris Using the Noisy Cross-Entropy Method.
Neural Computation 18(12): 2936-2941 (2006) |
2004 |
8 | EE | Istvan Szita,
András Lörincz:
Applying Policy Iteration for Training Recurrent Neural Networks
CoRR cs.AI/0410004: (2004) |
7 | EE | Istvan Szita,
András Lörincz:
Kalman Filter Control Embedded into the Reinforcement Learning Framework.
Neural Computation 16(3): 491-499 (2004) |
2003 |
6 | EE | Bálint Takács,
Istvan Szita,
András Lörincz:
Temporal plannability by variance of the episode length
CoRR cs.AI/0301006: (2003) |
5 | EE | Istvan Szita,
András Lörincz:
Kalman filter control in the reinforcement learning framework
CoRR cs.LG/0301007: (2003) |
4 | EE | Istvan Szita,
András Lörincz:
Reinforcement Learning with Linear Function Approximation and LQ control Converges
CoRR cs.LG/0306120: (2003) |
2002 |
3 | | Istvan Szita,
Bálint Takács,
András Lörincz:
Reinforcement Learning Integrated with a Non-Markovian Controller.
ECAI 2002: 365-369 |
2 | EE | Istvan Szita,
Bálint Takács,
András Lörincz:
Searching for Plannable Domains can Speed up Reinforcement Learning
CoRR cs.AI/0212025: (2002) |
1 | EE | Istvan Szita,
Bálint Takács,
András Lörincz:
MDPs: Learning in Varying Environments.
Journal of Machine Learning Research 3: 145-174 (2002) |