| 2009 |
| 18 | EE | Istvan Szita,
András Lörincz:
Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version.
CoRR abs/0904.3352 (2009) |
| 2008 |
| 17 | | Guillaume Chaslot,
Sander Bakkes,
Istvan Szita,
Pieter Spronck:
Monte-Carlo Tree Search: A New Framework for Game AI.
AIIDE 2008 |
| 16 | EE | Istvan Szita,
András Lörincz:
The many faces of optimism: a unifying approach.
ICML 2008: 1048-1055 |
| 15 | EE | Istvan Szita,
András Lörincz:
Online variants of the cross-entropy method.
CoRR abs/0801.1988 (2008) |
| 14 | EE | Istvan Szita,
András Lörincz:
Factored Value Iteration Converges.
CoRR abs/0801.2069 (2008) |
| 13 | EE | Istvan Szita,
András Lörincz:
The many faces of optimism - Extended version.
CoRR abs/0810.3451 (2008) |
| 2007 |
| 12 | EE | Istvan Szita,
András Lörincz:
Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man.
J. Artif. Intell. Res. (JAIR) 30: 659-684 (2007) |
| 2006 |
| 11 | EE | Istvan Szita,
Viktor Gyenes,
András Lörincz:
Reinforcement Learning with Echo State Networks.
ICANN (1) 2006: 830-839 |
| 10 | EE | Istvan Szita,
András Lörincz:
Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs.
CoRR abs/cs/0610170 (2006) |
| 9 | EE | Istvan Szita,
András Lörincz:
Learning Tetris Using the Noisy Cross-Entropy Method.
Neural Computation 18(12): 2936-2941 (2006) |
| 2004 |
| 8 | EE | Istvan Szita,
András Lörincz:
Applying Policy Iteration for Training Recurrent Neural Networks.
CoRR cs.AI/0410004 (2004) |
| 7 | EE | Istvan Szita,
András Lörincz:
Kalman Filter Control Embedded into the Reinforcement Learning Framework.
Neural Computation 16(3): 491-499 (2004) |
| 2003 |
| 6 | EE | Bálint Takács,
Istvan Szita,
András Lörincz:
Temporal plannability by variance of the episode length.
CoRR cs.AI/0301006 (2003) |
| 5 | EE | Istvan Szita,
András Lörincz:
Kalman filter control in the reinforcement learning framework.
CoRR cs.LG/0301007 (2003) |
| 4 | EE | Istvan Szita,
András Lörincz:
Reinforcement Learning with Linear Function Approximation and LQ control Converges.
CoRR cs.LG/0306120 (2003) |
| 2002 |
| 3 | | Istvan Szita,
Bálint Takács,
András Lörincz:
Reinforcement Learning Integrated with a Non-Markovian Controller.
ECAI 2002: 365-369 |
| 2 | EE | Istvan Szita,
Bálint Takács,
András Lörincz:
Searching for Plannable Domains can Speed up Reinforcement Learning.
CoRR cs.AI/0212025 (2002) |
| 1 | EE | Istvan Szita,
Bálint Takács,
András Lörincz:
MDPs: Learning in Varying Environments.
Journal of Machine Learning Research 3: 145-174 (2002) |