Istvan Szita

18 Istvan Szita, András Lörincz: Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version. CoRR abs/0904.3352 (2009)
17 Guillaume Chaslot, Sander Bakkes, Istvan Szita, Pieter Spronck: Monte-Carlo Tree Search: A New Framework for Game AI. AIIDE 2008
16 Istvan Szita, András Lörincz: The many faces of optimism: a unifying approach. ICML 2008: 1048-1055
15 Istvan Szita, András Lörincz: Online variants of the cross-entropy method. CoRR abs/0801.1988 (2008)
14 Istvan Szita, András Lörincz: Factored Value Iteration Converges. CoRR abs/0801.2069 (2008)
13 Istvan Szita, András Lörincz: The many faces of optimism - Extended version. CoRR abs/0810.3451 (2008)
12 Istvan Szita, András Lörincz: Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man. J. Artif. Intell. Res. (JAIR) 30: 659-684 (2007)
11 Istvan Szita, Viktor Gyenes, András Lörincz: Reinforcement Learning with Echo State Networks. ICANN (1) 2006: 830-839
10 Istvan Szita, András Lörincz: Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs. CoRR abs/cs/0610170 (2006)
9 Istvan Szita, András Lörincz: Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation 18(12): 2936-2941 (2006)
8 Istvan Szita, András Lörincz: Applying Policy Iteration for Training Recurrent Neural Networks. CoRR cs.AI/0410004 (2004)
7 Istvan Szita, András Lörincz: Kalman Filter Control Embedded into the Reinforcement Learning Framework. Neural Computation 16(3): 491-499 (2004)
6 Bálint Takács, Istvan Szita, András Lörincz: Temporal plannability by variance of the episode length. CoRR cs.AI/0301006 (2003)
5 Istvan Szita, András Lörincz: Kalman filter control in the reinforcement learning framework. CoRR cs.LG/0301007 (2003)
4 Istvan Szita, András Lörincz: Reinforcement Learning with Linear Function Approximation and LQ control Converges. CoRR cs.LG/0306120 (2003)
3 Istvan Szita, Bálint Takács, András Lörincz: Reinforcement Learning Integrated with a Non-Markovian Controller. ECAI 2002: 365-369
2 Istvan Szita, Bálint Takács, András Lörincz: Searching for Plannable Domains can Speed up Reinforcement Learning. CoRR cs.AI/0212025 (2002)
1 Istvan Szita, Bálint Takács, András Lörincz: MDPs: Learning in Varying Environments. Journal of Machine Learning Research 3: 145-174 (2002)

1 Sander Bakkes [17]
2 Guillaume Chaslot [17]
3 Viktor Gyenes [11]
4 András Lörincz [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [18]
5 Pieter Spronck [17]
6 Bálint Takács [1] [2] [3] [6]

