
Gerald Tesauro

List of publications from the DBLP Bibliography Server

2008
45 Rajarshi Das, Jeffrey O. Kephart, Charles Lefurgy, Gerald Tesauro, David W. Levine, Hoi Chan: Autonomic multi-agent management of power and performance in data centers. AAMAS (Industry Track) 2008: 107-114
2007
44 Jeffrey O. Kephart, Hoi Chan, Rajarshi Das, David W. Levine, Gerald Tesauro, Freeman L. Rawson III, Charles Lefurgy: Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs. ICAC 2007: 24
43 Irina Rish, Gerald Tesauro: Estimating End-to-End Performance by Collaborative Prediction with Active Sampling. Integrated Network Management 2007: 294-303
42 Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O. Kephart, David Levine, Freeman L. Rawson III, Charles Lefurgy: Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning. NIPS 2007
41 Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: On the use of hybrid reinforcement learning for autonomic resource allocation. Cluster Computing 10(3): 287-299 (2007)
40 Gerald Tesauro: Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies. IEEE Internet Computing 11(1): 22-30 (2007)
2006
39 Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: Improvement of Systems Management Policies Using Hybrid Reinforcement Learning. ECML 2006: 783-791
2005
38 Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: New Approaches to Optimization and Utility Elicitation in Autonomic Computing. AAAI 2005: 140-145
37 Gerald Tesauro: Online Resource Allocation Using Decompositional Reinforcement Learning. AAAI 2005: 886-891
36 Gerald Tesauro, Rajarshi Das, William E. Walsh, Jeffrey O. Kephart: Utility-Function-Driven Resource Allocation in Autonomic Systems. ICAC 2005: 342-343
2004
35 Gerald Tesauro, David M. Chess, William E. Walsh, Rajarshi Das, Alla Segal, Ian Whalley, Jeffrey O. Kephart, Steve R. White: A Multi-Agent Systems Approach to Autonomic Computing. AAMAS 2004: 464-471
34 William E. Walsh, Gerald Tesauro, Jeffrey O. Kephart, Rajarshi Das: Utility Functions in Autonomic Systems. ICAC 2004: 70-77
2003
33 Cuihong Li, Gerald Tesauro: A strategic decision model for multi-attribute bilateral negotiation with alternating offers. ACM Conference on Electronic Commerce 2003: 208-209
32 James E. Hanson, Gerald Tesauro, Jeffrey O. Kephart, E. C. Snibl: Multi-agent implementation of asymmetric protocol for bilateral negotiations. ACM Conference on Electronic Commerce 2003: 224-225
31 Gerald Tesauro: Extending Q-Learning to General Adaptive Multi-Agent Systems. NIPS 2003
30 Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation. UAI 2003: 89-97
2002
29 Gerald Tesauro, Jonathan Bredin: Strategic sequential bidding in auctions using dynamic programming. AAMAS 2002: 591-598
28 Gerald Tesauro: Programming backgammon using self-teaching neural nets. Artif. Intell. 134(1-2): 181-199 (2002)
27 Gerald Tesauro, Jeffrey O. Kephart: Pricing in Agent Economies Using Multi-Agent Q-Learning. Autonomous Agents and Multi-Agent Systems 5(3): 289-304 (2002)
2001
26 Gerald Tesauro, Rajarshi Das: High-performance bidding agents for the continuous double auction. ACM Conference on Electronic Commerce 2001: 206-209
25 Rajarshi Das, James E. Hanson, Jeffrey O. Kephart, Gerald Tesauro: Agent-Human Interactions in the Continuous Double Auction. IJCAI 2001: 1169-1187
24 Gerald Tesauro: Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning. Sequence Learning 2001: 288-307
2000
23 Manu Sridharan, Gerald Tesauro: Multi-Agent Q-Learning and Regression Trees for Automated Pricing Decisions. ICMAS 2000: 447-448
22 Jeffrey O. Kephart, Gerald Tesauro: Pseudo-convergent Q-Learning by Competitive Pricebots. ICML 2000: 463-470
21 Manu Sridharan, Gerald Tesauro: Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions. ICML 2000: 927-934
20 Gerald Tesauro, Jeffrey O. Kephart: Foresight-based pricing algorithms in agent economies. Decision Support Systems 28(1-2): 49-60 (2000)
1999
19 Amy R. Greenwald, Jeffrey O. Kephart, Gerald Tesauro: Strategic pricebot dynamics. ACM Conference on Electronic Commerce 1999: 58-67
1998
18 Gerald Tesauro: Comments on ``Co-Evolution in the Successful Learning of Backgammon Strategy''. Machine Learning 32(3): 241-243 (1998)
1996
17 Gerald Tesauro, Gregory R. Galperin: On-line Policy Improvement using Monte-Carlo Search. NIPS 1996: 1068-1074
1995
16 Gerald Tesauro, David S. Touretzky, Todd K. Leen: Advances in Neural Information Processing Systems 7, [NIPS Conference, Denver, Colorado, USA, 1994] MIT Press 1995
15 Jeffrey O. Kephart, Gregory B. Sorkin, William C. Arnold, David M. Chess, Gerald Tesauro, Steve R. White: Biologically Inspired Defenses Against Computer Viruses. IJCAI (1) 1995: 985-996
14 Gerald Tesauro: Temporal Difference Learning and TD-Gammon. Commun. ACM 38(3): 58-68 (1995)
1994
13 Jack D. Cowan, Gerald Tesauro, Joshua Alspector: Advances in Neural Information Processing Systems 6, [7th NIPS Conference, Denver, Colorado, USA, 1993] Morgan Kaufmann 1994
1992
12 Gerald Tesauro: Temporal Difference Learning of Backgammon Strategy. ML 1992: 451-457
11 Gerald Tesauro: Practical Issues in Temporal Difference Learning. Machine Learning 8: 257-277 (1992)
1991
10 Gerald Tesauro: Practical Issues in Temporal Difference Learning. NIPS 1991: 259-266
9 Jakub Wejchert, Gerald Tesauro: Visualizing processes in neural networks. IBM Journal of Research and Development 35(1): 244-253 (1991)
1990
8 David A. Cohn, Gerald Tesauro: Can Neural Networks Do Better Than the Vapnik-Chervonenkis Bounds? NIPS 1990: 911-917
1989
7 Jakub Wejchert, Gerald Tesauro: Neural Network Visualization. NIPS 1989: 465-472
6 Subutai Ahmad, Gerald Tesauro, Yu He: Asymptotic Convergence of Backpropagation: Numerical Experiments. NIPS 1989: 606-613
5 Gerald Tesauro, Terrence J. Sejnowski: A Parallel Network that Learns to Play Backgammon. Artif. Intell. 39(3): 357-390 (1989)
1988
4 Gerald Tesauro: Connectionist Learning of Expert Backgammon Evaluations. ML 1988: 200-206
3 Subutai Ahmad, Gerald Tesauro: Scaling and Generalization in Neural Networks: A Case Study. NIPS 1988: 160-168
2 Gerald Tesauro: Connectionist Learning of Expert Preferences by Comparison Training. NIPS 1988: 99-106
1987
1 Gerald Tesauro, Terrence J. Sejnowski: A 'Neural' Network that Learns to Play Backgammon. NIPS 1987: 794-803

Coauthor Index

1 Subutai Ahmad [3] [6]
2 Joshua Alspector [13]
3 William C. Arnold [15]
4 Mohamed N. Bennani [39] [41]
5 Craig Boutilier [30] [38]
6 Jonathan Bredin [29]
7 Hoi Chan [42] [44] [45]
8 David M. Chess [15] [35]
9 David A. Cohn [8]
10 Jack D. Cowan [13]
11 Rajarshi Das [25] [26] [30] [34] [35] [36] [38] [39] [41] [42] [44] [45]
12 Gregory R. Galperin [17]
13 Amy R. Greenwald [19]
14 James E. Hanson [25] [32]
15 Yu He [6]
16 Nicholas K. Jong [39] [41]
17 Jeffrey O. Kephart [15] [19] [20] [22] [25] [27] [30] [32] [34] [35] [36] [38] [42] [44] [45]
18 Todd K. Leen [16]
19 Charles Lefurgy [42] [44] [45]
20 David Levine [42]
21 David W. Levine [44] [45]
22 Cuihong Li [33]
23 Relu Patrascu [38]
24 Freeman L. Rawson III [42] [44]
25 Irina Rish [43]
26 Alla Segal [35]
27 Terrence J. Sejnowski [1] [5]
28 E. C. Snibl [32]
29 Gregory B. Sorkin [15]
30 Manu Sridharan [21] [23]
31 David S. Touretzky [16]
32 William E. Walsh [30] [34] [35] [36] [38]
33 Jakub Wejchert [7] [9]
34 Ian Whalley [35]
35 Steve R. White [15] [35]


Copyright © Sun May 17 03:24:02 2009 by Michael Ley (ley@uni-trier.de)