Gerald Tesauro

List of publications from the DBLP Bibliography Server - FAQ

Coauthor Index - Ask others: ACM DL/Guide - CiteSeer - CSB - Google - MSN - Yahoo

2008

45 EE Rajarshi Das, Jeffrey O. Kephart, Charles Lefurgy, Gerald Tesauro, David W. Levine, Hoi Chan: Autonomic multi-agent management of power and performance in data centers. AAMAS (Industry Track) 2008: 107-114

2007

44 EE Jeffrey O. Kephart, Hoi Chan, Rajarshi Das, David W. Levine, Gerald Tesauro, Freeman L. Rawson III, Charles Lefurgy: Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs. ICAC 2007: 24

43 EE Irina Rish, Gerald Tesauro: Estimating End-to-End Performance by Collaborative Prediction with Active Sampling. Integrated Network Management 2007: 294-303

42 EE Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O. Kephart, David Levine, Freeman L. Rawson III, Charles Lefurgy: Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning. NIPS 2007

41 EE Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: On the use of hybrid reinforcement learning for autonomic resource allocation. Cluster Computing 10(3): 287-299 (2007)

40 EE Gerald Tesauro: Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies. IEEE Internet Computing 11(1): 22-30 (2007)

2006

39 EE Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: Improvement of Systems Management Policies Using Hybrid Reinforcement Learning. ECML 2006: 783-791

2005

38 Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: New Approaches to Optimization and Utility Elicitation in Autonomic Computing. AAAI 2005: 140-145

37 Gerald Tesauro: Online Resource Allocation Using Decompositional Reinforcement Learning. AAAI 2005: 886-891

36 EE Gerald Tesauro, Rajarshi Das, William E. Walsh, Jeffrey O. Kephart: Utility-Function-Driven Resource Allocation in Autonomic Systems. ICAC 2005: 342-343

2004

35 EE Gerald Tesauro, David M. Chess, William E. Walsh, Rajarshi Das, Alla Segal, Ian Whalley, Jeffrey O. Kephart, Steve R. White: A Multi-Agent Systems Approach to Autonomic Computing. AAMAS 2004: 464-471

34 EE William E. Walsh, Gerald Tesauro, Jeffrey O. Kephart, Rajarshi Das: Utility Functions in Autonomic Systems. ICAC 2004: 70-77

2003

33 EE Cuihong Li, Gerald Tesauro: A strategic decision model for multi-attribute bilateral negotiation with alternating. ACM Conference on Electronic Commerce 2003: 208-209

32 EE James E. Hanson, Gerald Tesauro, Jeffrey O. Kephart, E. C. Snibl: Multi-agent implementation of asymmetric protocol for bilateral negotiations. ACM Conference on Electronic Commerce 2003: 224-225

31 EE Gerald Tesauro: Extending Q-Learning to General Adaptive Multi-Agent Systems. NIPS 2003

30 Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation. UAI 2003: 89-97

2002

29 EE Gerald Tesauro, Jonathan Bredin: Strategic sequential bidding in auctions using dynamic programming. AAMAS 2002: 591-598

28 EE Gerald Tesauro: Programming backgammon using self-teaching neural nets. Artif. Intell. 134(1-2): 181-199 (2002)

27 Gerald Tesauro, Jeffrey O. Kephart: Pricing in Agent Economies Using Multi-Agent Q-Learning. Autonomous Agents and Multi-Agent Systems 5(3): 289-304 (2002)

2001

26 EE Gerald Tesauro, Rajarshi Das: High-performance bidding agents for the continuous double auction. ACM Conference on Electronic Commerce 2001: 206-209

25 Rajarshi Das, James E. Hanson, Jeffrey O. Kephart, Gerald Tesauro: Agent-Human Interactions in the Continuous Double Auction. IJCAI 2001: 1169-1187

24 EE Gerald Tesauro: Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning. Sequence Learning 2001: 288-307

2000

23 EE Manu Sridharan, Gerald Tesauro: Multi-Agent Q-Learning and Regression Trees for Automated Pricing Decisions. ICMAS 2000: 447-448

22 Jeffrey O. Kephart, Gerald Tesauro: Pseudo-convergent Q-Learning by Competitive Pricebots. ICML 2000: 463-470

21 Manu Sridharan, Gerald Tesauro: Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions. ICML 2000: 927-934

20 EE Gerald Tesauro, Jeffrey O. Kephart: Foresight-based pricing algorithms in agent economies. Decision Support Systems 28(1-2): 49-60 (2000)

1999

19 EE Amy R. Greenwald, Jeffrey O. Kephart, Gerald Tesauro: Strategic pricebot dynamics. ACM Conference on Electronic Commerce 1999: 58-67

1998

18 Gerald Tesauro: Comments on ``Co-Evolution in the Successful Learning of Backgammon Strategy''. Machine Learning 32(3): 241-243 (1998)

1996

17 EE Gerald Tesauro, Gregory R. Galperin: On-line Policy Improvement using Monte-Carlo Search. NIPS 1996: 1068-1074

1995

16 Gerald Tesauro, David S. Touretzky, Todd K. Leen: Advances in Neural Information Processing Systems 7, [NIPS Conference, Denver, Colorado, USA, 1994] MIT Press 1995

15 Jeffrey O. Kephart, Gregory B. Sorkin, William C. Arnold, David M. Chess, Gerald Tesauro, Steve R. White: Biologically Inspired Defenses Against Computer Viruses. IJCAI (1) 1995: 985-996

14 Gerald Tesauro: Temporal Difference Learning and TD-Gammon. Commun. ACM 38(3): 58-68 (1995)

1994

13 Jack D. Cowan, Gerald Tesauro, Joshua Alspector: Advances in Neural Information Processing Systems 6, [7th NIPS Conference, Denver, Colorado, USA, 1993] Morgan Kaufmann 1994

1992

12 Gerald Tesauro: Temporal Difference Learning of Backgammon Strategy. ML 1992: 451-457

11 Gerald Tesauro: Practical Issues in Temporal Difference Learning. Machine Learning 8: 257-277 (1992)

1991

10 EE Gerald Tesauro: Practical Issues in Temporal Difference Learning. NIPS 1991: 259-266

9 Jakub Wejchert, Gerald Tesauro: Visualizing processes in neural networks. IBM Journal of Research and Development 35(1): 244-253 (1991)

1990

8 EE David A. Cohn, Gerald Tesauro: Can Neural Networks Do Better Than the Vapnik-Chervonenkis Bounds? NIPS 1990: 911-917

1989

7 EE Jakub Wejchert, Gerald Tesauro: Neural Network Visualization. NIPS 1989: 465-472

6 EE Subutai Ahmad, Gerald Tesauro, Yu He: Asymptotic Convergence of Backpropagation: Numerical Experiments. NIPS 1989: 606-613

5 Gerald Tesauro, Terrence J. Sejnowski: A Parallel Network that Learns to Play Backgammon. Artif. Intell. 39(3): 357-390 (1989)

1988

4 Gerald Tesauro: Connectionist Learning of Expert Backgammon Evaluations. ML 1988: 200-206

3 EE Subutai Ahmad, Gerald Tesauro: Scaling and Generalization in Neural Networks: A Case Study. NIPS 1988: 160-168

2 EE Gerald Tesauro: Connectionist Learning of Expert Preferences by Comparison Training. NIPS 1988: 99-106

1987

1 EE Gerald Tesauro, Terrence J. Sejnowski: A 'Neural' Network that Learns to Play Backgammon. NIPS 1987: 794-803

2008
45	EE	Rajarshi Das, Jeffrey O. Kephart, Charles Lefurgy, Gerald Tesauro, David W. Levine, Hoi Chan: Autonomic multi-agent management of power and performance in data centers. AAMAS (Industry Track) 2008: 107-114
2007
44	EE	Jeffrey O. Kephart, Hoi Chan, Rajarshi Das, David W. Levine, Gerald Tesauro, Freeman L. Rawson III, Charles Lefurgy: Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs. ICAC 2007: 24
43	EE	Irina Rish, Gerald Tesauro: Estimating End-to-End Performance by Collaborative Prediction with Active Sampling. Integrated Network Management 2007: 294-303
42	EE	Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O. Kephart, David Levine, Freeman L. Rawson III, Charles Lefurgy: Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning. NIPS 2007
41	EE	Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: On the use of hybrid reinforcement learning for autonomic resource allocation. Cluster Computing 10(3): 287-299 (2007)
40	EE	Gerald Tesauro: Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies. IEEE Internet Computing 11(1): 22-30 (2007)
2006
39	EE	Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: Improvement of Systems Management Policies Using Hybrid Reinforcement Learning. ECML 2006: 783-791
2005
38		Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: New Approaches to Optimization and Utility Elicitation in Autonomic Computing. AAAI 2005: 140-145
37		Gerald Tesauro: Online Resource Allocation Using Decompositional Reinforcement Learning. AAAI 2005: 886-891
36	EE	Gerald Tesauro, Rajarshi Das, William E. Walsh, Jeffrey O. Kephart: Utility-Function-Driven Resource Allocation in Autonomic Systems. ICAC 2005: 342-343
2004
35	EE	Gerald Tesauro, David M. Chess, William E. Walsh, Rajarshi Das, Alla Segal, Ian Whalley, Jeffrey O. Kephart, Steve R. White: A Multi-Agent Systems Approach to Autonomic Computing. AAMAS 2004: 464-471
34	EE	William E. Walsh, Gerald Tesauro, Jeffrey O. Kephart, Rajarshi Das: Utility Functions in Autonomic Systems. ICAC 2004: 70-77
2003
33	EE	Cuihong Li, Gerald Tesauro: A strategic decision model for multi-attribute bilateral negotiation with alternating. ACM Conference on Electronic Commerce 2003: 208-209
32	EE	James E. Hanson, Gerald Tesauro, Jeffrey O. Kephart, E. C. Snibl: Multi-agent implementation of asymmetric protocol for bilateral negotiations. ACM Conference on Electronic Commerce 2003: 224-225
31	EE	Gerald Tesauro: Extending Q-Learning to General Adaptive Multi-Agent Systems. NIPS 2003
30		Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation. UAI 2003: 89-97
2002
29	EE	Gerald Tesauro, Jonathan Bredin: Strategic sequential bidding in auctions using dynamic programming. AAMAS 2002: 591-598
28	EE	Gerald Tesauro: Programming backgammon using self-teaching neural nets. Artif. Intell. 134(1-2): 181-199 (2002)
27		Gerald Tesauro, Jeffrey O. Kephart: Pricing in Agent Economies Using Multi-Agent Q-Learning. Autonomous Agents and Multi-Agent Systems 5(3): 289-304 (2002)
2001
26	EE	Gerald Tesauro, Rajarshi Das: High-performance bidding agents for the continuous double auction. ACM Conference on Electronic Commerce 2001: 206-209
25		Rajarshi Das, James E. Hanson, Jeffrey O. Kephart, Gerald Tesauro: Agent-Human Interactions in the Continuous Double Auction. IJCAI 2001: 1169-1187
24	EE	Gerald Tesauro: Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning. Sequence Learning 2001: 288-307
2000
23	EE	Manu Sridharan, Gerald Tesauro: Multi-Agent Q-Learning and Regression Trees for Automated Pricing Decisions. ICMAS 2000: 447-448
22		Jeffrey O. Kephart, Gerald Tesauro: Pseudo-convergent Q-Learning by Competitive Pricebots. ICML 2000: 463-470
21		Manu Sridharan, Gerald Tesauro: Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions. ICML 2000: 927-934
20	EE	Gerald Tesauro, Jeffrey O. Kephart: Foresight-based pricing algorithms in agent economies. Decision Support Systems 28(1-2): 49-60 (2000)
1999
19	EE	Amy R. Greenwald, Jeffrey O. Kephart, Gerald Tesauro: Strategic pricebot dynamics. ACM Conference on Electronic Commerce 1999: 58-67
1998
18		Gerald Tesauro: Comments on ``Co-Evolution in the Successful Learning of Backgammon Strategy''. Machine Learning 32(3): 241-243 (1998)
1996
17	EE	Gerald Tesauro, Gregory R. Galperin: On-line Policy Improvement using Monte-Carlo Search. NIPS 1996: 1068-1074
1995
16		Gerald Tesauro, David S. Touretzky, Todd K. Leen: Advances in Neural Information Processing Systems 7, [NIPS Conference, Denver, Colorado, USA, 1994] MIT Press 1995
15		Jeffrey O. Kephart, Gregory B. Sorkin, William C. Arnold, David M. Chess, Gerald Tesauro, Steve R. White: Biologically Inspired Defenses Against Computer Viruses. IJCAI (1) 1995: 985-996
14		Gerald Tesauro: Temporal Difference Learning and TD-Gammon. Commun. ACM 38(3): 58-68 (1995)
1994
13		Jack D. Cowan, Gerald Tesauro, Joshua Alspector: Advances in Neural Information Processing Systems 6, [7th NIPS Conference, Denver, Colorado, USA, 1993] Morgan Kaufmann 1994
1992
12		Gerald Tesauro: Temporal Difference Learning of Backgammon Strategy. ML 1992: 451-457
11		Gerald Tesauro: Practical Issues in Temporal Difference Learning. Machine Learning 8: 257-277 (1992)
1991
10	EE	Gerald Tesauro: Practical Issues in Temporal Difference Learning. NIPS 1991: 259-266
9		Jakub Wejchert, Gerald Tesauro: Visualizing processes in neural networks. IBM Journal of Research and Development 35(1): 244-253 (1991)
1990
8	EE	David A. Cohn, Gerald Tesauro: Can Neural Networks Do Better Than the Vapnik-Chervonenkis Bounds? NIPS 1990: 911-917
1989
7	EE	Jakub Wejchert, Gerald Tesauro: Neural Network Visualization. NIPS 1989: 465-472
6	EE	Subutai Ahmad, Gerald Tesauro, Yu He: Asymptotic Convergence of Backpropagation: Numerical Experiments. NIPS 1989: 606-613
5		Gerald Tesauro, Terrence J. Sejnowski: A Parallel Network that Learns to Play Backgammon. Artif. Intell. 39(3): 357-390 (1989)
1988
4		Gerald Tesauro: Connectionist Learning of Expert Backgammon Evaluations. ML 1988: 200-206
3	EE	Subutai Ahmad, Gerald Tesauro: Scaling and Generalization in Neural Networks: A Case Study. NIPS 1988: 160-168
2	EE	Gerald Tesauro: Connectionist Learning of Expert Preferences by Comparison Training. NIPS 1988: 99-106
1987
1	EE	Gerald Tesauro, Terrence J. Sejnowski: A 'Neural' Network that Learns to Play Backgammon. NIPS 1987: 794-803

Coauthor Index

1 Subutai Ahmad [3] [6]

2 Joshua Alspector [13]

3 William C. Arnold [15]

4 Mohamed N. Bennani [39] [41]

5 Craig Boutilier [30] [38]

6 Jonathan Bredin [29]

7 Hoi Chan [42] [44] [45]

8 David M. Chess [15] [35]

9 David A. Cohn [8]

10 Jack D. Cowan [13]

11 Rajarshi Das [25] [26] [30] [34] [35] [36] [38] [39] [41] [42] [44] [45]

12 Gregory R. Galperin [17]

13 Amy R. Greenwald [19]

14 James E. Hanson [25] [32]

15 Yu He [6]

16 Nicholas K. Jong [39] [41]

17 Jeffrey O. Kephart [15] [19] [20] [22] [25] [27] [30] [32] [34] [35] [36] [38] [42] [44] [45]

18 Todd K. Leen [16]

19 Charles Lefurgy [42] [44] [45]

20 David Levine [42]

21 David W. Levine [44] [45]

22 Cuihong Li [33]

23 Relu Patrascu [38]

24 Freeman L. Rawson III [42] [44]

25 Irina Rish [43]

26 Alla Segal [35]

27 Terrence J. Sejnowski [1] [5]

28 E. C. Snibl [32]

29 Gregory B. Sorkin [15]

30 Manu Sridharan [21] [23]

31 David S. Touretzky [16]

32 William E. Walsh [30] [34] [35] [36] [38]

33 Jakub Wejchert [7] [9]

34 Ian Whalley [35]

35 Steve R. White [15] [35]

Colors in the list of coauthors

1	Subutai Ahmad	[3] [6]
2	Joshua Alspector	[13]
3	William C. Arnold	[15]
4	Mohamed N. Bennani	[39] [41]
5	Craig Boutilier	[30] [38]
6	Jonathan Bredin	[29]
7	Hoi Chan	[42] [44] [45]
8	David M. Chess	[15] [35]
9	David A. Cohn	[8]
10	Jack D. Cowan	[13]
11	Rajarshi Das	[25] [26] [30] [34] [35] [36] [38] [39] [41] [42] [44] [45]
12	Gregory R. Galperin	[17]
13	Amy R. Greenwald	[19]
14	James E. Hanson	[25] [32]
15	Yu He	[6]
16	Nicholas K. Jong	[39] [41]
17	Jeffrey O. Kephart	[15] [19] [20] [22] [25] [27] [30] [32] [34] [35] [36] [38] [42] [44] [45]
18	Todd K. Leen	[16]
19	Charles Lefurgy	[42] [44] [45]
20	David Levine	[42]
21	David W. Levine	[44] [45]
22	Cuihong Li	[33]
23	Relu Patrascu	[38]
24	Freeman L. Rawson III	[42] [44]
25	Irina Rish	[43]
26	Alla Segal	[35]
27	Terrence J. Sejnowski	[1] [5]
28	E. C. Snibl	[32]
29	Gregory B. Sorkin	[15]
30	Manu Sridharan	[21] [23]
31	David S. Touretzky	[16]
32	William E. Walsh	[30] [34] [35] [36] [38]
33	Jakub Wejchert	[7] [9]
34	Ian Whalley	[35]
35	Steve R. White	[15] [35]