2008 | ||
---|---|---|
5 | EE | Abhijit Gosavi: On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning. Winter Simulation Conference 2008: 525-531 |
2006 | ||
4 | EE | Abhijit Gosavi: A risk-sensitive approach to total productive maintenance. Automatica 42(8): 1321-1330 (2006) |
2004 | ||
3 | EE | Ganesh Subramaniam, Abhijit Gosavi: Simulation-Based Optimization for Material Dispatching in a Retailer Network. Winter Simulation Conference 2004: 1412-1417 |
2 | EE | Abhijit Gosavi: Reinforcement learning for long-run average cost. European Journal of Operational Research 155(3): 654-674 (2004) |
1 | EE | Abhijit Gosavi: A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis. Machine Learning 55(1): 5-29 (2004) |
1 | Ganesh Subramaniam | [3] |