2007 | |
---|---|
2 | Mohammed Shahid Abdulla, Shalabh Bhatnagar: Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. Discrete Event Dynamic Systems 17(1): 23-52 (2007) |
2006 | |
1 | Mohammed Shahid Abdulla, Shalabh Bhatnagar: SPSA algorithms with measurement reuse. Winter Simulation Conference 2006: 320-328 |

Coauthor index: 1. Shalabh Bhatnagar [1] [2]