![]() |
| 2007 | ||
|---|---|---|
| 2 | EE | Mohammed Shahid Abdulla, Shalabh Bhatnagar: Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. Discrete Event Dynamic Systems 17(1): 23-52 (2007) |
| 2006 | ||
| 1 | EE | Mohammed Shahid Abdulla, Shalabh Bhatnagar: SPSA algorithms with measurement reuse. Winter Simulation Conference 2006: 320-328 |
| 1 | Shalabh Bhatnagar | [1] [2] |