2008 | ||
---|---|---|
7 | EE | David Vengerov: A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments. Future Generation Comp. Syst. 24(7): 687-693 (2008) |
6 | EE | David Vengerov: A reinforcement learning framework for online data migration in hierarchical storage systems. The Journal of Supercomputing 43(1): 1-19 (2008) |
2007 | ||
5 | EE | Ilya Gluhovsky, David Vengerov, Brian O'Krafka: Comprehensive multivariate extrapolation modeling of multiprocessor cache miss rates. ACM Trans. Comput. Syst. 25(1): (2007) |
2005 | ||
4 | EE | David Vengerov: Adaptive Utility-Based Scheduling in Resource-Constrained Systems. Australian Conference on Artificial Intelligence 2005: 477-488 |
3 | EE | David Vengerov, Nikolai Iakovlev: A Reinforcement Learning Framework for Dynamic Resource Allocation: First Results. ICAC 2005: 339-340 |
2 | EE | David Vengerov, Nicholas Bambos, Hamid R. Berenji: A fuzzy reinforcement learning approach to power control in wireless transmitters. IEEE Transactions on Systems, Man, and Cybernetics, Part B 35(4): 768-778 (2005) |
2001 | ||
1 | | Hamid R. Berenji, David Vengerov: On Convergence of Fuzzy Reinforcement Learning. FUZZ-IEEE 2001: 618-621 |

Coauthor index | ||
---|---|---|
1 | Nicholas Bambos | [2] |
2 | Hamid R. Berenji | [1] [2] |
3 | Ilya Gluhovsky | [5] |
4 | Nikolai Iakovlev | [3] |
5 | Brian O'Krafka | [5] |