![]() |
| 2007 | ||
|---|---|---|
| 1 | EE | Yugo Hasegawa, Satoko Takada, Hidehiro Nakano, Shuichi Arai, Arata Miyauchi: A reinforcement learning method using a dynamic reinforcement function based on action selection probability. Systems and Computers in Japan 38(7): 1-11 (2007) |
| 1 | Shuichi Arai | [1] |
| 2 | Yugo Hasegawa | [1] |
| 3 | Arata Miyauchi | [1] |
| 4 | Hidehiro Nakano | [1] |