![]() | ![]() |
2007 | ||
---|---|---|
1 | EE | Yugo Hasegawa, Satoko Takada, Hidehiro Nakano, Shuichi Arai, Arata Miyauchi: A reinforcement learning method using a dynamic reinforcement function based on action selection probability. Systems and Computers in Japan 38(7): 1-11 (2007) |
1 | Shuichi Arai | [1] |
2 | Yugo Hasegawa | [1] |
3 | Arata Miyauchi | [1] |
4 | Hidehiro Nakano | [1] |