![]() | ![]() |
2008 | ||
---|---|---|
7 | EE | Harukazu Igarashi, K. Nakamura, Seiji Ishihara: Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks. IJCNN 2008: 46-52 |
6 | EE | Seiji Ishihara, Harukazu Igarashi: Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies. PRICAI 2008: 164-174 |
2006 | ||
5 | EE | Seiji Ishihara, Harukazu Igarashi: A Task Decomposition Algorithm Using Mixtures of Normal Distributions for Classification Problems. HIS 2006: 28 |
4 | EE | Seiji Ishihara, Harukazu Igarashi: Applying the policy gradient method to behavior learning in multiagent systems: The pursuit problem. Systems and Computers in Japan 37(10): 101-109 (2006) |
2005 | ||
3 | EE | Seiji Ishihara, Harukazu Igarashi: A Task Decomposition Algorithm Using Radial Basis Functions for Classification Problems. DICTA 2005: 2 |
2003 | ||
2 | Seiji Ishihara, Harukazu Igarashi: Policy Gradient Methods in Multi-Agent Systems. HIS 2003: 789-798 | |
1998 | ||
1 | Seiji Ishihara, Takashi Nagano: A Modular Type Network for Incremental Learning. ICONIP 1998: 1651-1654 |
1 | Harukazu Igarashi | [2] [3] [4] [5] [6] [7] |
2 | Takashi Nagano | [1] |
3 | K. Nakamura | [7] |