Search research articles
Contact Us
Filters
Showing results (1-10 of 8) with videos related to
Page
of 1
Sort By:
Neural Computation
|
February 24, 2023
Multistream-Based Marked Point Process With Decomposed Cumulative Hazard Functions
Hirotaka Hachiya, Sujun Hong
Neural Computation
|
August 20, 2011
Reward-weighted regression with sample reuse for direct policy search in reinforcement learning
Hirotaka Hachiya, Jan Peters, Masashi Sugiyama
Neural Networks : the Official Journal of the International Neural Network Society
|
January 19, 2010
Efficient exploration through active learning for value function approximation in reinforcement learning
Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama
Neural Networks : the Official Journal of the International Neural Network Society
|
October 25, 2011
Analysis and improvement of policy gradient estimation
Tingting Zhao, Hirotaka Hachiya, Gang Niu, et al.
Neural Networks : the Official Journal of the International Neural Network Society
|
February 14, 2009
Adaptive importance sampling for value function approximation in off-policy reinforcement learning
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiayma, et al.
Neural Computation
|
March 23, 2013
Efficient sample reuse in policy gradients with parameter-based exploration
Tingting Zhao, Hirotaka Hachiya, Voot Tangkaratt, et al.
Neural Computation
|
April 4, 2013
Relative density-ratio estimation for robust distribution comparison
Makoto Yamada, Taiji Suzuki, Takafumi Kanamori, et al.
Neural Computation
|
October 10, 2013
Information-maximization clustering based on squared-loss mutual information
Masashi Sugiyama, Gang Niu, Makoto Yamada, et al.
Page
of 1
Search research articles
Search
Showing results (1-10 of 8) with videos related to
Sort By:
Page
of 1
Neural Computation
|
February 24, 2023
Multistream-Based Marked Point Process With Decomposed Cumulative Hazard Functions
Hirotaka Hachiya, Sujun Hong
Neural Computation
|
August 20, 2011
Reward-weighted regression with sample reuse for direct policy search in reinforcement learning
Hirotaka Hachiya, Jan Peters, Masashi Sugiyama
Neural Networks : the Official Journal of the International Neural Network Society
|
January 19, 2010
Efficient exploration through active learning for value function approximation in reinforcement learning
Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama
Neural Networks : the Official Journal of the International Neural Network Society
|
October 25, 2011
Analysis and improvement of policy gradient estimation
Tingting Zhao, Hirotaka Hachiya, Gang Niu, et al.
Neural Networks : the Official Journal of the International Neural Network Society
|
February 14, 2009
Adaptive importance sampling for value function approximation in off-policy reinforcement learning
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiayma, et al.
Neural Computation
|
March 23, 2013
Efficient sample reuse in policy gradients with parameter-based exploration
Tingting Zhao, Hirotaka Hachiya, Voot Tangkaratt, et al.
Neural Computation
|
April 4, 2013
Relative density-ratio estimation for robust distribution comparison
Makoto Yamada, Taiji Suzuki, Takafumi Kanamori, et al.
Neural Computation
|
October 10, 2013
Information-maximization clustering based on squared-loss mutual information
Masashi Sugiyama, Gang Niu, Makoto Yamada, et al.
Page
of 1