ABSTRACT: Second order optimization algorithms have a long history in scientific computing, but they tend not to be used much in machine learning. This is [...]
ABSTRACT: In many real-world applications of reinforcement learning (RL) such as healthcare, dialogue systems and robotics, running a new policy on humans or robots can [...]