EWRL and NIPS 2016

EWRL and NIPS 2016 I went to the European Workshop on Reinforcement Learning and NIPS last month and saw several interesting things. At EWRL, I particularly liked the talks from: Remi Munos on off-policy evaluation Mohammad Ghavamzadeh on learning safe policies Emma Brunskill on optimizing biased-but safe estimators (sense a theme?) Sergey Levine on low sample complexity applications of RL in robotics. My talk is here. Overall, this was a well organized workshop with diverse and interesting subjects, with the only caveat being that they had to limit registration At NIPS itself, I found the poster sessions fairly interesting. Allen-Zhu and Hazan had a new notion of a reduction (video). Zhao, Poupart, and Gordon had a new way to learn Sum-Product Networks Ho, Littman, MacGlashan, Cushman, and Austerwell, had a paper on how “Showing” is different from “Doing”. Toulis and…
Original Post: EWRL and NIPS 2016

The Multiworld Testing Decision Service

The Multiworld Testing Decision Service We made a tool that you can use. It is the first general purpose reinforcement-based learning system Reinforcement learning is much discussed these days with successes like AlphaGo. Wouldn’t it be great if Reinforcement Learning algorithms could easily be used to solve all reinforcement learning problems? But there is a well-known problem: It’s very easy…
Original Post: The Multiworld Testing Decision Service

AlphaGo is not the solution to AI

AlphaGo is not the solution to AI Congratulations are in order for the folks at Google Deepmind who have mastered Go. However, some of the discussion around this seems like giddy overstatement. Wired says Machines have conquered the last games and Slashdot says We know now that we don’t need any big new breakthroughs to get to true AI. The…
Original Post: AlphaGo is not the solution to AI