研究

RLlib:分布式强化学习的抽象

作者:Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzalez, Michael I. Jordan, Ion Stoica

下载论文

摘要

强化学习(RL)算法涉及高度不规则计算模式的深度嵌套,每种模式通常都展示了分布式计算的机会。我们主张通过自顶向下的分层控制算法以可组合的方式分布RL组件,从而将并行性和资源需求封装在短时间运行的计算任务中。我们通过RLlib证明了这一原则的好处:RLlib是一个为RL提供可伸缩软件原语的库。这些原语使各种算法能够以高性能、可伸缩性和大量代码重用实现。RLlib是开源Ray项目的一部分。bob下载地址

相关内容

作者:Andrew Chen, Andy Chow, Aaron Davidson, Arjun DCunha, Ali Ghodsi, Sue Ann Hong, Andy Konwinski, Clemens Mewald, Siddharth Murching, Tomas Nykodym, Paul Ogilvie, Mani Parkhe, Avesh Singh, Fen Xie, Matei Zaharia, Richard Zang, Juntai郑俊泰,Corey Zumar, Databricks, Inc.

作者:Matei Zaharia, Andrew Chen, Aaron Davidson, Ali Ghodsi, Sue Ann Hong, Andy Konwinski, Siddharth Murching, Tomas Nykodym, Paul Ogilvie, Mani Parkhe, Fen Xie, Corey Zumar, Databricks Inc.

作者:Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Yang Zongheng, William Paul, Michael I. Jordan和Ion Stoica, UC Berkeley

作者:Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, Ion Stoica

作者:Firas Abuzaid, Joseph Bradley, Feynman Liang, Andrew Feng, Lee Yang, Matei Zaharia, Ameet Talwalkar

作者:Cody Coleman, Deepak Narayanan, Daniel Kang,赵田,张健,Luigi Nardi, Peter Bailis, Kunle Olukotun, Chris Ré, Matei Zaharia

作者:Daniel Crankshaw, Wang Xin, Giulio Zhou, Michael J. Franklin, Joseph E. Gonzalez, Ion Stoica

作者:Reza Bosagh Zadeh,向瑞孟,Alexander Ulanov, Burak Yavuz, Li Pu, Shivaram Venkataraman, Evan Sparks, Aaron staples, Matei Zaharia

作者:祥瑞孟,约瑟夫·布拉德利,Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, Ameet Talwalkar