000 a
999 _c108959
_d108958
005 20221121144820.0
008 221019b ||||| |||| 00| 0 eng d
020 _a9789811540974
080 _a004.85
_bDON/D
100 _a Dong Hao
245 _aDeep reinforcement learning
260 _bSpringer
_c2020
_aSingapore
300 _axxvii,514p.
653 _aReinforcement learning
653 _aQ Networks
700 _aDing Zihan
700 _aZhang Shanghang
942 _cBK