Rollout, Policy Iteration, and Distributed Reinforcement Learning: Copyright Information
- ISBN: 9787302599388
- Barcode: 9787302599388; 978-7-302-59938-8
- Binding: standard offset-paper paperback
- Volumes: N/A
- Weight: N/A
- Category: N/A
Rollout, Policy Iteration, and Distributed Reinforcement Learning: Highlights
This book introduces readers to recent developments and applications of policy iteration in reinforcement learning, and in particular of the rollout method within distributed and multiagent frameworks. It can serve as a one-semester textbook for senior undergraduate or graduate students in artificial intelligence, systems and control science, and related fields, and as a reference for professionals engaged in related research.
Rollout, Policy Iteration, and Distributed Reinforcement Learning: Description
The book's main contents are: Chapter 1, principles of dynamic programming; Chapter 2, rollout and policy improvement; Chapter 3, specialized rollout algorithms; Chapter 4, learning values and policies; Chapter 5, infinite-horizon distributed and multiagent algorithms. The book is strongly influenced by the algorithm of AlphaZero, the groundbreaking Go program. Like AlphaZero, it builds on a core framework of policy iteration, neural-network approximation of value and policy functions, parallel and distributed computation, and techniques for simplifying the lookahead minimization, and it extends the range of problems to which these algorithms apply. Its distinctive contributions are techniques that improve the efficiency of reinforcement learning policy improvement computations in distributed and multiagent settings, a connection between the one-step policy improvement rollout method and the model predictive control (MPC) design methodology widely used in control systems, and applications of rollout to complex discrete and combinatorial optimization problems.
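To make the central idea concrete: the sketch below, which is not taken from the book, shows one-step rollout for a deterministic finite-horizon problem. The problem model is supplied through hypothetical callbacks (controls, next_state, stage_cost, terminal_cost) together with a base_heuristic policy; the rollout control at each state minimizes the current stage cost plus the cost of following the base heuristic to the end of the horizon.

```python
# Minimal sketch of one-step rollout for a deterministic finite-horizon
# problem. All problem callbacks here are hypothetical placeholders,
# not the book's notation.

def heuristic_cost(x, k, N, next_state, stage_cost, terminal_cost, base_heuristic):
    """Cost accumulated by following the base heuristic from state x at stage k."""
    total = 0.0
    for t in range(k, N):
        u = base_heuristic(x, t)          # control chosen by the base heuristic
        total += stage_cost(x, u, t)
        x = next_state(x, u, t)
    return total + terminal_cost(x)

def rollout_control(x, k, N, controls, next_state, stage_cost, terminal_cost, base_heuristic):
    """One-step lookahead: try each control now, then follow the base heuristic."""
    best_u, best_q = None, float("inf")
    for u in controls(x, k):
        q = stage_cost(x, u, k) + heuristic_cost(
            next_state(x, u, k), k + 1, N,
            next_state, stage_cost, terminal_cost, base_heuristic,
        )
        if q < best_q:
            best_u, best_q = u, q
    return best_u
```

By the policy improvement principle, the policy that applies rollout_control at every stage performs no worse than the base heuristic; the multiagent variants discussed in the book reduce the cost of the minimization by optimizing over one agent's control component at a time.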
Rollout, Policy Iteration, and Distributed Reinforcement Learning: Table of Contents
1 Exact and Approximate Dynamic Programming Principles
1.1 AlphaZero, Off-Line Training, and On-Line Play
1.2 Deterministic Dynamic Programming
1.2.1 Finite Horizon Problem Formulation
1.2.2 The Dynamic Programming Algorithm
1.2.3 Approximation in Value Space
1.3 Stochastic Dynamic Programming
1.3.1 Finite Horizon Problems
1.3.2 Approximation in Value Space for Stochastic DP
1.3.3 Infinite Horizon Problems-An Overview
1.3.4 Infinite Horizon-Approximation in Value Space
1.3.5 Infinite Horizon-Policy Iteration, Rollout, and Newton's Method
1.4 Examples, Variations, and Simplifications
1.4.1 A Few Words About Modeling
1.4.2 Problems with a Termination State
1.4.3 State Augmentation, Time Delays, Forecasts, and Uncontrollable State Components
1.4.4 Partial State Information and Belief States
1.4.5 Multiagent Problems and Multiagent Rollout
1.4.6 Problems with Unknown Parameters-Adaptive Control
1.4.7 Adaptive Control by Rollout and On-Line Replanning
1.5 Reinforcement Learning and Optimal Control-Some Terminology
1.6 Notes and Sources
2 General Principles of Approximation in Value Space
2.1 Approximation in Value and Policy Space
2.1.1 Approximation in Value Space-One-Step and Multistep Lookahead
2.1.2 Approximation in Policy Space
2.1.3 Combined Approximation in Value and Policy Space
2.2 Approaches for Value Space Approximation
2.2.1 Off-Line and On-Line Implementations
2.2.2 Model-Based and Model-Free Implementations
2.2.3 Methods for Cost-to-Go Approximation
2.2.4 Methods for Expediting the Lookahead Minimization
2.3 Deterministic Rollout and the Policy Improvement Principle
2.3.1 On-Line Rollout for Deterministic Discrete Optimization
2.3.2 Using Multiple Base Heuristics-Parallel Rollout
2.3.3 The Simplified Rollout Algorithm
2.3.4 The Fortified Rollout Algorithm
2.3.5 Rollout with Multistep Lookahead
2.3.6 Rollout with an Expert
2.3.7 Rollout with Small Stage Costs and Long Horizon-Continuous-Time Rollout
2.4 Stochastic Rollout and Monte Carlo Tree Search
2.4.1 Simulation-Based Implementation of the Rollout Algorithm
2.4.2 Monte Carlo Tree Search
2.4.3 Randomized Policy Improvement by Monte Carlo Tree Search
2.4.4 The Effect of Errors in Rollout-Variance Reduction
2.4.5 Rollout Parallelization
2.5 Rollout for Infinite-Spaces Problems-Optimization Heuristics
2.5.1 Rollout for Infinite-Spaces Deterministic Problems
2.5.2 Rollout Based on Stochastic Programming
2.6 Notes and Sources
3 Specialized Rollout Algorithms
3.1 Model Predictive Control
3.1.1 Target Tubes and Constrained Controllability
3.1.2 Model Predictive Control with Terminal Cost
3.1.3 Variants of Model Predictive Control
3.1.4 Target Tubes and State-Constrained Rollout
3.2 Multiagent Rollout
3.2.1 Asynchronous and Autonomous Multiagent Rollout
3.2.2 Multiagent Coupling Through Constraints
3.2.3 Multiagent Model Predictive Control
3.2.4 Separable and Multiarmed Bandit Problems
3.3 Constrained Rollout-Deterministic Optimal Control
3.3.1 Sequential Consistency, Sequential Improvement, and the Cost Improvement Property
3.3.2 The Fortified Rollout Algorithm and Other Variations
3.4 Constrained Rollout-Discrete Optimization
3.4.1 General Discrete Optimization Problems
3.4.2 Multidimensional Assignment
3.5 Rollout for Surrogate Dynamic Programming and Bayesian Optimization
3.6 Rollout for Minimax Control
3.7 Notes and Sources
4 Learning Values and Policies
4.1 Parametric Approximation Architectures
4.1.1 Cost Function Approximation
4.1.2 Feature-Based Architectures
4.1.3 Training of Linear and Nonlinear Architectures
4.2 Neural Networks
4.2.1 Training of Neural Networks
Rollout, Policy Iteration, and Distributed Reinforcement Learning: About the Author
Dimitri P. Bertsekas is a tenured professor at MIT, a member of the US National Academy of Engineering, and a visiting professor at the Center for Complex and Networked Systems at Tsinghua University. An internationally renowned author in electrical engineering and computer science, he has written more than a dozen widely used textbooks and monographs, including Nonlinear Programming, Network Optimization, Dynamic Programming, Convex Optimization, and Reinforcement Learning and Optimal Control.