BIBLIOTECA MANUEL BELGRANO - Facultad de Ciencias Económicas - UNC

Imagen de cubierta local
Imagen de cubierta local
Imagen de Google Jackets

Algorithms for decision making / Mykel J. Kochenderfer, Tim A. Wheeler, Kyle H. Wray. [recurso electrónico - acceso abierto]

Por: Colaborador(es): Tipo de material: TextoTextoDetalles de publicación: Cambridge, Mass. : The MIT Press, c2022Descripción: 1 recurso en línea (700 p.)Tema(s): Clasificación CDD:
  • 658.403
Recursos en línea:
Contenidos:
Introduction -- PART I: PROBABILISTIC REASONING: Representation. Inference. Parameter Learning. Structure Learning. Simple Decisions -- PART II: SEQUENTIAL PROBLEMS: Approximate Value Functions. Online Planning. Policy Search. Policy Gradient Estimation. Policy Gradient Optimization. Actor-Critic Methods. Policy Validation -- PART III: MODEL UNCERTAINTY: Exploration and Exploitation. Model-Based Methods. Model-Free Methods. Imitation Learning -- PART IV: STATE UNCERTAINTY: Beliefs. Exact Belief State Planning. Offline Belief State Planning. Online Belief State Planning. Controller Abstractions -- PART V: MULTIAGENT SYSTEMS: Sequential Problems. State Uncertainty. Collaborative Agents -- APPENDICES: A: Mathematical Concepts -- B: Probability Distributions -- C: Computational Complexity -- D: Neural Representations -- E: Search Algorithms -- F: Problems -- G: Julia.
Resumen: The book first addresses the problem of reasoning about uncertainty and objectives in simple decisions at a single point in time, and then turns to sequential decision problems in stochastic environments where the outcomes of our actions are uncertain. It goes on to address model uncertainty, when we do not start with a known model and must learn how to act through interaction with the environment; state uncertainty, in which we do not know the current state of the environment due to imperfect perceptual information; and decision contexts involving multiple agents. The book focuses primarily on planning and reinforcement learning, although some of the techniques presented draw on elements of supervised learning and optimization. Algorithms are implemented in the Julia programming language. Figures, examples, and exercises convey the intuition behind the various approaches presented.
Existencias
Tipo de ítem Biblioteca actual Signatura topográfica Estado Fecha de vencimiento Código de barras
Libro electrónico Libro electrónico Biblioteca Manuel Belgrano Recurso en línea (Navegar estantería(Abre debajo)) Disponible

Introduction -- PART I: PROBABILISTIC REASONING: Representation. Inference. Parameter Learning. Structure Learning.
Simple Decisions -- PART II: SEQUENTIAL PROBLEMS: Approximate Value Functions. Online Planning. Policy Search.
Policy Gradient Estimation. Policy Gradient Optimization. Actor-Critic Methods. Policy Validation -- PART III: MODEL UNCERTAINTY: Exploration and Exploitation. Model-Based Methods. Model-Free Methods. Imitation Learning --
PART IV: STATE UNCERTAINTY: Beliefs. Exact Belief State Planning. Offline Belief State Planning. Online Belief State Planning. Controller Abstractions -- PART V: MULTIAGENT SYSTEMS: Sequential Problems. State Uncertainty.
Collaborative Agents -- APPENDICES: A: Mathematical Concepts -- B: Probability Distributions -- C: Computational Complexity -- D: Neural Representations -- E: Search Algorithms -- F: Problems -- G: Julia.

The book first addresses the problem of reasoning about uncertainty and objectives in simple decisions at a single point in time, and then turns to sequential decision problems in stochastic environments where the outcomes of our actions are uncertain. It goes on to address model uncertainty, when we do not start with a known model and must learn how to act through interaction with the environment; state uncertainty, in which we do not know the current state of the environment due to imperfect perceptual information; and decision contexts involving multiple agents. The book focuses primarily on planning and reinforcement learning, although some of the techniques presented draw on elements of supervised learning and optimization. Algorithms are implemented in the Julia programming language. Figures, examples, and exercises convey the intuition behind the various approaches presented.

No hay comentarios en este titulo.

para colocar un comentario.

Haga clic en una imagen para verla en el visor de imágenes

Imagen de cubierta local

Bv. Enrique Barros s/n - Ciudad Universitaria. X5000HRV-Córdoba, Argentina - Tel. 00-54-351-4437300, Interno 48505
Horario de Atención: Lunes a Viernes de 8 a 18

Contacto sobre Información bibliográfica: proinfo.bmb@eco.uncor.edu