Williams et al., 2017 - Google Patents

Information theoretic MPC for model-based reinforcement learning

Williams et al., 2017

Document ID: 15474208887248163578
Author: Williams G; Wagener N; Goldfain B; Drews P; Rehg J; Boots B; Theodorou E
Publication year: 2017
Publication venue: 2017 IEEE international conference on robotics and automation (ICRA)

External Links

Cited by

Snippet

We introduce an information theoretic model predictive control (MPC) algorithm capable of handling complex cost criteria and general nonlinear dynamics. The generality of the approach makes it possible to use multi-layer neural networks as dynamics models, which …

Continue reading at homes.cs.washington.edu (PDF) (other versions)

230000002787 reinforcement 0 title abstract description 10

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0268—Control of position or course in two dimensions specially adapted to land vehicles using internal positioning means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0287—Control of position or course in two dimensions specially adapted to land vehicles involving a plurality of land vehicles, e.g. fleet or convoy travelling
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0212—Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/0011—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot associated with a remote control arrangement
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/10—Simultaneous control of position or course in three dimensions
- G05D1/101—Simultaneous control of position or course in three dimensions specially adapted for aircraft

Similar Documents

Publication	Publication Date	Title
Williams et al.	2017	Information theoretic MPC for model-based reinforcement learning
Zhu et al.	2021	A survey of deep RL and IL for autonomous driving policy learning
Cai et al.	2020	High-speed autonomous drifting with deep reinforcement learning
Song et al.	2022	Policy search for model predictive control with application to agile drone flight
Fridovich-Keil et al.	2020	Efficient iterative linear-quadratic approximations for nonlinear multi-player general-sum differential games
Williams et al.	2017	Model predictive path integral control: From theory to parallel computation
Cutler et al.	2016	Autonomous drifting using simulation-aided reinforcement learning
Cutler et al.	2015	Efficient reinforcement learning for robots using informative simulated priors
US20210263526A1 (en)	2021-08-26	Method and device for supporting maneuver planning for an automated driving vehicle or a robot
Pulver et al.	2021	Pilot: Efficient planning by imitation learning and optimisation for safe autonomous driving
Kapania	2016	Trajectory planning and control for an autonomous race vehicle
Andersson et al.	2015	Model-based reinforcement learning in continuous environments using real-time constrained optimization
Löckel et al.	2020	A probabilistic framework for imitating human race driver behavior
Goecks	2020	Human-in-the-loop methods for data-driven and reinforcement learning systems
Xu et al.	2019	Toward modularization of neural network autonomous driving policy using parallel attribute networks
Kim et al.	2023	Bridging active exploration and uncertainty-aware deployment using probabilistic ensemble neural network dynamics
Xiao et al.	2024	Anycar to anywhere: Learning universal dynamics model for agile and adaptive mobility
Gregory et al.	2021	Improving trajectory tracking accuracy for faster and safer autonomous navigation of ground vehicles in off-road settings
Chen et al.	2024	Imitation learning from imperfect demonstrations for AUV path tracking and obstacle avoidance
Picotti et al.	2023	A learning-based nonlinear model predictive controller for a real go-kart based on black-box dynamics modeling through gaussian processes
Löckel et al.	2023	An adaptive human driver model for realistic race car simulations
Heeg et al.	2024	Learning Quadrotor Control From Visual Features Using Differentiable Simulation
Flad et al.	2014	Individual driver modeling via optimal selection of steering primitives
Williams et al.	2016	Information theoretic MPC using neural network dynamics
Samsani et al.	2022	Rapid Autonomous Vehicle Drifting with Deep Reinforcement Learning