Williams et al., 2017 - Google Patents
Information theoretic MPC for model-based reinforcement learningWilliams et al., 2017
View PDF- Document ID
- 15474208887248163578
- Author
- Williams G
- Wagener N
- Goldfain B
- Drews P
- Rehg J
- Boots B
- Theodorou E
- Publication year
- Publication venue
- 2017 IEEE international conference on robotics and automation (ICRA)
External Links
Snippet
We introduce an information theoretic model predictive control (MPC) algorithm capable of handling complex cost criteria and general nonlinear dynamics. The generality of the approach makes it possible to use multi-layer neural networks as dynamics models, which …
- 230000002787 reinforcement 0 title abstract description 10
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0268—Control of position or course in two dimensions specially adapted to land vehicles using internal positioning means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0287—Control of position or course in two dimensions specially adapted to land vehicles involving a plurality of land vehicles, e.g. fleet or convoy travelling
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0212—Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/0011—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot associated with a remote control arrangement
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/10—Simultaneous control of position or course in three dimensions
- G05D1/101—Simultaneous control of position or course in three dimensions specially adapted for aircraft
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Williams et al. | Information theoretic MPC for model-based reinforcement learning | |
Zhu et al. | A survey of deep RL and IL for autonomous driving policy learning | |
Cai et al. | High-speed autonomous drifting with deep reinforcement learning | |
Song et al. | Policy search for model predictive control with application to agile drone flight | |
Fridovich-Keil et al. | Efficient iterative linear-quadratic approximations for nonlinear multi-player general-sum differential games | |
Williams et al. | Model predictive path integral control: From theory to parallel computation | |
Cutler et al. | Autonomous drifting using simulation-aided reinforcement learning | |
Cutler et al. | Efficient reinforcement learning for robots using informative simulated priors | |
US20210263526A1 (en) | Method and device for supporting maneuver planning for an automated driving vehicle or a robot | |
Pulver et al. | Pilot: Efficient planning by imitation learning and optimisation for safe autonomous driving | |
Kapania | Trajectory planning and control for an autonomous race vehicle | |
Andersson et al. | Model-based reinforcement learning in continuous environments using real-time constrained optimization | |
Löckel et al. | A probabilistic framework for imitating human race driver behavior | |
Goecks | Human-in-the-loop methods for data-driven and reinforcement learning systems | |
Xu et al. | Toward modularization of neural network autonomous driving policy using parallel attribute networks | |
Kim et al. | Bridging active exploration and uncertainty-aware deployment using probabilistic ensemble neural network dynamics | |
Xiao et al. | Anycar to anywhere: Learning universal dynamics model for agile and adaptive mobility | |
Gregory et al. | Improving trajectory tracking accuracy for faster and safer autonomous navigation of ground vehicles in off-road settings | |
Chen et al. | Imitation learning from imperfect demonstrations for AUV path tracking and obstacle avoidance | |
Picotti et al. | A learning-based nonlinear model predictive controller for a real go-kart based on black-box dynamics modeling through gaussian processes | |
Löckel et al. | An adaptive human driver model for realistic race car simulations | |
Heeg et al. | Learning Quadrotor Control From Visual Features Using Differentiable Simulation | |
Flad et al. | Individual driver modeling via optimal selection of steering primitives | |
Williams et al. | Information theoretic MPC using neural network dynamics | |
Samsani et al. | Rapid Autonomous Vehicle Drifting with Deep Reinforcement Learning |