[go: up one dir, main page]

Williams et al., 2017 - Google Patents

Information theoretic MPC for model-based reinforcement learning

Williams et al., 2017

View PDF
Document ID
15474208887248163578
Author
Williams G
Wagener N
Goldfain B
Drews P
Rehg J
Boots B
Theodorou E
Publication year
Publication venue
2017 IEEE international conference on robotics and automation (ICRA)

External Links

Snippet

We introduce an information theoretic model predictive control (MPC) algorithm capable of handling complex cost criteria and general nonlinear dynamics. The generality of the approach makes it possible to use multi-layer neural networks as dynamics models, which …
Continue reading at homes.cs.washington.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/02Control of position or course in two dimensions
    • G05D1/021Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0268Control of position or course in two dimensions specially adapted to land vehicles using internal positioning means
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/02Control of position or course in two dimensions
    • G05D1/021Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0287Control of position or course in two dimensions specially adapted to land vehicles involving a plurality of land vehicles, e.g. fleet or convoy travelling
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/02Control of position or course in two dimensions
    • G05D1/021Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0212Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00Systems involving the use of models or simulators of said systems
    • G05B17/02Systems involving the use of models or simulators of said systems electric
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/0011Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot associated with a remote control arrangement
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/10Simultaneous control of position or course in three dimensions
    • G05D1/101Simultaneous control of position or course in three dimensions specially adapted for aircraft

Similar Documents

Publication Publication Date Title
Williams et al. Information theoretic MPC for model-based reinforcement learning
Zhu et al. A survey of deep RL and IL for autonomous driving policy learning
Cai et al. High-speed autonomous drifting with deep reinforcement learning
Song et al. Policy search for model predictive control with application to agile drone flight
Fridovich-Keil et al. Efficient iterative linear-quadratic approximations for nonlinear multi-player general-sum differential games
Williams et al. Model predictive path integral control: From theory to parallel computation
Cutler et al. Autonomous drifting using simulation-aided reinforcement learning
Cutler et al. Efficient reinforcement learning for robots using informative simulated priors
US20210263526A1 (en) Method and device for supporting maneuver planning for an automated driving vehicle or a robot
Pulver et al. Pilot: Efficient planning by imitation learning and optimisation for safe autonomous driving
Kapania Trajectory planning and control for an autonomous race vehicle
Andersson et al. Model-based reinforcement learning in continuous environments using real-time constrained optimization
Löckel et al. A probabilistic framework for imitating human race driver behavior
Goecks Human-in-the-loop methods for data-driven and reinforcement learning systems
Xu et al. Toward modularization of neural network autonomous driving policy using parallel attribute networks
Kim et al. Bridging active exploration and uncertainty-aware deployment using probabilistic ensemble neural network dynamics
Xiao et al. Anycar to anywhere: Learning universal dynamics model for agile and adaptive mobility
Gregory et al. Improving trajectory tracking accuracy for faster and safer autonomous navigation of ground vehicles in off-road settings
Chen et al. Imitation learning from imperfect demonstrations for AUV path tracking and obstacle avoidance
Picotti et al. A learning-based nonlinear model predictive controller for a real go-kart based on black-box dynamics modeling through gaussian processes
Löckel et al. An adaptive human driver model for realistic race car simulations
Heeg et al. Learning Quadrotor Control From Visual Features Using Differentiable Simulation
Flad et al. Individual driver modeling via optimal selection of steering primitives
Williams et al. Information theoretic MPC using neural network dynamics
Samsani et al. Rapid Autonomous Vehicle Drifting with Deep Reinforcement Learning