Finding the Limits of Machine Learning in Optimization

David A Edwards; Binan Gu; Katherine Johnston; Maia Wichman; Maxim Zyskin

doi:10.33774/miir-2022-q537t-v2



08 November 2022, Version 2

Working Paper


Abstract

The initial position and velocity of a robot are given, and the problem is to bring it to rest at the origin in the shortest possible time, subject to a maximum acceleration and a maximum speed. The robot controls its acceleration vector, so the full optimization problem can be posed as a Hamiltonian system whose solution minimizes the transit time. The problem is discussed in both the one- and two-dimensional cases. The key control parameter is the acceleration direction; reducing the problem to a one-dimensional optimization opens up several areas of exploration. The direction can be optimized with a global search algorithm, or updated periodically with a local search algorithm that uses a penalty function. Numerical solutions are presented for these cases, including cases where physical obstacles are included in the penalty function. The one-dimensional optimization also allows the use of reinforcement learning to minimize the transit time.
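To make the one-dimensional case concrete, the following is a minimal sketch of the textbook bang-bang solution for a point mass whose acceleration is bounded by `a_max`. It is not taken from the paper: it ignores the maximum-speed constraint mentioned above, and the function name `min_stop_time` and its parameters are hypothetical names chosen for illustration.

```python
import math


def min_stop_time(x: float, v: float, a_max: float) -> float:
    """Minimum time to reach x = 0 at rest with |acceleration| <= a_max.

    Assumption (not from the paper): no speed limit is imposed, so the
    optimal control is bang-bang with at most one switch.
    """
    if a_max <= 0:
        raise ValueError("a_max must be positive")

    # Signed offset from the switching curve x = -v|v| / (2 a_max).
    s = x + v * abs(v) / (2.0 * a_max)

    if s >= 0:
        # Apply -a_max first, then +a_max until the robot stops at the origin.
        root = math.sqrt(max(0.0, 0.5 * v * v + a_max * x))
        return (v + 2.0 * root) / a_max

    # Mirror case: apply +a_max first, then -a_max.
    root = math.sqrt(max(0.0, 0.5 * v * v - a_max * x))
    return (-v + 2.0 * root) / a_max
```

For example, a robot at rest at `x = 1` with `a_max = 1` accelerates toward the origin for one time unit and brakes for one more, so `min_stop_time(1.0, 0.0, 1.0)` returns `2.0`. Adding a speed cap would introduce a cruising phase at maximum speed, which this sketch does not model.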
