Finding the Limits of Machine Learning in Optimization

08 November 2022, Version 2

Abstract

The initial position and velocity of a robot is given, and the problem posed is to make it stop at the origin in the shortest possible time, given a maximum acceleration and speed. The robot can control its acceleration vector, and hence the full optimization problem can be specified as a Hamiltonian system where the solution will minimize the transit time. This problem is discussed in both the one- and two-dimensional cases. The key control parameter is the acceleration direction; reducing the problem to a one-dimensional optimization opens up several areas of exploration. The direction can be optimized using a global search algorithm, or can be updated periodically using a local search algorithm with a penalty function. Numerical solutions are presented in these cases, including when physical obstacles are included in the penalty function. The one-dimensional optimization also allows the use of reinforced learning to minimize the transit time.

Content

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.