Published online by Cambridge University Press: 05 August 2016
Having considered the components of an actor and their relation to the actor's environment, we now need to develop some representational and algorithmic tools for performing the actor's deliberation functions. In this chapter, we develop a simple kind of descriptive model for use in planning, describe some planning algorithms that can use this kind of model, and discuss some ways for actors to use those algorithms.
This chapter is organized as follows. Section 2.1 develops state-variable representations of planning domains. Sections 2.2 and 2.3 describe forward-search planning algorithms, and heuristics to guide them. Sections 2.4 and 2.5 describe backward-search and plan-space planning algorithms. Section 2.6 describes some ways for an actor to use online planning. Sections 2.7 and 2.8 contain the discussion and historical remarks, and the student exercises.
STATE-VARIABLE REPRESENTATION
The descriptive models used by planning systems are often called planning domains. However, it is important to keep in mind that a planning domain is not an a priori definition of the actor and its environment. Rather, it is necessarily an imperfect approximation that must incorporate trade-offs among several competing criteria: accuracy, computational performance, and understandability to users.
State-Transition Systems
In this chapter, we use a simple planning-domain formalism that is similar to a finitestate automaton:
Definition 2.1. A state-transition system (also called a classical planning domain) is a triple Σ= (S,A, γ) or 4-tuple Σ = (S,A, γ, cost), where
• S is a finite set of states in which the system may be.
• A is a finite set of actions that the actor may perform.
• γ : S × A → S is a partial function called the prediction function or state transition function. If (s, a) is in γ 's domain (i.e., γ (s, a) is defined), then a is applicable in s, with γ (s, a) being the predicted outcome.Otherwise a is inapplicable in s.
• cost : S × A → [0,∞) is a partial function having the same domain as γ. Although we call it the cost function, its meaning is arbitrary: it may represent monetary cost, time, or something else that one might want tominimize. If the cost function isn't given explicitly (i.e., if Σ = (S,A, γ)), then cost(s, a) = 1 whenever γ (s, a) is defined.
To save this book to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Find out more about the Kindle Personal Document Service.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.