Deliberation with Deterministic Models

Malik Ghallab; Dana Nau; Paolo Traverso

doi:10.1017/CBO9781139583923.004

2 - Deliberation with Deterministic Models

Published online by Cambridge University Press: 05 August 2016

Malik Ghallab ,

Dana Nau and

Paolo Traverso

Show author details

Malik Ghallab: Affiliation:
Centre National de la Recherche Scientifique (CNRS), Paris
Dana Nau: Affiliation:
University of Maryland, College Park
Paolo Traverso: Affiliation:
FBK ICT – IRST (Center for Scientific and Technological Research), Italy

Book contents

Get access

Summary

Having considered the components of an actor and their relation to the actor's environment, we now need to develop some representational and algorithmic tools for performing the actor's deliberation functions. In this chapter, we develop a simple kind of descriptive model for use in planning, describe some planning algorithms that can use this kind of model, and discuss some ways for actors to use those algorithms.

This chapter is organized as follows. Section 2.1 develops state-variable representations of planning domains. Sections 2.2 and 2.3 describe forward-search planning algorithms, and heuristics to guide them. Sections 2.4 and 2.5 describe backward-search and plan-space planning algorithms. Section 2.6 describes some ways for an actor to use online planning. Sections 2.7 and 2.8 contain the discussion and historical remarks, and the student exercises.

STATE-VARIABLE REPRESENTATION

The descriptive models used by planning systems are often called planning domains. However, it is important to keep in mind that a planning domain is not an a priori definition of the actor and its environment. Rather, it is necessarily an imperfect approximation that must incorporate trade-offs among several competing criteria: accuracy, computational performance, and understandability to users.

State-Transition Systems

In this chapter, we use a simple planning-domain formalism that is similar to a finitestate automaton:

Definition 2.1. A state-transition system (also called a classical planning domain) is a triple Σ= (S,A, γ) or 4-tuple Σ = (S,A, γ, cost), where

• S is a finite set of states in which the system may be.
• A is a finite set of actions that the actor may perform.
• γ : S × A → S is a partial function called the prediction function or state transition function. If (s, a) is in γ 's domain (i.e., γ (s, a) is defined), then a is applicable in s, with γ (s, a) being the predicted outcome.Otherwise a is inapplicable in s.
• cost : S × A → [0,∞) is a partial function having the same domain as γ. Although we call it the cost function, its meaning is arbitrary: it may represent monetary cost, time, or something else that one might want tominimize. If the cost function isn't given explicitly (i.e., if Σ = (S,A, γ)), then cost(s, a) = 1 whenever γ (s, a) is defined.

Information

Type: Chapter
Information: Automated Planning and Acting , pp. 19 - 73

DOI: https://doi.org/10.1017/CBO9781139583923.004 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2016

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book purchase

Temporarily unavailable

Accessibility standard: Unknown

Why this information is here

This section outlines the accessibility features of this content - including support for screen readers, full keyboard navigation and high-contrast display options. This may not be relevant for you.

Accessibility Information

Accessibility compliance for the PDF of this book is currently unknown and may be updated in the future.