Speech Technologies on the Way to a Natural User Interface

Jeff Zhuk

doi:10.1017/CBO9780511547058.015

Chapter 12 - Speech Technologies on the Way to a Natural User Interface

Published online by Cambridge University Press: 17 August 2009

Jeff Zhuk

Show author details

Jeff Zhuk: Affiliation:
Internet Technology School, Inc.

Book contents

Get access

Summary

Know how to listen, and you will profit even from those who talk badly.

—Plutarch

This chapter is about speech technologies and related APIs: VoiceXML, SALT, Java Speech API, and MS Speech SDK. It looks into unified scenarios with audio/video interface (AVI) definitions, considers design and code examples, and introduces important skills for a new world of wired and wireless speech applications.

WHAT IS A NATURAL USER INTERFACE?

Is a natural user interface (NUI) another set of tags and rules covered by a nice name? Absolutely not!

This time, end users—not a standards committee—make the determination on what they prefer for their methods of interaction. An NUI allows end users to give their preferences at the time of the service request, and to change them flexibly.

Are you a “computer” person?

My guess is that you are, because you are reading this book. “Computer literate” folks like you and me enjoy exploring the capacities of computer programs via traditional interfaces. Even so, there are times, such as when we are on vacation, on the go, and in the car, when we prefer “hands-free” conversation rather than using keyboards to access computerized services.

One person prefers handwriting, and someone else is comfortable with typing. One would like to forget keywords and use commonsense terminology instead. Can a computer understand that “find” is the same as “search,” and “Bob” is actually “Robert”? Can it understand that someone has chosen a foreign (non-English) language to interact with it?

Type: Chapter
Information: Integration-Ready Architecture and Design
Software Engineering with XML, Java, .NET, Wireless, Speech, and Knowledge Technologies
, pp. 338 - 386

DOI: https://doi.org/10.1017/CBO9780511547058.015 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2004

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

Chapter 12 - Speech Technologies on the Way to a Natural User Interface

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive