Structures for indexes

Maxime Crochemore; Christophe Hancart; Thierry Lecroq

doi:10.1017/CBO9780511546853.006

5 - Structures for indexes

Published online by Cambridge University Press: 03 October 2009

Maxime Crochemore ,

Christophe Hancart and

Thierry Lecroq

Show author details

Thierry Lecroq: Affiliation:
Université de Rouen

Book contents

Get access

Summary

In this chapter, we present data structures for storing the suffixes of a text. These structures are conceived for providing a direct and fast access to the factors of the text. They allow to work on the factors of the string in almost the same way as the suffix array of Chapter 4 does, but the more important part of the technique is put on the structuring of data rather than on algorithms to search the text.

The main application of these techniques is to provide the basis of an index implementation as described in Chapter 6. The direct access to the factors of a string allows a large number of other applications. In particular, the structures can be used for matching patterns by considering them as search machines (see Chapter 6).

Two types of objects are considered in this chapter, trees and automata, together with their compact versions. Trees have for effect to factorize the prefixes of the strings in the set. Automata additionally factorize their common suffixes. The structures are presented in decreasing order of size.

The representation of the suffixes of a string by a trie (Section 5.1) has the advantage to be simple but can lead to a quadratic memory space according to the length of the considered string.

Type: Chapter
Information: Algorithms on Strings , pp. 177 - 218

DOI: https://doi.org/10.1017/CBO9780511546853.006 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2007

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

5 - Structures for indexes

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive