Ranking, clustering and data visualisation

John Shawe-Taylor; Nello Cristianini

doi:10.1017/CBO9780511809682.009

8 - Ranking, clustering and data visualisation

from Part II - Pattern analysis algorithms

Published online by Cambridge University Press: 29 March 2011

John Shawe-Taylor and

Nello Cristianini

Show author details

John Shawe-Taylor: Affiliation:
University of Southampton
Nello Cristianini: Affiliation:
University of California, Davis

Book contents

Get access

Summary

In this chapter we conclude our presentation of kernel-based pattern analysis algorithms by discussing three further common tasks in data analysis: ranking, clustering and data visualisation.

Ranking is the problem of learning a ranking function from a training set of ranked data. The number of ranks need not be specified though typically the training data comes with a relative ordering specified by assignment to one of an ordered sequence of labels.

Clustering is perhaps the most important and widely used method of unsupervised learning: it is the problem of identifying groupings of similar points that are relatively ‘isolated’ from each other, or in other words to partition the data into dissimilar groups of similar items. The number of such clusters may not be specified a priori. As exact solutions are often computationally hard to find, effective approximations via relaxation procedures need to be sought.

Data visualisation is often overlooked in pattern analysis and machine learning textbooks, despite being very popular in the data mining literature. It is a crucial step in the process of data analysis, enabling an understanding of the relations that exist within the data by displaying them in such a way that the discovered patterns are emphasised. These methods will allow us to visualise the data in the kernel-defined feature space, something very valuable for the kernel selection process. Technically it reduces to finding low-dimensional embeddings of the data that approximately retain the relevant information.

Information

Type: Chapter
Information: Kernel Methods for Pattern Analysis , pp. 252 - 288

DOI: https://doi.org/10.1017/CBO9780511809682.009 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2004

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book purchase

Temporarily unavailable

Accessibility standard: Unknown

Accessibility compliance for the PDF of this book is currently unknown and may be updated in the future.