4 - Reconciling

Published online by Cambridge University Press: 10 September 2022

Summary

Learning outcomes of this chapter

  • Why controlled vocabularies are important for linked data

  • Being able to compare the lightweight linked data approach with the full-fledged semantic web

  • Understanding the role of SKOS

  • Learning to select the most suitable vocabularies to leverage your metadata

  • Case study: reconciling the metadata of the Powerhouse Museum

Introduction

‘Controlled vocabularies are like underwear. Everyone thinks they are a good idea but no one wants to use someone else’s.’ So goes a classic joke within library and information science (LIS) circles. As this chapter will demonstrate, there are nonetheless important reasons to share and re-use vocabularies.

Thesauri, taxonomies, classification schemes and other manifestations of controlled vocabularies constitute the very core of the LIS profession. The creation of library classifications such as the Dewey Decimal Classification (DDC) or the Universal Decimal Classification (UDC) represents in many ways the intellectual birth of the LIS discipline more than a century ago. Creating a controlled vocabulary and using it to describe and give access to collections has been a central activity for librarians, archivists and curatorial staff.

Throwing money into a black hole?

The creation and use of a controlled vocabulary is expensive, as both the development of a vocabulary and its subsequent use for indexing have to be performed by domain experts. For decades, computer science has tried to automate both processes. On the whole, these attempts have not been terribly successful. Today, data mining and natural language processing (NLP) techniques can certainly be used to speed up the collection of potential terms for the construction of a thesaurus. In a second stage, the same techniques can be used to analyse which terms from the thesaurus may be used to describe a document. In practice, both procedures need to be supervised by a domain expert, and are in that sense semi-automated methods.
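
The two semi-automated stages described above can be illustrated with a minimal sketch. It is not the method used in the chapter: the function names, the tiny stopword list and the sample records are all hypothetical, and simple word counting stands in for real data mining or NLP techniques. The output of both stages is only a list of suggestions for a domain expert to review.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "for", "with", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def candidate_terms(documents, min_count=2):
    """Stage 1: propose frequent non-stopword tokens as candidate thesaurus terms."""
    counts = Counter(
        token
        for doc in documents
        for token in tokenize(doc)
        if token not in STOPWORDS
    )
    return [term for term, n in counts.most_common() if n >= min_count]


def suggest_index_terms(document, thesaurus):
    """Stage 2: suggest thesaurus terms that literally occur in the document."""
    tokens = set(tokenize(document))
    return sorted(term for term in thesaurus if term in tokens)


# Invented sample records, for illustration only.
records = [
    "A steam engine built in Sydney",
    "Model of a steam locomotive",
    "Ceramic vase with floral decoration",
]

candidates = candidate_terms(records)
suggestions = suggest_index_terms(records[1], {"steam", "locomotive", "vase"})
```

Neither list is final: an expert would still have to decide whether a candidate belongs in the thesaurus and whether a suggested term really describes the record, which is what makes the process semi-automated rather than automatic.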

The arrival of the web drastically affected views on the use of controlled vocabularies. Even their most ardent advocates from the LIS domain realize that maintaining and applying a controlled vocabulary on the scale of the web is a utopian idea. Moreover, the success of Google’s indexing services, based on a full-text search of unstructured HTML pages, called traditional indexing practices into question.

Type: Chapter
Information: Linked Data for Libraries, Archives and Museums: How to Clean, Link and Publish your Metadata, pp. 109-158
Publisher: Facet
Print publication year: 2015
