Book contents
- Frontmatter
- Content
- Acknowledgements
- 1 Introduction
- 2 What is a thesaurus?
- 3 Tools for subject access and retrieval
- 4 What a thesaurus is used for
- 5 Why use a thesaurus?
- 6 Types of thesaurus
- 7 The format of a thesaurus
- 8 Building a thesaurus 1: vocabulary collection
- 9 Vocabulary control 1: selection of terms
- 10 Vocabulary control 2: form of entry
- 11 Building a thesaurus 2: term extraction from document titles
- 12 Building a thesaurus 3: vocabulary analysis
- 13 The thesaural relationships
- 14 Building a thesaurus 4: introducing internal structure
- 15 Building a thesaurus 5: imposing hierarchy
- 16 Building a thesaurus 6: compound subjects and citation order
- 17 Building a thesaurus 7: conversion of the taxonomy to alphabetical format
- 18 Building a thesaurus 8: creating the thesaurus records
- 19 Managing and maintaining the thesaurus: thesaurus software
- 20 Conclusion
- Glossary
- Bibliography
- Appendix 1 Sample titles for thesaurus vocabulary
- Appendix 2 Sample terms for the thesaurus
- Appendix 3 Facets at stage 1 of analysis
- Appendix 4 Facets at stage 2 of analysis
- Appendix 5 Completed systematic display
- Appendix 6 Thesaurus entries for sample page
- Index
9 - Vocabulary control 1: selection of terms
Published online by Cambridge University Press: 09 June 2018
- Frontmatter
- Content
- Acknowledgements
- 1 Introduction
- 2 What is a thesaurus?
- 3 Tools for subject access and retrieval
- 4 What a thesaurus is used for
- 5 Why use a thesaurus?
- 6 Types of thesaurus
- 7 The format of a thesaurus
- 8 Building a thesaurus 1: vocabulary collection
- 9 Vocabulary control 1: selection of terms
- 10 Vocabulary control 2: form of entry
- 11 Building a thesaurus 2: term extraction from document titles
- 12 Building a thesaurus 3: vocabulary analysis
- 13 The thesaural relationships
- 14 Building a thesaurus 4: introducing internal structure
- 15 Building a thesaurus 5: imposing hierarchy
- 16 Building a thesaurus 6: compound subjects and citation order
- 17 Building a thesaurus 7: conversion of the taxonomy to alphabetical format
- 18 Building a thesaurus 8: creating the thesaurus records
- 19 Managing and maintaining the thesaurus: thesaurus software
- 20 Conclusion
- Glossary
- Bibliography
- Appendix 1 Sample titles for thesaurus vocabulary
- Appendix 2 Sample terms for the thesaurus
- Appendix 3 Facets at stage 1 of analysis
- Appendix 4 Facets at stage 2 of analysis
- Appendix 5 Completed systematic display
- Appendix 6 Thesaurus entries for sample page
- Index
Summary
In the preliminary stages in the construction of the thesaurus you have begun to assemble your vocabulary and you will have a number of terms. If you have followed the methods of vocabulary collection described, these have come from a number of different sources, and the style and form of the terms may vary quite a lot. The vocabulary is only at the first stage of being properly managed, and you should start to work towards a fuller level of standardization.
Creating a list of standard terms (that is, a vocabulary) for indexing requires the compiler to do several things:
• reduce the number of terms overall, by abandoning natural language in favour of a limited set of terms
• when terms are similar in meaning, decide on which of them should be used
• when terms have different spellings, decide on which should be used
• decide on the exact form of the entry term, for example whether it should be plural or singular, and how punctuation should be employed
• decide about the handling of compound terms, for example when and how words should be combined to make multi-word terms and phrases, and word order.
This process is even more important when any degree of automation is involved, since the search software will demand (nearly) exact matching between indexing and search terms if retrieval is to occur at all. Although the development of fuzzy searching systems allows some variation in terms (such as a mistyped letter), the form of terms still needs to be very close to the original to get a result.
All of the above actions together constitute what is known as vocabulary control. You will see that there are two main aspects of this process:
• the selection of terms
• deciding on the exact form of terms.
The selection of terms
Synonym control: preferred and non-preferred terms
Synonymy, that is the occurrence of a number of different words for the same idea, creates considerable problems for the online searcher. In a system such as a classification scheme, where notational codes are used to represent concepts, the existence of different names for the same idea doesn’ present a problem, since the notation equates to as many words as necessary and acts as a kind of shorthand for the concept.
- Type
- Chapter
- Information
- Essential Thesaurus Construction , pp. 70 - 80Publisher: FacetPrint publication year: 2006