Book contents
- Designing and Evaluating Language Corpora
- Designing and Evaluating Language Corpora
- Copyright page
- Contents
- Figures
- Tables
- Acknowledgments
- 1 Introduction
- 2 Approaches to Representativeness in Previous Corpus Linguistic Research
- 3 Corpus Representativeness
- 4 Domain Considerations
- 5 Distribution Considerations
- 6 The Influence of Domain and Distribution Considerations on Corpus Representativeness
- 7 Corpus Design and Representativeness in Practice – With Daniel Keller
- Glossary
- Book part
- References
- Index
1 - Introduction
Published online by Cambridge University Press: 07 April 2022
- Designing and Evaluating Language Corpora
- Designing and Evaluating Language Corpora
- Copyright page
- Contents
- Figures
- Tables
- Acknowledgments
- 1 Introduction
- 2 Approaches to Representativeness in Previous Corpus Linguistic Research
- 3 Corpus Representativeness
- 4 Domain Considerations
- 5 Distribution Considerations
- 6 The Influence of Domain and Distribution Considerations on Corpus Representativeness
- 7 Corpus Design and Representativeness in Practice – With Daniel Keller
- Glossary
- Book part
- References
- Index
Summary
We show that empirical corpus-based research is prevalent across subdisciplines of (applied) linguistics, not just in “corpus linguistics” journals. We define a corpus as a large, principled sample of texts designed to represent a target domain of language use. Corpus representativeness is conceptualized as the extent to which a corpus permits accurate and meaningful generalizations about linguistic patterns that are typical in a domain. Corpus representativeness involves two main considerations, which are both relative to the linguistic research goal of interest: domain considerations (adequate representation of the text varieties in the domain), and distribution considerations (adequate representation of the distribution of linguistic features in the domain).
- Type
- Chapter
- Information
- Designing and Evaluating Language CorporaA Practical Framework for Corpus Representativeness, pp. 1 - 27Publisher: Cambridge University PressPrint publication year: 2022