Hostname: page-component-cd9895bd7-dk4vv Total loading time: 0 Render date: 2025-01-05T22:14:44.179Z Has data issue: false hasContentIssue false

Sample Size Determination Within the Scope of Conditional Maximum Likelihood Estimation with Special Focus on Testing the Rasch Model

Published online by Cambridge University Press:  01 January 2025

Clemens Draxler*
Affiliation:
The Health and Life Sciences University
Rainer W. Alexandrowicz
Affiliation:
University of Klagenfurt, Psychology Institute
*
Correspondence should be made to Clemens Draxler, The Health and Life Sciences University, EWZ 1, 6060 Hall, Austria. Email: clemens.draxler@umit.at

Abstract

This paper refers to the exponential family of probability distributions and the conditional maximum likelihood (CML) theory. It is concerned with the determination of the sample size for three groups of tests of linear hypotheses, known as the fundamental trinity of Wald, score, and likelihood ratio tests. The main practical purpose refers to the special case of tests of the class of Rasch models. The theoretical background is discussed and the formal framework for sample size calculations is provided, given a predetermined deviation from the model to be tested and the probabilities of the errors of the first and second kinds.

Type
Original Paper
Copyright
Copyright © 2015 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Agresti, A. (2002). Categorical data analysis (2nd ed.). New York: Wiley.CrossRefGoogle Scholar
Aitchison, J., & Silvey, S.D. (1958). Maximum likelihood estimation of parameters subject to restraints. The Annals of Mathematical Statistics, 29, 813828.CrossRefGoogle Scholar
Andersen, E.B. (1970). Asymptotic properties of conditional maximum likelihood estimators. Journal of the Royal Statistical Society, Series B, 32, 283301.CrossRefGoogle Scholar
Andersen, E.B. (1973). A goodness of fit test for the Rasch model. Psychometrika, 38, 123140.CrossRefGoogle Scholar
Andersen, E.B. (1977). Sufficient statistics and latent trait models. Psychometrika, 42, 6981.CrossRefGoogle Scholar
Andersen, E.B. (1980). Discrete statistical models with social science applications. Amsterdam: North-Holland.Google Scholar
Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561573.CrossRefGoogle Scholar
Bahadur, R.R. (1954). Sufficiency and statistical decision functions. Annals of the Institute of Statistical Mathematics, 25, 423462.CrossRefGoogle Scholar
Barndorff-Nielsen, O. (1978). Information and exponential families in statistical theory. New York: Wiley.Google Scholar
Cohen, J. (1988). Statistical power analyses for the behavioral sciences. New York: Erlbaum.Google Scholar
Davidson, R. R., & Lever, E. L. (1967). The limiting distribution of the likelihood ratio statistic under a class of local alternatives. Florida State University Statistics Report M126, Tallahassee.Google Scholar
Diamond, E.L. (1963). The limiting power of categorical data chi-square tests analogous to normal analysis of variance. Annals of Mathematical Statistics, 34, 14321441.CrossRefGoogle Scholar
Draxler, C. (2010). Sample size determination for Rasch model tests. Psychometrika, 75, 708724.CrossRefGoogle Scholar
Dynkin, E.B. (1951). Necessary and sufficient statistics for a family of probability distributions. Uspekhi Matematicheskikh Nauk, 6, 6890.Google Scholar
Feder, P.I. (1968). On the distribution of the log likelihood ratio test statistic when the true parameter is near the boundaries of the hypothesis regions. The Annals of Mathematical Statistics, 39, 20442055.CrossRefGoogle Scholar
Fischer, G.H. (1981). On the existence and uniqueness of maximum-likelihood estimates in the Rasch model. Psychometrika, 46, 5977.CrossRefGoogle Scholar
Fischer, G.H., & Molenaar, I.W. (1995). Rasch models-foundations, recent developments and applications. New York: Springer.Google Scholar
Fleiss, J.L. (1981). Statistical methods for rates and proportions (2nd ed.). New York: Wiley.Google Scholar
Gaffke, N., Steyer, R., & von Davier, A.A. (1999). On the asymptotic null-distribution of the Wald statistic at singular parameter points. Statistics & Decisions, 17, 339358.Google Scholar
Glas, C.A.W. (1988). The derivation of some tests for the Rasch model from the multinomial distribution. Psychometrika, 53, 525546.CrossRefGoogle Scholar
Glas, C.A.W., & Verhelst, N.D. (1989). Extensions of the partial credit model. Psychometrika, 54, 635659.CrossRefGoogle Scholar
Glas, C.A.W., & Verhelst, N.D. (1995). Testing the Rasch model. In Fischer, G.H., & Molenaar, I.W. (Eds.), Rasch models-foundations, recent developments and applications (pp. 6995). New York: Springer.Google Scholar
Glas, C.A.W., & Verhelst, N.D. (1995). Tests of fit for polytomous Rasch models. In Fischer, G.H., & Molenaar, I.W. (Eds.), Rasch models- foundations, recent developments and applications (pp. 325352). New York: Springer.Google Scholar
Glas, C.A.W. (2006). Testing generalized Rasch models. In von Davier, M., & Carstensen, C.H. (Eds.), Multivariate and mixture distribution rasch models: Extensions and applications (pp. 3756). New York: Springer.Google Scholar
Haberman, S.J. (1974). The analysis of frequency data. Chicago: University of Chicago Press.Google Scholar
Haberman, S.J. (1981). Tests for independence in two-way contingency tables based on canonical correlation and on linear-by-linear interaction. The Annals of Statistics, 9, 11781186.CrossRefGoogle Scholar
Kelderman, H. (1984). Log linear Rasch model tests. Psychometrika, 49, 223245.CrossRefGoogle Scholar
Kelderman, H. (1989). Item bias detection using log linear IRT. Psychometrika, 54, 681697.CrossRefGoogle Scholar
Martin- Löf, P. (1973). Statistiska Modeller. (Statistical models. Notes from seminars 1969–1970 by Rolf Sundberg.) Stockholm.Google Scholar
Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149174.CrossRefGoogle Scholar
Maydeu-Olivares, A., & Montano, R. (2013). How should we assess the fit of Rasch-type models? Approximating the power of goodness-of-fit statistics in categorical data analysis. Psychometrika, 78, 116133.CrossRefGoogle ScholarPubMed
Mitra, S.K. (1958). On the limiting power function of the frequency chi-square test. Annals of Statistics, 29, 12211233.CrossRefGoogle Scholar
Müller, H. (1987). A Rasch model for continuous ratings. Psychometrika, 52, 165181.CrossRefGoogle Scholar
Neyman, J., & Pearson, E. S. (1928). On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika, 20 A, 263–294.Google Scholar
Neyman, J., & Scott, E.L. (1948). Consistent estimates based on partially consistent observations. Econometrica, 16, 132.CrossRefGoogle Scholar
Pearson, K. (1900). On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine Series, 5(50), 157175.CrossRefGoogle Scholar
Pfanzagl, J. (1993). On the consistency of conditional maximum likelihood estimators. Annals of the Institute of Statistical Mathematics, 45, 703719.CrossRefGoogle Scholar
Rao, C.R. (1948). Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Proceedings of the Cambridge Philosophical Society, 44, 5057.Google Scholar
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: The Danish Institute of Education Research (Expanded Edition, 1980. Chicago: University of Chicago Press).Google Scholar
Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Berkeley: University of California Press.Google Scholar
Satorra, A., & Saris, W.E. (1985). The power of the likelihood ratio test in covariance structure analysis. Psychometrika, 50, 8390.CrossRefGoogle Scholar
Silvey, S.D. (1959). The Lagrangian multiplier test. The Annals of Mathematical Statistics, 30, 389407.CrossRefGoogle Scholar
Stroud, T.W.F. (1972). Fixed alternatives and Wald’s formulation of the noncentral asymptotic behavior of the likelihood ratio statistic. The Annals of Mathematical Statistics, 43, 447454.CrossRefGoogle Scholar
van den Wollenberg, A. (1982). Two new test statistics for the Rasch model. Psychometrika, 47, 123140.CrossRefGoogle Scholar
Verhelst, N.D., & Glas, A.W. (1995). The one parameter logistic model. In Fischer, G.H., & Molenaar, I.W. (Eds.), Rasch models- foundations, recent developments and applications (pp. 215237). New York: Springer.Google Scholar
Verhelst, N.D., Glas, C.A.W., & Verstralen, HHFM (1994). OPLM: Computer program and manual. Arnhem: CITO.Google Scholar
von Davier, A. A. (2003). Large sample tests for comparing regression coefficients in models with normally distributed variables. Research Report RR-03-29. Princeton, NJ: Educational Testing Service.Google Scholar
Wald, A. (1943). Test of statistical hypotheses concerning several parameters when the number of observations is large. Transactions of the American Mathematical Society, 54, 426482.CrossRefGoogle Scholar
Wilks, S.S. (1938). The large sample distribution of the likelihood ratio for testing composite hypotheses. The Annals of Mathematical Statistics, 9, 6062.CrossRefGoogle Scholar
Wilson, M., & Masters, G.N. (1993). The partial credit model and null categories. Psychometrika, 58, 8799.CrossRefGoogle Scholar