Title

九種古典測驗理論信度指標精確性之研究

Parallel Title

A Comparison of Precision of Nine Reliability Estimates Based on Classical Test Theory

Authors

蔡佩圜(Pei-Huan Tsai);凃柏原(Bor-yaun Twu);吳裕益(Yuh-Yih Wu)

Keywords

greatest lower bound to reliability (最大信度估計下限) ; confirmatory factor analysis (驗證性因素分析)

Journal

測驗學刊

Volume and Issue / Publication Date

Vol. 65, No. 2 (2018/06/01)

Pages

217 - 240

Language

Traditional Chinese

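Chinese Abstract (English translation)

This study used confirmatory factor analysis models with a known factor structure to generate simulated data. It examined how three independent variables (number of test factors, number of items, and sample size) affect three dependent variables (bias, absolute bias, and root mean squared error) for nine reliability estimation methods, including ρ_(glb), λ_1, λ_2, λ_3(α), λ_4, λ_5, ω_h, and ω_t, in order to evaluate the precision of the different reliability indices. The results show that: (1) λ_3(α), the most commonly used traditional reliability estimate, is suitable only for unidimensional tests; for multifactor tests it clearly underestimates the true reliability. (2) λ_4 and ω_t have very small estimation error under all conditions, and these two estimates are recommended whenever possible. When the factor structure of the test data is clear, ω_t is the most suitable estimate of overall reliability; when the factor structure is unclear, λ_4 is the most suitable. (3) Unless population data are analyzed, ρ_(glb) tends to overestimate the true reliability, so it is not appropriate to call it the greatest lower bound to reliability. (4) The ratio of ω_h to ω_t is the proportion of variance explained by the g factor relative to the total variance explained by all common factors (g and all f); this ratio is recommended as an index of whether a test is close to unidimensional. The results can help test users choose more appropriate reliability indices for different testing conditions.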

English Abstract

This study evaluates the precision of different reliability indices, taking ρ_(glb), λ_1, λ_2, λ_3(α), λ_4, λ_5, ω_h, and ω_t as the indices under comparison. Simulated data were generated from confirmatory factor analysis (CFA) models. The independent variables were the number of test factors, the number of test items, and the sample size; the dependent variables were bias, absolute mean bias, and root mean squared error. The main findings are as follows: (1) λ_3(α), the most commonly used traditional estimate, is suitable only for unidimensional tests; for multifactor tests it significantly underestimates reliability. (2) ω_t and λ_4 give the best reliability estimates, with extremely small error, and are recommended wherever possible. When the factor structure of the test data is clear, ω_t is the most suitable estimate of overall reliability; when it is not clear, λ_4 is the appropriate choice. (3) Unless population data are analyzed, ρ_(glb) overestimates reliability, so it is not appropriate to call it the greatest lower bound to reliability. (4) The ratio of ω_h to ω_t is the ratio of the variance explained by the g factor to the total variance explained by all common factors (g and all f). This ratio is recommended as an indicator of whether a test is close to unidimensional. The results can help test users choose more appropriate reliability estimates for different testing scenarios.
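Finding (1) can be illustrated with a small simulation. The sketch below is not the authors' design: the loadings (0.7), the six standardized items, the orthogonal factors, the sample size, and the number of replications are all assumptions chosen for illustration, and every function name is hypothetical. It generates data from a factor model with known structure, estimates λ_3(α) and λ_2 from the sample covariance matrix, and summarizes bias, mean absolute bias, and root mean squared error against the model-implied reliability of the total score.

```python
import math
import random
import statistics


def population_cov(loadings):
    """Model-implied covariance Lambda Lambda' + Psi for standardized items
    with orthogonal factors (an assumption of this sketch)."""
    k = len(loadings)
    cov = [[sum(a * b for a, b in zip(loadings[i], loadings[j]))
            for j in range(k)] for i in range(k)]
    for i in range(k):
        cov[i][i] = 1.0  # communality plus unique variance equals 1
    return cov


def true_reliability(loadings):
    """Common-factor variance of the total score divided by its total variance."""
    n_factors = len(loadings[0])
    common = sum(sum(row[f] for row in loadings) ** 2 for f in range(n_factors))
    total = sum(sum(row) for row in population_cov(loadings))
    return common / total


def cronbach_alpha(cov):
    """Guttman's lambda_3 (coefficient alpha) from an item covariance matrix."""
    k = len(cov)
    total_var = sum(sum(row) for row in cov)
    item_var = sum(cov[i][i] for i in range(k))
    return k / (k - 1) * (1 - item_var / total_var)


def guttman_lambda2(cov):
    """Guttman's lambda_2 from an item covariance matrix."""
    k = len(cov)
    total_var = sum(sum(row) for row in cov)
    off = [cov[i][j] for i in range(k) for j in range(k) if i != j]
    return (sum(off) + math.sqrt(k / (k - 1) * sum(c * c for c in off))) / total_var


def simulate_scores(loadings, n, rng):
    """Draw n persons from the factor model with normal factors and errors."""
    uniq_sd = [math.sqrt(1.0 - sum(l * l for l in row)) for row in loadings]
    data = []
    for _ in range(n):
        f = [rng.gauss(0, 1) for _ in loadings[0]]
        data.append([sum(l * fj for l, fj in zip(row, f)) + sd * rng.gauss(0, 1)
                     for row, sd in zip(loadings, uniq_sd)])
    return data


def sample_cov(data):
    """Unbiased sample covariance matrix of the item scores."""
    n, k = len(data), len(data[0])
    means = [sum(row[i] for row in data) / n for i in range(k)]
    return [[sum((r[i] - means[i]) * (r[j] - means[j]) for r in data) / (n - 1)
             for j in range(k)] for i in range(k)]


def evaluate(estimator, loadings, n=200, reps=100, seed=1):
    """Return (bias, mean absolute bias, RMSE) of an estimator over replications."""
    rng = random.Random(seed)
    truth = true_reliability(loadings)
    errors = [estimator(sample_cov(simulate_scores(loadings, n, rng))) - truth
              for _ in range(reps)]
    bias = statistics.fmean(errors)
    abs_bias = statistics.fmean([abs(e) for e in errors])
    rmse = math.sqrt(statistics.fmean([e * e for e in errors]))
    return bias, abs_bias, rmse


if __name__ == "__main__":
    one_factor = [[0.7] for _ in range(6)]
    two_factor = [[0.7, 0.0]] * 3 + [[0.0, 0.7]] * 3
    for name, design in [("one factor", one_factor), ("two factors", two_factor)]:
        for est in (cronbach_alpha, guttman_lambda2):
            print(name, est.__name__, evaluate(est, design))
```

Under these assumed conditions, α matches the model-implied reliability at the population level in the one-factor design and falls below it in the two-factor design, which is the pattern the abstract reports for multifactor tests.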

Subject Classification
Social Sciences > Psychology
Social Sciences > Education
References
  1. Bentler, P. M. (2009). Alpha, dimension-free, and model-based internal consistency reliability. Psychometrika, 74(1), 137-143.
  2. Bentler, P. M., & Woodward, J. A. (1980). Inequalities among lower bounds to reliability: With applications to test construction and factor analysis. Psychometrika, 45, 249-267.
  3. Benton, T. (2013). An empirical assessment of Guttman's Lambda 4 reliability coefficient. Quantitative Psychology Research.
  4. Bollen, K. A. (1989). Structural equations with latent variables. New York, NY: John Wiley & Sons.
  5. Comrey, A. L., & Lee, H. B. (1992). A first course in factor analysis. Hillsdale, NJ: Lawrence Erlbaum Associates.
  6. Eisinga, R., & Grotenhuis, M. (2016). The reliability of a two-item scale: Pearson, Cronbach, or Spearman-Brown? International Journal of Public Health, 58(4), 637-642.
  7. Graham, J. M. (2006). Congeneric and (essentially) tau-equivalent estimates of score reliability: What they are and how to use them. Educational and Psychological Measurement, 66(6), 930-944.
  8. Green, S. B., & Hershberger, S. L. (2000). Correlated errors in true score models and their effect on coefficient alpha. Structural Equation Modeling, 7, 251-270.
  9. Green, S. B., & Yang, Y. (2009). Reliability of summed item scores using structural equation modeling: An alternative to coefficient alpha. Psychometrika, 74(1), 155-167.
  10. Green, S. B., & Yang, Y. (2009). Commentary on coefficient alpha: A cautionary tale. Psychometrika, 74(1), 121-135.
  11. Gronlund, N. E., & Linn, R. L. (1990). Measurement and evaluation in teaching. New York, NY: Macmillan.
  12. Gulliksen, H. (1950). Theory of mental tests. Hillsdale, NJ: Lawrence Erlbaum Associates.
  13. Guttman, L. (1945). A basis for analyzing test-retest reliability. Psychometrika, 10(4), 255-282.
  14. Hogan, T. P., Benjamin, A., & Brezinski, K. L. (2000). Reliability methods. Educational and Psychological Measurement, 60, 523-531.
  15. Jackson, P., & Agunwamba, C. (1977). Lower bounds for the reliability of the total score on a test composed of nonhomogeneous items: I: Algebraic lower bounds. Psychometrika, 42(4), 567-578.
  16. Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
  17. McDonald, R. P. (1999). Test theory: A unified treatment. Hillsdale, NJ: Lawrence Erlbaum Associates.
  18. McDonald, R. P. (1978). Generalizability in factorable domains: "domain validity and generalizability": 1. Educational and Psychological Measurement, 38(1), 75-79.
  19. Novick, M. R., & Lewis, C. (1967). Coefficient alpha and the reliability of composite measurements. Psychometrika, 32, 1-13.
  20. Osburn, H. G. (2000). Coefficient alpha and related internal consistency reliability coefficients. Psychological Methods, 5(3), 343-355.
  21. Raykov, T. (1997). Estimation of composite reliability for congeneric measures. Applied Psychological Measurement, 21, 173-184.
  22. Raykov, T. (2004). Estimation of maximal reliability: A note on a covariance structure modeling approach. British Journal of Mathematical and Statistical Psychology, 57, 21-27.
  23. Revelle, W. (1979). Hierarchical cluster-analysis and the internal structure of tests. Multivariate Behavioral Research, 14(1), 57-74.
  24. Revelle, W., & Zinbarg, R. E. (2009). Coefficients alpha, beta, omega, and the GLB: Comments on Sijtsma. Psychometrika, 74(1), 145-154.
  25. Sijtsma, K. (2009). On the use, the misuse, and the very limited usefulness of Cronbach's alpha. Psychometrika, 74(1), 107-120.
  26. Sijtsma, K. (2009). Reliability beyond theory and into practice. Psychometrika, 74(1), 169-173.
  27. Tabachnick, B. G., & Fidell, L. S. (2007). Using multivariate statistics. Boston, MA: Pearson.
  28. Tang, W., & Cui, Y. (2012). A simulation study for comparing three lower bounds to reliability. Retrieved from http://www.crame.ualberta.ca/docs/April 2012/AERA paper_2012.pdf
  29. Ten Berge, J. M. F., & Sočan, G. (2004). The greatest lower bound to the reliability of a test and the hypothesis of unidimensionality. Psychometrika, 69(4), 613-625.
  30. Yang, Y., & Green, S. B. (2014). Evaluation of structural equation modeling estimates of reliability for scales with ordered categorical items. Methodology, 11(1), 1-12.
  31. Zinbarg, R. E., Revelle, W., Yovel, I., & Li, W. (2005). Cronbach's α, Revelle's β, and McDonald's ω_H: Their relations with each other and two alternative conceptualizations of reliability. Psychometrika, 70(1), 123-133.
Cited By
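  1. 凃柏原 (Bor-yaun Twu), 蔡佩圜 (Pei-Huan Tsai), & 吳裕益 (Yuh-Yih Wu) (2020). 向度數、題數及樣本數分別與六種信度估計法估計誤差交互作用效果之探討 [An investigation of the interaction effects of number of dimensions, number of items, and sample size with the estimation error of six reliability estimation methods]. 教育學誌, 43, 67-104.
  2. (2020). Alpha 係數及相關的信度估計方法探討 [A discussion of coefficient alpha and related reliability estimation methods]. 教育研究學報, 54(1), 1-26.
  3. (2023). 大學生運動休閒情緒感染量表之發展 [Development of a sports and leisure emotional contagion scale for university students]. 高雄師大學報:教育與社會科學類, 55, 27-45.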