英文摘要
|
In resent years, for estimating students' higher abilities, the framework of assessment gradually turns into large-scale standardized assessment framework. Suitable model not only tells us the ability estimates wanted, and gets the better estimation result. By means of empirical study, the main purpose of the study is to compare if there is difference in mathematical ability estimation by HIRT (hierarchical item response theory), MIRT (multidimensional item response theory) and UIRT (unidimensional item response theory) and what their influences are as the reference of mathematical assessment model. The assessment on Decimal division is designed for six-grade students based on the mathematical assessment framework of NAEP.The reliability on the assessment is 0.79. The result is analyzed and compared by HIRT, MIRT and UIRT models. According to the model fit indexes (AIC, BIC and DIC), it shows that HIRT model is suitable to large-scale standardized assessment framework. In HIRT pattern, the coefficients of Decimal division, and conceptual understanding, procedural knowledge, problem solving inference regression are higher than 0.7, especially conceptual understanding influence the Decimal division. Therefore, the result of the empirical study confirms HIRT model can provide more information and has better estimation.
|
参考文献
|
-
王敏嫻(2010)。多向度試題反應理論之可能值方法對大型測驗中群體平均數估計之影響─以TASA2006數學科為例。測驗統計年刊,18(1),47-68。
連結:
-
Adams, R. J.,Wilson, M.,Wang, W. C.(1997).The multidimensional random coefficients multinomial logit model.Applied Psychological Measurement,21,1-23.
-
Congdon, P.(2003).Applied Bayesian modelling.New York:John Wiley.
-
Cowles, M. K.(2004).Review of WinBUGS 1.4.The American Statistician,58,330-336.
-
Hambleton, R. K.(Ed.),Zall, J.(Ed.)(1991).Advances in educational and psychological testing.Boston:Kluwer-Nijhoff.
-
Hoskens, M.,De Boeck, P(1997).A parameteric model for local dependence among test items.Psychological methods,2,261-277.
-
National Assessment Governing Board(2002).Mathematics framework for the 2003 national assessment of educational progress.National Assessment Governing Board U.S. Department of Education.
-
Qiu, Z.,Song, P. X.-K.,Tan, M.(2002).Bayesian hierarchical models for multi-level repeated ordinal data using WinBUGS.Journal of Biopharmaceutical Statistics,12,121-135.
-
Rasch, G.(1960).Probability models for some intelligence and attainment tests.Copenhagen Danmark:Danmark's Paedogogiske Institute for Educational Research.
-
Song, H.(2007).The State University of New Jersey.
-
Spiegelhalter, D.,Best, N.,Carlin, B.(1998).Technical reportTechnical report,University of Minnesota.
-
Sturtz, S.,Ligges, U.,Gelman, A.(2005).R2WinBUGS: A package for running WinBUGS from R.Journal of Statistical Software,12,1-16.
-
Wang, W.,Wilson, M.,Cheng, Y.(2000).Local Dependence between Latent Traits when Common Stimuli are Used.International Objective Measurement Workshop,New Orleans, LA:
-
Wilson, M.,Adams, R. J.(1995).Rasch models for item bundles.Psychometrika,60,181-198.
-
余民寧(1992)。試題反應理論的介紹(三)─試題反應模式及其特性。研習資訊,9(2),6-10。
-
吳昭容(2003)。「理解」與「計算」,有何兩難?。「你建構我運算,孩子會了什麼?」數學教育之趨勢研討會,台北:
-
吳昭容(1996)。國立台灣大學心理學研究所。
-
林佳樺(2009)。國立臺中教育大學教育測驗統計研究所。
-
邱美珍(2008)。國立交通大學運輸科技與管理學系。
-
張勝凱(2010)。國立臺中教育大學教育測驗統計研究所。
-
教育部(2003)。國民中小學九年一貫課程綱要數學學習領域。台北市:教育部。
-
陳永峰(1998)。國小六年級學童小數知識之研究。國民教育研究,3,337-373。
-
劉曼麗(2002)。國科會補助之專題計畫成果報告國科會補助之專題計畫成果報告,未出版
-
劉曼麗(2008)。小數除法的學與教。科學教育月刊,314,27-38。
-
劉曼麗(2004)。行政院國家科學委員會專題研究計畫成果報告行政院國家科學委員會專題研究計畫成果報告,未出版
-
賴文溥(2009)。國立臺中教育大學數學教育研究所。
-
謝典佑、林佳樺、郭伯臣、施淑娟(2009)。單因子高層次IRT模式適合度檢定之研究─以TASA數學科為例。「大型教育資料庫建置及相關議題」學術研討會
|