英文摘要
|
The concept of validity evidence has become diverse and multifaceted in recent years. The purpose of the present study is to examine the external validity of science assessment standard setting for 4th grade, which was implemented with the bookmark method in 2010. This study uses "contrasting group method", an examinee-centered method, to set performance standards. The participants were eight elementary school teachers and their 233 students. The instruments were classification sheet and a particular form of science test. Teachers were instructed to judge the performance of students based on the performance level descriptors and mark in the classification sheet with four levels (basic, basic, proficient, and advanced). In order to link the existing scale and teachers' grading, the particular form of science test was administered to students. Generalized partial credit model was applied to estimate the dichotomous and polytomous data. The results revealed that the minimum standards of basic level set by contrasting group method was lower than that of the bookmark method, while the standard of advanced level by contrasting group method was higher than that of the bookmark method. Besides, the general hit rate was 52.36%, while hit rates of the performance classifications were 26.32%, 57.14%, 58.00%, and 54.55%. In the conclusion, suggestions for further studies are provided.
|
参考文献
|
-
吳宜芳,鄒慧英,林娟如(2010)。標準設定效度驗證之探究:以大型數學學習成就評量為例。測驗學刊,57(1),1-27。
連結:
-
吳毓瑩,陳彥名,張郁雯,陳淑惠,何東憲,林俊吉(2009)。以常態混組模型討論書籤標準設定法對英語聽讀基本能力標準設定有效性之幅合證據。教育心理學報,41(1),69-90。
連結:
-
杜佳真,林世華(2007)。九年一貫課程數學領域能力指標「數與量」、「代數」主題軸第一、二階段表現標準適切性評估之研究。師大學報:教育類,52(1),63-85。
連結:
-
謝進昌,謝名娟,林世華,林陳涌,陳清溪,謝佩蓉(2011)。大型資料庫國小四年級自然科學習成就評量標準設定結果之效度評估。教育科學研究期刊,56(1),1-32。
連結:
-
Bontempo, B. D.,Marks, C. M.,Karabatsos, G.(1998).A meta-analytic assessment of empirical differences in standard setting procedures.annual meeting of the American Educational Research Association,San Diego, CA.:
-
Brandon, P. R.(2002).Two versions of the contrasting-groups standard-setting method: A review.Measurement and Evaluation in Counseling and Development,35(3),167-181.
-
Cizek, G. J.,Bunch, M. B.(2007).Standard setting: A guide to establishing and evaluating performance standards on tests.Thousand Oaks, CA:Sage.
-
Cunningham, G. K.(2005).Must high stakes mean low quality? Some testing program implementation issues.Defending standardized testing,Mahwah, NJ:
-
Educational Testing Service(2002).ETS standards for quality and fairness.Princeton, NJ:Author.
-
Haertel, E. H.(2002).Standard setting as a participatory process: Implications for validation of standards-based accountability programs.Educational Measurement: Issues and Practice,21(1),16-22.
-
Hansche, L. N.(1998).Handbook for the development of performance standards: Meeting the requirements of title I.Bethesda, MD:Frost Associate.
-
Kane, M.(1998).Choosing between examinee-centered and test-centered standard-setting methods.Educational Assessment,5(3),129-145.
-
Kane, M.(1994).Validating the performance standards associated with passing scores.Review of Educational Research,64(3),425-461.
-
Livingston S. A.,Zieky, M. J.(1982).Passing scores: A manual for setting standards of performance on educational and occupational tests.Princeton, NJ:Educational Testing Service.
-
Loomis, S. C.,Bourque, M. L.(2001).From tradition to innovation: Standard setting on the National Assessment of Educational Progress.Setting performance standards,Mahwah, NJ:
-
Muraki, E.(1992).A generalized partial credit model: Application of an EM algorithm.Applied Psychological Measurement,16(2),159-176.
-
Muraki, E.(1993).Information functions of the generalized partial credit model.Applied psychological measurement,17(4),351-363.
-
Näsström, G.,Nyström, P.(2008).A comparison of two different methods for setting performance standards for a test with constructed-response items.Practical Assessment, Research & Evaluation,13(9),1-12.
-
Nichols, P.,Twing, J.,Mueller, C. D.,O'Malley, K.(2010).Standard-setting methods as measurement processes.Educational Measurement: Issues and Practice,29(1),14-24.
-
Nijlen, D. V.,Janssen, R.(2008).Modeling judgments in the Angoff and contrasting-groups method of standard setting.Journal of Educational Measurement,45(1),45-63.
-
Sommers, S.(2012).The training and preparation of Angoff standard setting panelists: The role of group discussion and experience in determining panelist accuracy.1st International Conference on Standard-based Assessment,Taipei:
-
Tannenbaum, R. J.(2011).Setting standards on the Praxis Series Tests: A multistate approach.R&D Connections,17,1-9.
-
Tindal, G.(Ed.),Haladyna, T. M.(Ed.)(2002).Large-scale assessment programs for all students: Validity, technical adequacy, and implementation.Mahwah, NJ:Lawrence Erlbaum Associates.
-
Wolfe, E. W.,Smith, E. V.(2007).Instrument development tools and activities for measure validation using Rasch models: Part I - instrument development tools.Journal of Applied Measurement,8(1),97-123.
-
Wolfe, E. W.,Smith, E. V.(2007).Instrument development tools and activities for measure validation using Rasch models: Part II - validation activities.Journal of Applied Measurement,8(2),234-294.
-
吳宜芳,鄒慧英(2010)。試題呈現與回饋模式對Angoff標準設定結果一致性提升效益之比較研究。教育研究與發展期刊,6(4),47-80。
-
吳裕益(1988)。九種通過分數設定方法之比較研究。初等教育學報,1,47-120。
-
黃俊傑(2009)。「攜手計畫課後扶助」執行評析及建議。北縣教育,67,69-72。
|