Title

倒扣指導語對測驗品質的影響-以物理學科測驗爲例

Parallel Title

Effects of Formula Scoring and Number-Right Scoring Instructions on Test Quality: Taking a Multiple-Choice Physics Test as an Example

DOI

10.7108/PT.200812.0007

Authors

區雅倫(Ya-Lun Ou);張耿輔(Ken-Fu Chang);翁儷禎(Li-Jen Weng);程暐瀅(Wei-Ying Cheng)

Keywords

倒扣計分 ; 能力測驗 ; 猜答 ; 測驗品質 ; 測驗指導語 ; 學科測驗 ; ability test ; formula scoring ; guessing ; subject test ; test instructions ; test quality

Journal

測驗學刊 (Psychological Testing)

Volume, Issue / Publication Date

Vol. 55, No. 3 (2008/12/01)

Pages

591 - 610

Language

Traditional Chinese

Chinese Abstract

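Most assessment experts recommend number-right scoring (no penalty for wrong answers) for multiple-choice items, yet the Department Required Test, a major college entrance examination in Taiwan, uses formula scoring, which penalizes wrong answers. Few previous studies have examined formula scoring in subject tests, and Taiwan lacks evaluative literature on whether wrong answers should be penalized. This study therefore takes the test developer's perspective and examines how test instructions with or without a penalty affect test quality. Using a physics subject test as the instrument, 313 twelfth-grade students sampled in Taiwan took test booklets with identical items but different instructions. The analysis compared the effects of penalty and no-penalty instructions on the number of correct answers, the internal-consistency reliability of test scores, item difficulty, item discrimination, and the number of omitted items. The results show that the instructions affected item discrimination and the number of omitted items, but not the number of correct answers, the internal-consistency reliability of test scores, or item difficulty. If the main purpose of the test is to distinguish examinees of higher and lower subject ability, the results suggest that penalizing wrong answers is a viable approach; if increasing examinees' response pressure is undesirable, not penalizing may be more suitable. This conclusion may apply only to physics subject tests. Future studies can cover other subjects to assess how stable the conclusion is across subjects.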

English Abstract

The purpose of this study is to compare the effects of formula-scoring and number-right scoring instructions on test quality, using a multiple-choice physics test. Three hundred and thirteen 12th-grade students were administered the same test with different instructions. The results indicated that the number of omitted items and the item discrimination index might vary with the instructions given. If the main purpose of the test is to discriminate among students of different ability levels, formula-scoring instructions, as used by the Department Required Test for college entrance in Taiwan, are recommended. Future research is encouraged to evaluate the generalizability of the present findings to tests in other subjects.
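
The two scoring rules compared in the abstracts can be illustrated with a minimal sketch. It assumes the classical correction-for-guessing formula, score = R - W/(k - 1), where R is the number of right answers, W the number of wrong answers, and k the number of options per item. The record does not state the penalty actually used by the Department Required Test, so the formula, the function names, and the sample data below are illustrative assumptions only.

```python
from typing import Optional, Sequence

# An omitted item is represented as None (an assumption of this sketch).
Responses = Sequence[Optional[str]]


def number_right_score(responses: Responses, key: Sequence[str]) -> int:
    """Number-right scoring: count correct answers; wrong and omitted items score 0."""
    return sum(1 for r, k in zip(responses, key) if r is not None and r == k)


def omitted_count(responses: Responses) -> int:
    """Number of items the examinee left blank."""
    return sum(1 for r in responses if r is None)


def formula_score(responses: Responses, key: Sequence[str], n_options: int) -> float:
    """Classical formula scoring: R - W / (k - 1). Omitted items are not penalized."""
    right = number_right_score(responses, key)
    wrong = sum(1 for r, k in zip(responses, key) if r is not None and r != k)
    return right - wrong / (n_options - 1)


# Hypothetical five-item test with five options per item.
key = ["A", "C", "B", "D", "A"]
responses = ["A", "C", "D", None, "B"]  # 2 right, 2 wrong, 1 omitted

print(number_right_score(responses, key))   # 2
print(omitted_count(responses))             # 1
print(formula_score(responses, key, 5))     # 1.5
```

Under this rule an omitted item costs nothing while a wrong answer costs 1/(k - 1) of a point, which is why the instructions given to examinees could plausibly change how many items they omit.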

Subject Classification

Social Sciences > Psychology
Social Sciences > Education