英文摘要
|
This study aimed to develop an alternative scaled score method, based on Item Response Theory, to report the results of the student basic competence tests in a more reasonable way. The scaled score method designed by the present study is characterized not only by its theoretical basis but also by the ease of use and understanding.
This study employed the BILOG-MG software system with 3-ParmeterLogistic Model and Bayesian EAP method to estimate student ability value θ s. The value θ s were in turn transformed into scaled scores with a mean of 30 and standard deviation of 7.75. Different scaled scores were found among the students who had the same number of correct items on the tests. The range of the scaled score difference was related to the number of correct items on the tests. In general, in the cases that the students' numbers of correct items were between medium and slightly below medium, the differences of the corresponding scaled scores tended to span a wider range. The scaled scores also differed in subject areas. The widest range of the difference shown within each subject area is presented respectively as follows: Chinese with 10.37 points, English with 10.72 points, math with 11.30 points, social science with 13.00 points, and science with 11.61 points. Subsequently, totaling the scaled scores of the five subjects made the differences even greater among the students who had the same number of correct items, which has a profound influence on students' chance of entering high schools. It is therefore justified that the fairness of the scaled score system currently used should be taken into serious consideration. Finally, some suggestions were proposed on the basis of the discussions provided in this study for future applications.
|
参考文献
|
-
徐柏原(2005)。如何將原始分數轉換成量尺分數-以國中基本測驗為例。測驗學刊,52(2),1-28。
連結:
-
謝進昌、余民寧(2005)。以最大測驗訊息量決定通過分數之研究。測驗學刊,52(2),149-176。
連結:
-
Andersen, E. B.(1972).The numerical solution of a set of conditional estimation equations.The Journal of the Royal Statistical Society, Series B,34,42-54.
-
Birnbaum, A.,F. M. Lord,M. R. Novick(1968).Statistical theories of mental test scores.Reading MA:Addison-Wesley.
-
Bock, R. D.,Aitkin, M.(1981).Marginal maximum lidelihood estimation of item parameters: An application of an EM algorithm.Psychometrika,46,443-459.
-
Guion, R. M.,Ironson, G. H.(1983).Latent trait theory for organizational research.Organizational Behavior and Human Performance,31,54-87.
-
Hambleton, R. K.,Cook, L. L.(1977).Latent trait models and their use in the analysis of educational test data.Journal of Educational Measurement,14,75-96.
-
Hambleton, R. K.,R. L. Linn (Ed.)(1989).Educational measurement.New York:Macmillan.
-
Hambleton, R. K.,Swaminathan, H.(1985).Item response theory: Principles and applications.Boston, MA:Kluwer-Nijhoff.
-
Hambleton, R. K.,Swaminathan, H.,Rogers, H. J.(1991).Fundamentals of item response theory.Newbury Park, CA:Sage.
-
Hambleton, R. K.,Traub, R. E.(1973).Analysis of empirical data using two logistic latent trait modesl.British Journal of Mathematical and Statistical Psychology,26,195-211.
-
Lord, F. M.(1968).An analysis of the verbal Scholastic Aptitude Test using Birnbaum`s three-parameter logistic model.Educational and Psychological Measurement,28,989-1020.
-
Lord, F. M.(1980).Application of item response theory to practical testing problems.Hillsdale, NJ:Lawrence Erlbaum Associates.
-
Rasch, G.(1960).Probabilistic models for sonic intelligence and attainment tests.Copenhagen:Danish Institute for Educational Research.
-
Ross, J.(1966).An empirical study of a logistic mental test model.Psychoemtrika,31,325-340.
-
Traub, R. E.,R. K. Hambleton (Ed.)(1983).Applications of item response theory.Vancouver, BC:Educational Research Institute of British Columbia.
-
Weiss, D. J.(1982).Improving measurement quality and efficiency with adaptive testing.Applied Psychological Measurement,6,379-396.
-
Wright, B. D.,Stone, M. H.(1979).Best test design.Chicago:MESA.
-
Zimowski, M. F.,Muraki, E.,Mislevy, R. J.,Bock, R. D.(2003).BILOG-MG for Windows (version 3).Chicago, IL:Scientific Software International, Inc.
-
人本教育札記(2001)。基本學力測驗會不會太簡單,考不出程度。人本教育札記,144
-
王逸慧(2001)。基本學力測驗之時師:問題與展望。教育研究資訊,9(1),15-31。
-
何文慶(2001)。從基本學力測驗談起。師友,405,34-36。
-
何雅娟(2000)。多元入學多元思維-談台北市高中多元入學方案。教師天地,109,33-44。
-
余民寧、賴姿伶、劉育如(2005)。國中基本學力測驗實施成效之初步調查:學校的觀點。教育與心理研究,28(2),193-217。
-
余民寧、賴姿伶、劉育如(2004)。國中基本學力測驗實施成效之初步調查:學生的觀點。教育與心理研究,27(3),457-481。
-
余霖(2000)。對國民中學學生基本學力測驗之評論與期望。教師天地,109,30-32。
-
吳伯林、謝名娟(2001)。從國中基本學力測驗看九年一貫數學科教材教法。教育研究雙月刊,86,77-89。
-
宋曼雯(1998)。縮短城鄉教育資源差距的基本學力測驗。中央月刊,31(6),26-30。
-
李春生(1999)。以美國經驗為例淺談國民中學基本學力測驗。高中教育,6,18-27。
-
沈育美、陳惠珠、劉佳玲(2001)。九十學年度國中基本學力測驗-社會科中的歷史試題評析。歷史月刊,161,108-117。
-
沈玟君(2003)。碩士論文(碩士論文)。台南市,國立台南大學資訊教育研究所。
-
周麗玉(2001)。多元入學方案不等於基本學力測驗。師友,404,15-19。
-
林世華(2000)。跨世紀的測驗發展計畫-國民中學學生基本學力測驗發展研究。教師天地,109,4-8。
-
林英毅(2003)。碩士論文(碩士論文)。台南市,國立台南大學/資訊教育研究所。
-
林繼生(2000)。是開始也是結束-台北區公立高中八十九學年度聯招國文試題分析,兼論基本學力測驗。國文天地,16(4),60-64。
-
林繼生(2001)。新的開始,新的震撼-國中基本學力測驗第一次測驗國文科試題分析。國文天地,16(12),97-103。
-
飛揚期刊
-
飛揚期刊
-
量尺分數問與答:Q3、換算出來的結果大概會是什麼樣子?
-
九十三年國民中學學生基本學力測驗問與答-百勝篇
-
飛揚期刊
-
基本學力分數的建立
-
國中基本學力測驗分數的建立意義與使用
-
許綺婷(2002)。碩士論文(碩士論文)。國立台灣師範大學科學教育研究所。
-
楊慧美(2002)。碩士論文(碩士論文)。台北市,國立台灣師範大學科學教育研究所。
-
溫美玲(2003)。碩士論文(碩士論文)。桃園縣,中原大學應用數學研究所。
-
廖萬清(2002)。碩士論文(碩士論文)。南投縣,暨南國際大學教育政策與行政研究所。
-
蔡雅瑩(2004)。碩士論文(碩士論文)。台北市,國立政治大學統計研究所。
-
鄭圓鈴(2001)。國中國文科基本學力測驗試題分析。中國語文,88(5),527。
-
羅曉雯(2002)。碩士論文(碩士論文)。台北市,國立政治大學教育學系。
|