Bejar, I. I.(2011).A validity-based approach to quality control and assurance of automated scoring.Assessment in Education: Principles, Policy & Practice,18,319-341.
Burstein, J.(2003).The e-rater® scoring engine: Automated essay scoring with natural language processing.Automated essay scoring: A cross-disciplinary perspective,Hillsdale, NJ:
Chakraborty, U. K.,Konar, D.,Roy, S.,Choudhury, S.(2019).Automatic short answer grading using rough concept clusters.International Journal of Advanced Intelligence Paradigms,14(3/4),260-280.
de la Torre, J.(2009).DINA model and parameter estimation: A didactic.Journal of Educational and Behavioral Statistics,34,115-130.
de la Torre, J.,Douglas, J.(2004).Higher-order latent trait models for cognitive diagnosis.Psychometrika,69,333-353.
Doignon, J. P.,Falmagne, J. C.(1999).Knowledge spaces.New York, NY:Springer-Verlag.
Dziuban, C.,Moskal, P.,Johnson, C.,Evans, D.(2017).Adaptive learning: A tale of two contexts.Current Issues in Emerging eLearning,4(1),3.
Elliot, S.(2003).IntelliMetric: From here to validity.Automated essay scoring: A cross-disciplinary perspective,Hillsdale, NJ:
Heilman, M.,Madnani, N.(2015).The impact of training data on automated short answer scoring performance.Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications,Stroudsburg, PA:
Huebner, A.(2010).An overview of recent developments in cognitive diagnostic computer adaptive assessments.Practical Assessment, Research & Evaluation,15(3)
Junker, B. W.,Sijtsma, K.(2001).Cognitive assessment models with few assumptions, and connections with nonparametric item response theory.Applied Psychological Measurement,25,258-272.
Kuo, B. C.,Chen, C. H.,Yang, C. W.,Mok, M. M. C.(2016).Cognitive diagnostic models for tests with multiple choice and constructed response items.Educational Psychology,36,1115-1133.
Leacock, C.,Chodorow, M.(2003).c-rater: Automated scoring of short-answer questions.Computers and the Humanities,37,389-405.
Li, H.,Hunter, C. V.,Lei, P. W.(2016).The selection of cognitive diagnostic models for a reading comprehension test.Language Testing,33,391-409.
Liu, O. L.,Brew, C.,Blackmore, J.,Gerard, L.,Madhok, J.,Linn, M. C.(2014).Automated scoring of constructed-response science items: Prospects and obstacles.Educational Measurement: Issues and Practice,33(2),19-28.
Peredo, R.,Canales, A.,Menchaca, A.,Peredo, I.(2011).Intelligent web-based education system for adaptive learning.Expert Systems with Applications,38(12),14690-14702.
Ravand, H.,Barati, H.,Widhiarso, W.(2013).Exploring diagnostic capacity of a high stakes reading comprehension test: A pedagogical demonstration.Iranian Journal of Language Testing,3(1),12-37.
Risse, T.(2007).Testing and assessing mathematical skills by a script based system.10th International Conference on Interactive Computer Aided Learning,Villach, Austria:
Roberts, M. R.,Gierl, M.(2010).Developing score reports for cognitive diagnostic assessments.Educational Measurement: Issues and Practice,29(3),25-38.
Rupp, A.,Templin, J.(2008).Effects of Q-matrix misspecification on parameter estimates and misclassification rates in the DINA model.Educational and Psychological Measurement,68,78-98.
Schuwirth, L. W.,van der Vleuten, C. P.(2011).Programmatic assessment: From assessment of learning to assessment for learning.Medical Teacher,33,478-485.
Seifried, J.,Brandt, S.,Kögler, K.,Rausch, A.(2020).The computer-based assessment of domain-specific problem-solving competence: A three-step scoring procedure.Cogent Education,7(1),1719571.
Shute, V. J.(2008).Focus on formative feedback.Review of Educational Research,78,153-189.
Topol, B.,Olson, J.,Roeber, E.(2010).The cost of new higher quality assessments: A comprehensive analysis of the potential costs for future state assessments.Stanford, CA:Stanford University, Stanford Center for Opportunity Policy in Education.
Wang, S.,Yang, Y.,Culpepper, S. A.,Douglas, J. A.(2017).Tracking skill acquisition with cognitive diagnosis models: A higher-order, hidden Markov model with covariates.Journal of Educational and Behavioral Statistics,43(1),57-87.
Williamson, D. M.(ed.),Mislevy, R. J.(ed.),Bejar, I. I.(ed.)(2006).Automated scoring of complex tasks in computer-based testing.Mahwah, NJ:Lawrence Erlbaum Associates.
Williamson, D. M.,Almond, R. G.,Mislevy, R. J.,Levy, R.(2006).An application of Bayesian networks in automated scoring of computerized simulation tasks.Automated scoring of complex tasks in computer-based testing,Mahwah, NJ:
Yang, C. W.,Kuo, B. C.,Liao, C. H.(2011).A HO-IRT based diagnostic assessment system with constructed response items.Turkish Online Journal of Educational Technology,10,46-51.