题名

Measurement Equivalence between Respondent Groups: A Non-Parametric Differential Item Functioning Analysis of Polytomous Personality Measures

并列篇名

多元計分人格測驗之測量恆等性:非參數方法之試題差異功能分析

DOI

10.3966/181665042015121104001

作者

賴姿伶(Tzu-Ling Lai)

关键词

personality measures ; measurement equivalence ; differential item functioning (DIF) ; polytomous items ; 人格測驗 ; 測量恆等性 ; 差異試題功能 ; 多元計分試題

期刊名称

教育研究與發展期刊

卷期/出版年月

11卷4期(2015 / 12 / 31)

页次

1 - 22

内容语文

英文

中文摘要

The question of whether applicants respond to self-report personality measures differently when responding for selection purposes has been a crucial concern for decades. However, little research has focused on item-level measurement properties to identify the effect of testing situations on polytomous personality items. This study conducted a non-parametric poly-SIBTEST procedure to investigate both item-level and scale-level measurement equivalence on polytomous Likert-type personality scales between applicants and incumbents. The results indicated that several items exhibited differential item functioning (DIF); however, because DIF items did not systematically function with bias toward a particular group, substantial test functioning variations were not observed for all five scales. The items seemed to measure the same underlying constructs between applicants and incumbents.

英文摘要

自陳式人格測驗經常以李克特式多元計分試題的方式呈現。然而,此類作答方式卻容易引起對於不同應試族群是否產生了不同的測量效果之疑慮,例如,當測量目的是為進行甄選時,受試者是否可能為了獲得錄取而刻意往高分的方向填答(亦即一般所稱的「作假」),而使得測量結果和其他情境下產生差異?過去已有大量研究探討應徵者在李克特式多分題的作答是否和一般學生或在職者不同,但卻多從整份測驗的層次著手,甚少針對試題層次的測量特性進行分析。本研究運用非參數的多分題同步試題偏差檢定法(poly-SIBTEST)來進行應徵者和在職者在試題層次以及量表層次的測量恆等性分析。研究結果發現:的確有若干試題對於不同的應試族群具有差異試題功能(DIF);然而,由於差異試題功能並無系統性地偏利於某一族群,因此在所有的五個人格量表中皆未呈現差異測驗功能(DTF)。分析結果顯示多分題人格測驗應用於甄選情境時,所測量到的潛在特性和其他情境是相等的。

主题分类 社會科學 > 教育學
参考文献
  1. Barrick, M. R.,Mount, M. K.(1991).The big-five personality dimensions job performance: A meta-analysis.Personnel Psychology,44,1-26.
  2. Birkeland, S. A.,Manson, T. M.,Kisamore, J. L.,Brannick, M. T.,Smith, M. A.(2006).A meta-analytic investigation of job applicant faking on personality measures.International Journal of Selection and Assessment,14(4),317-335.
  3. Bollen, K. A.(Ed.),Long, J. S.(Ed.)(1993).Testing structural equation models.Beverly Hills, CA:Sage.
  4. Bollen, K.A.(1989).A new incremental fit index for general structureal equation models.Sociological Methods and Research,17,303-316.
  5. Bolt, D.,Stout, W.(1996).Differential item functioning: Its multidimensional model and resulting SIBTEST detection procedure.Behaviormetrika,23(1),67-95.
  6. Byrne, B. M.,Shavelson, R. J.,Muthén, B.(1989).Testing for the equivalence of factor covariance and mean structures: The issue of partial measurement invariance.Psychological Bulletin,105,456-466.
  7. Camilli, G.,Shepard, L. A.(1994).Methods for identifying biased test items.Thousand Oaks, CA:Sage.
  8. Chang, H. H.,Mazzeo, J.,Roussos, L.(1996).Detecting DIF for polytomously scored items: An adaptation of the SIBTEST procedure.Journal of Educational Measurement,33,333-353.
  9. Chernyshenko, O. S.,Chan, K. Y.,Stark, S.,Drasgow, F.,Williams, B.(1999).Fitting item response theory models to personality data.14th Annual Conference of the Society for Industrial and Organizational Psychology,Atlanta, GA.:
  10. Doulas, J. E.,Roussos, L. A.,Stout, W.(1996).Item-bundle DIF hypothesis testing: Identifying suspect bundles and assessing their differential functioning.Journal of Educational Measurement,33,465-484.
  11. Dunnette, M. D.(Ed.),Hough, L. M.(Ed.)(1990).Handbook of industrial & organizational psychology.Palo Alto, CA:Consulting Psychologists.
  12. Ellingson, J. E.,Sackett, P. R.,Connelly, B. S.(2007).Personality assessment across selection and development contexts: Insights into response distortion.Journal of Applied Psychology,92(2),386-395.
  13. Ellingson, J. E.,Smith, D. B.,Sackett, P. R.(2001).Investigating the influence of social desirability on personality factor structure.Journal of Applied Psychology,86(1),122-133.
  14. Frei, R. L.,Griffith, R. L.,McDaniel, M. A.,Snell, A. F.,Douglas, E. F.(1997).Faking noncognitive measures: Factor invariance using multiple groups LISREL.Faking matters. Symposium conducted at the annual meeting of the Society for Industrial and Organizational Psychology,St. Louis, MO:
  15. Griffith, R. L.,Chmielowski, T.,Yoshita, Y.(2007).Do applicants fake? An examination of the frequency of applicant faking behavior.Personnel Review,36,341-355.
  16. Hogan, J.,Barrett, P.,Hogan, R.(2007).Personality measurement, faking, and employment selection.Journal of Applied Psychology,92(5),1270-1285.
  17. Holland, P. W.,Thayer, D. T.(1988).Differential item performance and Mantel-Haenszel procedure.Test Validity,Hillsdale NJ:
  18. Hough, L. M.,Eaton, N. K.,Dunnette, M. D.,Kamp, J. D.,McCloy, R. A.(1990).Criterionrelated validities of personality constructs and the effect of response distortion on those validities.Journal of Applied Psychology,75,581-595.
  19. Ilgen, D. R.(Ed.),Hulin, C. L.(Ed.)(2000).Computational modeling of behavior in organizations: The third scientific discipline.Washington, DC:American Psychological Association.
  20. Lord, F. M.(1980).Application of item response theory to practical testing problems.Hillsdale, NJ:Lawrence Erlbaum Associates.
  21. Mantel, N.,Haenszel, W.(1959).Statistical aspects of the analysis of data from retrospective studies of disease.Journal of the National Cancer Institute,22,719-748.
  22. Masters, G. N.(1982).A Rasch model for partial credit scoring.Psychometrika,47,149-174.
  23. Maydeu-Olivares, A.(2005).Further empirical results on parametric versus non-parametric IRT modeling of Likert-type personality data.Multivariate Behavioral Research,40(2),261-279.
  24. Mount, M. K.,Barrick, M. R.(1995).The Big Five personality dimensions: Implications for research and practice in human resources management.Research in personnel and human resources management,13,153-200.
  25. Murphy, K. R.(Ed.)(1996).Individual differences and behavior in organizations.San Francisco, CA:Jossey-Bass.
  26. O'Brien, E.,LaHuis, D. M.(2011).Do applicants and incumbents respond to personality items similarly? A comparison of dominance and ideal point response models.International Journal of Selection and Assessment,19(2),109-118.
  27. Raju, N. S.,Laffitte, L. J.,Byrne, B. M.(2002).Measurement equivalence: A comparison of methods based on confirmatory factor analysis and item response theory.Journal of Applied Psychology,87(3),517-529.
  28. Raju, N. S.,van der Linden, W. J.,Fleer, P. F.(1995).IRT-based internal measures of differential functioning of items and tests.Applied Psychological Measurement,19,353-368.
  29. Robie, C.,Zickar, M. J.,Schmit, M. J.(2001).Measurement equivalence between applicant and incumbent groups: An IRT analysis of personality scales.Human Performance,14,187-207.
  30. Roussos, L. A.,Stout, W.(1996).A multidimensionality-based DIF analysis paradigm.Applied Psychological Measurement,20,355-371.
  31. Salgado, J. F.(1997).The five factor model of personality and job performance in the European community.Journal of Applied Psychology,82(1),30-43.
  32. Samejima, F.(1969).Estimation of latent ability using a response pattern of graded scores.Psychometric Monograph,34(Suppl.17)
  33. Schmidt, F. L.,Hunter, J. E.(1998).The validity and utility of selection methods in personnel psychology: Practical and theoretical implications of 85 years of research findings.Psychological Bulletin,124,262-274.
  34. Schmit, M. J.,Ryan, A. M.(1993).The Big Five in personnel selection: Factor structure in applicant and non-applicant populations.Journal of applied psychology,78,966-974.
  35. Shealy, R.,Stout, W.(1993).A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item Bias/DIF.Psychometrika,58,159-194.
  36. Smith, D. B.,Ellingson, J. E.(2002).Substance versus style: A new look at social desirability in motivating contexts.Journal of applied psychology,87(2),211-219.
  37. Somes, G. W.(1986).The generalized Mantel- Haenszel statistic.The American Statistician,40,106-108.
  38. Stark, S.,Chernyshenko, O. S.,Chan, K. Y.,Lee, W. C.,Drasgow, F.(2001).Effects of the testing situation on item responding: Cause for concern.Journal of Applied Psychology,86(5),943-953.
  39. Viswesvaran, C.,Ones, D. S.(1999).Meta-analyses of fakability estimates: Implications for personality measurement.Educational and Psychological Measurement,59,197-210.
  40. Zickar, M. J.,Gibby, R. E.,Robie, C.(2004).Uncovering faking samples in applicant, incumbent, and experimental data sets: An application of mixed-model item response theory.Organizational research methods,7(2),168-190.
  41. Zickar, M. J.,Robie, C.(1999).Modeling faking good on personality items: An item-level analysis.Journal of Applied Psychology,84(4),551-563.
  42. Zickar, M. J.,Ury, K. L.(2002).Developing an interpretation of item parameters for personality items: Content correlates of parameter estimates.Educational and Psychological Measurement,62,19-31.
  43. 賴姿伶、余民寧、徐崇文(2009)。員工甄選人格量表的編製及其信效度考驗之初步報告。教育研究與發展期刊,5(4),269-304。
被引用次数
  1. 鄧鈞文,陳俊瑋,林仁傑(2019)。數學成就測驗的性別差異試題功能(DIF)現象:以臺灣學生學習成就評量資料為例。教育科學期刊,18(1),71-91。