
 A Comparison of the Four-Parameter Logistic Model and Traditional Models in Fitting Examinee Responses

Psychological Exploration (《心理学探新》) [ISSN: 1003-5184 / CN: 36-1228/B]

Issue:
 2018, No. 03
Pages:
 228-235
Section:
 Psychological Statistics and Measurement
Publication date:
 2018-05-30

Article Info

Title:
 A Comparison Study for the Four Parameter Logistic Model and Traditional Logistic Models
Author(s):
 Liu Yue (刘玥), Liu Hongyun (刘红云)
 School of Psychology, Beijing Normal University, Beijing 100875
Keywords:
 item response theory; sleeping phenomenon; four-parameter logistic model
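Classification number:
 B841.2
DOI:
 -
Document code:
 A
Abstract (translated from Chinese):
 The four-parameter logistic model can be used to analyze data affected by the sleeping phenomenon, in which high-ability examinees answer easy items incorrectly. This study took real data from a psychological test and an achievement test, fitted both traditional models and the four-parameter logistic model to each, and compared the models' fit indices and parameter estimates. The results show that the four-parameter logistic model improves model fit, increases the accuracy of the estimates, and effectively corrects the underestimation of high-ability examinees' ability. We recommend using the four-parameter logistic model for data analysis when necessary.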
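To make the sleeping phenomenon concrete, the following is a minimal sketch of the four-parameter logistic item response function in its commonly used form, P(θ) = c + (d - c) / (1 + exp(-a(θ - b))). It is an illustration only, not the authors' code (the study itself used the R package sirt). The function name `p_4pl` and all item and ability values are hypothetical, chosen only to show how an upper asymptote d below 1 leaves room for a high-ability examinee to miss an easy item.

```python
import math


def p_4pl(theta, a, b, c=0.0, d=1.0):
    """Probability of a correct response under the four-parameter logistic model.

    theta: examinee ability
    a: discrimination, b: difficulty
    c: lower asymptote (guessing), d: upper asymptote (1 - d allows for "sleeping")
    With d = 1 this reduces to the 3PM; with c = 0 and d = 1, to the 2PM.
    """
    return c + (d - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical easy item (b = -2) and hypothetical high-ability examinee (theta = 3).
theta, a, b, c = 3.0, 1.5, -2.0, 0.2
p_wrong_3pm = 1.0 - p_4pl(theta, a, b, c)            # traditional 3PM, d fixed at 1
p_wrong_4pm = 1.0 - p_4pl(theta, a, b, c, d=0.95)    # 4PM with an assumed d = 0.95

print(f"P(incorrect) under 3PM: {p_wrong_3pm:.4f}")
print(f"P(incorrect) under 4PM: {p_wrong_4pm:.4f}")
```

Under the 3PM an incorrect answer from this examinee is treated as nearly impossible, so a single slip can pull the ability estimate down sharply. With d below 1 the same slip is far less surprising to the model.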
Abstract:
 High-ability test-takers may occasionally answer an easy item incorrectly, which is called the sleeping phenomenon (Wright, 1977). In these situations, the four-parameter logistic model (4PM) may be uniquely suited to characterizing the data. The 4PM was proposed by Barton and Lord (1981), who added the d parameter to allow upper asymptotes to be less than 1.00. The more general formulation of the 4PM (Waller & Reise, 2010) treats d as an item-specific upper asymptote. In addition, a three-parameter logistic model for reversed data (3PMR) has been discussed, which suits situations where sleeping occurs but guessing does not. In previous research, the 4PM fit some psychological tests well, such as the MMPI. For achievement tests, however, Barton and Lord found in their earlier work that the 4PM failed to improve the likelihood or to change any ability estimates significantly for the datasets collected by ETS. Is it therefore really inappropriate to use the 4PM in achievement tests?
Moreover, most previous research focused on differences in parameter estimates based on simulated data, yet how often the sleeping phenomenon occurs in real testing situations is still worth studying. In our research, we fitted seven models to the Taylor Manifest Anxiety Scale (TMA) and a large-scale Maths test. The Maths test dataset was also used to construct two different distributions: an approximately normal distribution (skewness = 0.097) and a negatively skewed distribution (skewness = -0.199). The models compared were the Rasch model, the two-parameter logistic model (2PM), the three-parameter logistic model (3PM), the 3PM with reversed scores on each item (3PMR), the 4PM, the 4PM with equal guessing parameters (4PM_c), and the 4PM with equal d parameters (4PM_d). The R package sirt was used to estimate all the models. To investigate the differences among these models, we computed: (1) the model fit indices AIC and BIC; (2) the correlations between the item parameter estimates of the best-fitting logistic model with a d parameter and the second-best model without a d parameter, for all items and after the easiest 5, 10, and 10 items were deleted; (3) the correlations between the ability parameter estimates of the two models discussed in (2), for all respondents and for the top 1000, 500, 300, 200, and 100 respondents.
The results indicated that (1) the Rasch model showed the worst fit for all the datasets; for the TMA data the 3PMR showed the best fit, and for the Maths tests the 4PM showed the best fit; (2) the difficulty parameters were quite similar in the two compared models, but the discrimination parameters differed more, and the negatively skewed Standard Maths test data showed similar results; when the easiest items were deleted, the correlation between the discrimination parameters became larger, especially for the negatively skewed Standard Maths test; (3) the ability parameters of the two compared models correlated highly across all groups of respondents, but the correlations for the top 1000, 500, 300, 200, and 100 groups were relatively small, especially for the top 100 respondents. In conclusion, the 4PM is necessary in both psychological tests and achievement tests. Practitioners deciding whether to use the 4PM should consider the type of test, the purpose of the test, and the computational complexity together.
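The AIC/BIC comparison described in step (1) can be sketched as follows. This is a hedged illustration of the standard definitions (AIC = 2k - 2 ln L, BIC = k ln n - 2 ln L), not the authors' sirt workflow. The log-likelihoods, the 20-item test length, the sample size, and the parameter counts (one to four parameters per item) are all made-up assumptions and are not results from the study.

```python
import math


def aic(log_lik, n_params):
    """Akaike information criterion: lower values indicate better fit."""
    return 2 * n_params - 2 * log_lik


def bic(log_lik, n_params, n_obs):
    """Bayesian information criterion: penalizes parameters more as n_obs grows."""
    return n_params * math.log(n_obs) - 2 * log_lik


def rank_models(fits, n_obs):
    """Return (name, AIC, BIC) tuples sorted from best to worst AIC.

    fits maps a model name to (log-likelihood, number of estimated parameters).
    """
    rows = [(name, aic(ll, k), bic(ll, k, n_obs)) for name, (ll, k) in fits.items()]
    return sorted(rows, key=lambda row: row[1])


# Hypothetical fits for a 20-item test taken by 1000 respondents (NOT study data).
# Parameter counts assume item parameters only: 1, 2, 3, or 4 per item.
hypothetical_fits = {
    "Rasch": (-11000.0, 20),
    "2PM": (-10900.0, 40),
    "3PM": (-10870.0, 60),
    "4PM": (-10820.0, 80),
}

for name, model_aic, model_bic in rank_models(hypothetical_fits, n_obs=1000):
    print(f"{name:6s} AIC = {model_aic:10.1f}  BIC = {model_bic:10.1f}")
```

Because BIC penalizes each extra parameter by ln n rather than 2, the two indices can disagree about a heavily parameterized model such as the 4PM, which is one reason to report both.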

References

 Jian, X. Z. (2006). [The ability of the c and γ parameters of the Logistic model to fit examinee responses] (Master's thesis, in Chinese). Nanchang: Jiangxi Normal University.
Jian, X. Z., Dai, H. Q., & Peng, C. M. (2007). [Improvement of ability estimation by the c and γ parameters of the Logistic model in IRT] (in Chinese). Acta Psychologica Sinica, 39(4), 737-746.
Jian, X. Z., Jiao, C., & Peng, C. M. (2010). [Fitting and correcting aberrant examinee responses with the four-parameter model] (in Chinese). Advances in Psychological Science, (3), 537-544.
Jian, X. Z., Zhang, M. Q., & Peng, C. M. (2010). [Research progress on and review of the four-parameter Logistic model] (in Chinese). Psychological Exploration, 30(3), 69-73.
Barton, M. A., & Lord, F. M. (1981). An upper asymptote for the three-parameter logistic item-response model. ETS Research Report Series, (1), i-8.
Carlson, S. (2000). ETS finds flaws in the way online GRE rates some students. Chronicle of Higher Education, 47(8), A47.
Linacre, J. M. (2009). A user's guide to WINSTEPS MINISTEP Rasch-model computer programs. Retrieved from: http://199.236.93.8/winman/index.htm?asymptote.htm
Loken, E., & Rulison, K. L. (2010). Estimation of a four-parameter item response theory model. British Journal of Mathematical and Statistical Psychology, 63(3), 509-525.
McDonald, R. P. (1967). Non-linear factor analysis. Psychometric Monographs, 15.
Reise, S. P., & Waller, N. G. (2003). How many IRT parameters does it take to model psychopathology items? Psychological Methods, 8(2), 164.
Rulison, K. L., & Loken, E. (2009). I've Fallen and I Can't Get Up: Can High-Ability Students Recover From Early Mistakes in CAT? Applied Psychological Measurement, 33(2), 83-101.
Waller, N. G., & Reise, S. P. (2010). Measuring psychopathology with non-standard IRT models: Fitting the four parameter model to the MMPI. In S. Embretson & J. S. Roberts (Eds.), New directions in psychological measurement with model-based approaches. Washington, DC: American Psychological Association.
Wright, B. D. (1977). Solving measurement problems with the Rasch model. Journal of Educational Measurement, 97-116.

Last Update: 2018-05-30