 无锚题非等组设计下的等值实现 (Equating Under the Design of Non-equivalent Groups without Anchor Items)

Psychological Exploration (《心理学探新》) [ISSN: 1003-5184 / CN: 36-1228/B]

Issue:
 2025, No. 01
Pages:
 72-77, 86
Section:
 Psychological Statistics and Measurement
Publication date:
 2025-02-20

Article Info

Title:
 Equating Under the Design of Non-equivalent Groups without Anchor Items
Article ID:
1003-5184(2025)01-0072-06
Author(s):
 Dong Shenghong (董圣鸿)¹, Qin Chunying (秦春影)¹,², You Xiaofeng (游晓锋)², Yu Xiaofeng (喻晓锋)¹
 (1. School of Psychology, Jiangxi Normal University, Nanchang 330022; 2. School of Mathematics and Information Science, Nanchang Normal University, Nanchang 330032)
Keywords (translated from the Chinese):
 equating; anchor items; pseudo-equivalent groups; cognitive structure
Keywords:
 equating; anchor items; non-equivalent group; construct
CLC number:
 B841.2
DOI:
 -
Document code:
 A
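Abstract (translated from the Chinese):
 Many tests are designed without any specific provision for equating, or their high-stakes nature means that they inherently fail to fit conventional equating designs. Even so, test scores or examinee abilities still need to be compared across test forms, so equating remains necessary. This paper focuses on equating under the anchorless design (that is, with neither anchor items nor equivalent groups). It evaluates the methods and techniques used in representative existing studies, including methods that construct pseudo-equivalent groups and methods based on a common cognitive structure, with the aim of clarifying the conditions of use, strengths, and weaknesses of each approach. It also outlines directions for future research.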
Abstract:
 In many testing contexts, the design of assessments often lacks intentional consideration for equating purposes, or the high-stakes nature of assessments inherently precludes conventional equating designs (e.g., those requiring anchor items or equivalent groups). Nevertheless, there remains a compelling need to compare scores across different test forms or evaluate examinee abilities under such constraints, thereby necessitating equating solutions. This study focuses on the equating problem under anchorless designs (i.e., scenarios devoid of anchor items and equivalent groups). We critically evaluate existing methodologies and techniques in the literature, clarify the applicability and limitations of different approaches, and propose future research directions.

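Memo

 Funding: Special supporting project under the 14th Five-Year Plan of the Education Examinations Authority, Ministry of Education, "Research on the Conversion of Cross-Year Subject Scores in the Implementation of the Gaokao" (NEEA2021050).
 Corresponding authors: Dong Shenghong, E-mail: jxnudsh@163.com; Qin Chunying, E-mail: cqin@jxnu.edu.cn.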
Last Update: 2025-02-20