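 The Influence of Difficulty Range of Anchor Items and Separation of Grade Distributions on Vertical Scaling Under Different Base Grades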

Psychological Exploration (心理学探新) [ISSN: 1003-5184 / CN: 36-1228/B]

Issue:
 2023, No. 01
Pages:
 68-76
Section:
 Psychological Statistics and Measurement
Publication date:
 2023-03-20

Article Information

Title:
 The Influence of Difficulty Range of Anchor Items and Separation of Grade Distributions on Vertical Scaling Under Different Base Grades
Article ID:
1003-5184(2023)01-0068-09
Author(s):
 Li Guangming (黎光明), Zhang Xiaoting (张晓婷)
 (School of Psychology, Center for Studies of Psychological Application, South China Normal University, Guangzhou 510631)
Keywords:
 vertical scaling; base grade; difficulty range of anchor items; separation of grade distributions; test equating
CLC number:
 B841.2
DOI:
 -
Document code:
 A
Abstract:
 This simulation study used the three-parameter logistic model (3PLM) under a non-equivalent groups common-item (anchor item) design to examine how the difficulty range of anchor items and the separation of grade distributions affect vertical scaling under different base grades. Grade 1 and grade 2 were each set as the base grade. The item parameters of 100 items for four grades and the ability parameters of 1000 examinees in each grade were set under different combinations of anchor-item difficulty range and separation of grade distributions. Response matrices were then simulated by the Monte Carlo method, and BILOG-MG was used for concurrent calibration. Bias and RMSE were calculated as the accuracy criteria. The results show that: (1) The choice of base grade affects the accuracy of vertical scaling. (2) Under an anchor-item design, vertical scaling conversions should not span more than two grades. (3) Under different base grades, the smaller the separation of grade distributions, the better the estimation accuracy. (4) The appropriate difficulty range of the anchor items depends on the base grade: when the base grade is the lower grade, the range should be wide; when the base grade is a middle grade, a medium range gives better accuracy. (5) There is an interaction between the separation of grade distributions and the difficulty range of the anchor items.
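
The data-generation and evaluation steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes the common logistic form of the 3PLM without a scaling constant, uses hypothetical item and ability distributions, and replaces the BILOG-MG concurrent calibration with a noisy copy of the true difficulties so that bias and RMSE can be shown. Only the counts of 100 items and 1000 examinees come from the abstract; all function and variable names are illustrative.

```python
import math
import random


def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PLM (logistic form, no scaling constant)."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def simulate_responses(thetas, items, rng):
    """Monte Carlo draw of a 0/1 response matrix (examinees x items)."""
    return [
        [1 if rng.random() < p_correct_3pl(t, a, b, c) else 0 for (a, b, c) in items]
        for t in thetas
    ]


def bias(estimates, truths):
    """Mean signed error of the estimates."""
    return sum(e - t for e, t in zip(estimates, truths)) / len(truths)


def rmse(estimates, truths):
    """Root mean squared error of the estimates."""
    return math.sqrt(sum((e - t) ** 2 for e, t in zip(estimates, truths)) / len(truths))


rng = random.Random(0)

# Hypothetical generating values: (discrimination a, difficulty b, guessing c) per item.
items = [(rng.uniform(0.8, 1.6), rng.gauss(0.0, 1.0), 0.2) for _ in range(100)]
# Hypothetical abilities for one grade of 1000 examinees.
thetas = [rng.gauss(0.0, 1.0) for _ in range(1000)]

responses = simulate_responses(thetas, items, rng)

# Stand-in for calibrated estimates: true difficulties plus small noise (purely illustrative).
true_b = [b for (_, b, _) in items]
est_b = [b + rng.gauss(0.0, 0.1) for b in true_b]
print("bias:", bias(est_b, true_b), "RMSE:", rmse(est_b, true_b))
```

In a full replication, the stand-in estimates would be replaced by parameters calibrated from the simulated responses of all grades, and the grade means and anchor-item difficulties would be varied across conditions.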

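Memo

 Funding: General Program of the Natural Science Foundation of Guangdong Province (2021A1515012516); Characteristic Innovation Project of Guangdong Ordinary Universities (Philosophy and Social Sciences) (Yue Jiao Ke Han [2021] No. 7, 2021WTSCX020).
Corresponding author: Li Guangming, E-mail: Lgm2004100@sina.com.
Last Update: 2023-03-20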