Development of Parallel Tests in Mathematics for Students in Grade 2 in Schools under the Bangkok Metropolitan Administration through an Application of Two-Parameter Item Response Theory

Savina Boonsen
Chuthaphon Masantiah
Panida Panidvadana

Abstract

The purposes of this research were to 1) develop parallel tests in mathematics for grade 2 in the strand of measurement, and 2) verify the parallelism of those parallel tests. The sample consisted of 800 grade 2 students in the 2019 academic year from 13 schools in Nong Chok District under the Bangkok Metropolitan Administration, selected by stratified random sampling. The research instruments were 4 objective tests, on length (1A, 1B) and weight (2A, 2B), each comprising 20 multiple-choice items. The data were analyzed for content validity, reliability, item difficulty, discrimination, item information function, and test information function under the two-parameter item response theory model, and for the parallelism of the items and tests using the root mean square deviation of item information (RMSDIF) and the root mean square deviation of test information (RMSDTIF), respectively.
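For reference, the item difficulty (b), discrimination (a), and information functions referred to above are the standard quantities of the two-parameter logistic (2PL) model. The sketch below states the usual formulas; the scaling constant D (commonly 1 or 1.7) is an assumption, since the abstract does not specify which convention was used.

\[
P_i(\theta) = \frac{1}{1 + e^{-D a_i (\theta - b_i)}}, \qquad
I_i(\theta) = D^2 a_i^2\, P_i(\theta)\bigl(1 - P_i(\theta)\bigr), \qquad
I(\theta) = \sum_{i=1}^{n} I_i(\theta),
\]

where \(a_i\) is the discrimination and \(b_i\) the difficulty of item \(i\), \(I_i(\theta)\) is the item information function, and \(I(\theta)\) is the test information function of a 20-item form.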
The research findings revealed that:
1) All 4 parallel tests in mathematics had indices of item-objective congruence (IOC), the index of content validity, ranging from 0.67 to 1.00, and reliabilities of 0.711, 0.652, 0.718, and 0.635 respectively. The test information function reached its maximum at proficiency levels between -0.80 and -0.60 for the 1A and 1B tests and between -0.60 and -0.40 for the 2A and 2B tests. The item difficulty (b) was between -2.98 and 0.57, the discrimination (a) was between 0.22 and 1.13, and the item information function across all 4 tests was between 0.013 and 0.320.
2) The parallelism analysis of the items and tests on length (1A, 1B) revealed that the RMSDIF ranged from 0.008 to 0.167 and the RMSDTIF was 0.484, while the parallelism analysis of the items and tests on weight (2A, 2B) revealed that the RMSDIF ranged from 0.020 to 0.110 and the RMSDTIF was 0.492.
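The abstract does not spell out how RMSDIF and RMSDTIF are computed. The minimal Python sketch below assumes they are root-mean-square differences between the item information functions of paired items and between the test information functions of the two forms, evaluated over a grid of proficiency values; the function names, the theta grid, and the example parameters are illustrative, not the authors' actual computation.

import numpy as np

D = 1.7  # assumed logistic scaling constant; the abstract does not state the convention

def item_information(theta, a, b):
    # 2PL item information I_i(theta) for discrimination a and difficulty b
    p = 1.0 / (1.0 + np.exp(-D * a * (theta - b)))
    return (D * a) ** 2 * p * (1.0 - p)

def rmsdif(theta, a1, b1, a2, b2):
    # Root mean square deviation of item information between two paired items
    diff = item_information(theta, a1, b1) - item_information(theta, a2, b2)
    return np.sqrt(np.mean(diff ** 2))

def rmsdtif(theta, params_form_a, params_form_b):
    # Root mean square deviation of test information between two forms;
    # params_form_* is a list of (a, b) tuples, one per item
    tif_a = sum(item_information(theta, a, b) for a, b in params_form_a)
    tif_b = sum(item_information(theta, a, b) for a, b in params_form_b)
    return np.sqrt(np.mean((tif_a - tif_b) ** 2))

# Illustrative use with made-up parameters lying inside the ranges reported above
theta = np.linspace(-4, 4, 81)  # proficiency grid; the grid choice is an assumption
print(rmsdif(theta, 0.85, -1.20, 0.78, -1.05))
print(rmsdtif(theta,
              [(0.85, -1.20), (0.60, -0.40)],
              [(0.78, -1.05), (0.66, -0.55)]))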

Article Details

Section
Research Article
