Assessing language proficiency: A rasch analysis of ELT assessment tools

Authors

DOI:

https://doi.org/10.29329/pedper.2025.103

Keywords:

Language Proficiency, Rasch Analysis, ELT Assessment Tools

Abstract

This paper now addresses the important role of Rasch analysis in ensuring high-quality assessments in the English Language Teaching (ELT) context. Assessment is at the core of ELT; it verifies student learning and lends an ear to teaching methodologies. Current assessment methodologies often fail to measure learners' competencies accurately. Rasch analysis, on the other hand, provides a process to assess test items so that they are as fair and equal as possible in a given assessment. Rasch analysis converts raw scores into interval-level measurements, allowing for precise comparisons between learners. The tool is an important driver in identifying central items, such as item bias and DIF (Differential Item Functioning), that must be addressed for fair assessments. In this review, we present the strengths of Rasch Analysis in a wide range of ELT contexts, including reading, listening, speaking, and writing assessments. Although Rasch analysis has strengths, it is not without limitations, such as requiring large sample sizes and understanding the results' meaning. Even so, it helps in knowing more about how assessment quality can be improved. It concludes by summarizing the impact of Rasch on assessment, providing a fair and valid method for measuring from both learners' and educators' perspectives in various types of educational settings. Further studies should be conducted to demonstrate its utility in smaller-scale designs and to investigate its effects in conjunction with other assessment methodologies.

References

Alavi, S. M., Ali, R. A., & Amirian, S. M. R. (2011). Academic discipline DIF in an English language proficiency test. Journal of English Language Teaching and Learning, 7, 39–65.

Aryadoust, V. (2017). Adapting levels 1 and 2 of Kirkpatrick's model of training evaluation to examine the effectiveness of a tertiary-level writing course. Innovation in Language Learning and Teaching, 11(2), 107–120. https://doi.org/10.1080/1554480X.2016.1242426

Baghaei, P., & Robitzsch, A. (2025). A tutorial on item response modelling with multiple groups using TAM. Educational Methods and Practice. (Yayına Hazırlık Aşamasında)

Bakri, H. (2022). Evaluating and testing English language skills: Benchmarking the TOEFL and IELTS tests. International Journal of English Linguistics, 12(3), 99–110. https://doi.org/10.5539/ijel.v12n3p99

Bakri, H. (2023). Evaluating and testing English language skills: Benchmarking the TOEFL and IELTS tests. Bulletin of Faculty of Languages and Translation. https://doi.org/10.21608/bflt.2023.319543

Bijani, H., & Khabiri, M. (2017). The impact of raters and test-takers' gender on oral proficiency assessment: A case of multifaceted Rasch analysis. Journal of Teaching Language Skills, 36(1), 115–138. https://doi.org/10.22099/jtls.2017.25897.2290

Bijani, H. (2019). Evaluating the effectiveness of the training program on direct and semi-direct oral proficiency assessment: A case of multifaceted Rasch analysis. Cogent Education, 6(1), 1670592. https://doi.org/10.1080/2331186x.2019.1670592

Bond, T. G., & Fox, C. M. (2015). Applying the Rasch model: Fundamental measurement in the human sciences (3rd ed.). Routledge.

Chen, P., & Fu, J. (2018). Examining measurement properties of the revised Preschool Language Assessment for use with Mandarin-speaking children. Language Assessment Quarterly, 15(2), 152–167. https://doi.org/10.1080/15434303.2018.1529176

Choi, H., & Lee, H. (2021). Effects of examinees' perceptions of interlocutor proficiency on paired oral assessment results. The Journal of Asia TEFL, 18(1), 23–40. https://doi.org/10.18823/ASIATEFL.2021.18.1.2.23

Farlie, M., Johnson, C., Wilkinson, T. J., & Keating, J. (2021). Refining assessment: Rasch analysis in health professional education and research. Focus on Health Professional Education, 22(2), 16–28. https://doi.org/10.11157/fohpe.v22i2.569

Hamp-Lyons, L. (1989). Applying the partial credit method of Rasch analysis: Language testing and accountability. Language Testing, 6(1), 1–9. https://doi.org/10.1177/026553228900600109

Inoue, C. (2016). A comparative study of the variables used to measure syntactic complexity and accuracy in task-based research. Language Awareness, 25(1–2), 4–20. https://doi.org/10.1080/09571736.2015.1130079

Khalaf, M. A., & Omara, E. M. N. (2022). Rasch analysis and differential item functioning of English Language Anxiety Scale (ELAS) across sex in the Egyptian context. BMC Psychology, 10(1), Article 1. https://doi.org/10.1186/s40359-022-00955-w

Kianinezhad, N. (2024). Validation of the adapted attitudes toward online teaching scale for Iranian EFL teachers: A Rasch model analysis. Education Mind, 3(1), 31–45.

Kianinezhad, N., & Kianinezhad, M. (2025). Comparative evaluation of C-test reliability using classical and modern psychometric methods. Language Education & Assessment, 8(1), 2279–2295.

Kopparapu, S., & Panda, A. (2024). Unified spoken language proficiency assessment system. In Proceedings of Oriental COCOSDA International Conference on Speech Database and Assessments. https://doi.org/10.1109/O-COCOSDA64382.2024.10800105

Lee-Ellis, S. (2009). The development and validation of a Korean C-test using Rasch analysis. Language Testing, 26(2), 245–274. https://doi.org/10.1177/0265532208101007

Leeming, P., & Harris, J. (2022). Measuring speaking proficiency growth in the language classroom: An investigation of practical approaches for teachers. Language Teaching Research. https://doi.org/10.1177/13621688221130856

McNamara, T. (1996). Measuring second language performance. Longman.

Medvedev, O., Sheppard, C. L., Monetta, L., & Taler, V. (2019). The BNT-38: Applying Rasch analysis to adapt the Boston Naming Test for use with English and French monolinguals and bilinguals. Journal of Speech, Language, and Hearing Research, 62(10), 3690–3701. https://doi.org/10.1044/2018_JSLHR-L-18-0084

O'Brien, M. L. (1989). Psychometric issues relevant to selecting items and assembling parallel forms of language proficiency instruments. Educational and Psychological Measurement, 49(2), 321–334. https://doi.org/10.1177/0013164489492007

Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Danish Institute for Educational Research.

Shirazi, M. A. (2019). For a greater good: Bias analysis in writing assessment. SAGE Open, 9(1), 1–13. https://doi.org/10.1177/2158244018822377

Sundqvist, P., Sandlund, E., Skar, G. B., & Tengberg, M. (2020). Effects of rater training on the assessment of L2 English oral proficiency. Nordic Journal of Modern Language Methodology, 8(1), 1–23. https://doi.org/10.46364/njmlm.v8i1.605

Turkan, S., & Liu, O. L. (2012). Differential performance by English language learners on an inquiry-based science assessment. International Journal of Science Education, 34(13), 1967–1987. https://doi.org/10.1080/09500693.2012.705046

Wang, P., Coetzee, K. L., Strachan, A., Monteiro, S., & Cheng, L. (2020). Examining rater performance on the CELBAN speaking: A many-facets Rasch measurement analysis. Canadian Journal of Applied Linguistics, 23(2), 72–95. https://doi.org/10.37213/cjal.2020.30436

Wright, B. D., & Masters, G. N. (1982). Rating scale analysis: Rasch measurement. MESA Press.

Wang, P., Coetzee, K. L., Strachan, A., Monteiro, S., & Cheng, L. (2020). Examining rater performance on the CELBAN speaking: A many-facets Rasch measurement analysis. Canadian Journal of Applied Linguistics, 23(2), 72–95. https://doi.org/10.37213/cjal.2020.30436

Wright, B. D., & Masters, G. N. (1982). Rating scale analysis: Rasch measurement. MESA Press.

Downloads

Published

23-10-2025

How to Cite

Kianinezhad, N. (2025). Assessing language proficiency: A rasch analysis of ELT assessment tools. Pedagogical Perspective, 4(2), 499–508. https://doi.org/10.29329/pedper.2025.103

Issue

Section

Review Articles