Development of the data literacy scale in social sciences: A validity and reliability study
The present study will provide important information for future educational strategies and intervention programs by revealing the current status of undergraduate students in data literacy. The current research is a scale development study. The study group consisted of undergraduate students from 20 universities in Türkiye. Data validity and construct analysis were performed using exploratory and confirmatory factor analyses. Whereas the Kaiser–Meyer–Olkin (KMO) value of 0.967 indicated that the sample was perfect, Bartlett’s test confirmed that the correlations between the items were adequate. Cronbach’s alpha value of 0.973 indicated a very high internal consistency. Furthermore, high reliability was provided with the inter-form correlation of 0.853, the Spearman-Brown coefficient of 0.920, and the Guttman split-half coefficient of 0.919. The exploratory factor analysis revealed that the scale consisted of three sub-dimensions and explained 64.056% of the total variance. The items showed factor loadings above 0.40. The CFA results confirmed that the model represented the three sub-dimensions of data literacy well, and the RMSEA, CFI, IFI, and RFI fit indices were high. Compared with the available scales in the literature, this study makes a significant contribution by presenting a customized, comprehensive measurement tool in the context of the social sciences.
Ackoff, R. L. (1989). From data to wisdom. Journal of Applied Systems Analysis, 16, 3–9. https://faculty.ung.edu/kmelton/documents/datawisdom.pdf
Amidon, D. M. (1997). Innovation strategy for the knowledge economy: The ken awakening. Butterworth-Heinemann. https://doi.org/10.4324/9780080508795
Braeken, J., & van Assen, M. A. L. M. (2017). An empirical Kaiser criterion. Psychological Methods, 22(3), 450–466. https://doi.org/10.1037/met0000074
Brown, T. A. (2015). Confirmatory factor analysis for applied research (2nd ed.). Guilford Press.
Çalışkan, H. (2012). Development of the measurement and evaluation self-efficacy perception scale and the examination of the status of social studies teachers. Energy Education Science and Technology Part B: Social and Educational Studies, 4(1), 1003–1008.
Calzada Prado, J., & Marzal, M. Á. (2013). Incorporating data literacy into information literacy programs: Core competencies and contents. Libri, 63(2), 123–134. https://doi.org/10.1515/libri-2013-0010
Carlson, J., Fosmire, M., Miller, C. C., & Nelson, M. S. (2011). Determining data information literacy needs: A study of students and research faculty. Portal: Libraries and the Academy, 11(2), 629–657. https://doi.org/10.1353/pla.2011.0022
Chen, B., Chang, Y. H., Ouyang, F., & Zhou, W. (2018). Fostering student engagement in online discussion through social learning analytics. The Internet and Higher Education, 37, 21–30. https://doi.org/10.1016/j.iheduc.2017.12.002
Cohen, J., & Cohen, P. (1983). Applied multiple regression/correlation analysis for the behavioral sciences. L. Erlbaum.
D'Ignazio, C., & Bhargava, R. (2015, September). Approaches to building big data literacy. In Proceedings of the Bloomberg Data for Good Exchange Conference (Vol. 6).
Davenport, T. H., & Prusak, L. (2000). Working knowledge: How organizations manage what they know. Harvard Business School Press. https://doi.org/10.1145/348772.348775
Deahl, E. (2014). Better the data you know: Developing youth data literacy in schools and informal learning environments [Master's thesis, Massachusetts Institute of Technology]. https://doi.org/10.2139/ssrn.2445621
De Winter, J. C. F., Dodou, D., & Wieringa, P. A. (2009). Exploratory factor analysis with small sample sizes. Multivariate Behavioral Research, 44, 147–181. https://doi.org/10.1080/00273170902794206
Demirtaş, Ç. (2022). A model proposal for knowledge literacy in social studies education [Doctoral dissertation, Bolu Abant Izzet Baysal University]. YÖK National Thesis Center. https://tez.yok.gov.tr
Demirtaş, Ç. (2024). Data literacy and education: A science mapping study. Participatory Educational Research, 11(3), 220–243. https://doi.org/10.17275/per.24.43.11.3
DeVellis, R. F. (2012). Scale development: Theory and applications (3rd ed.). Sage.
Field, A. (2009). Discovering statistics using SPSS (3rd ed.). Sage.
Floyd, F. J., & Widaman, K. F. (1995). Factor analysis in the development and refinement of clinical assessment instruments. Psychological Assessment, 7(3), 286–299. https://doi.org/10.1037/1040-3590.7.3.286
Fontichiaro, K., & Oehrli, J. A. (2016). Why data literacy matters. Knowledge Quest, 44(5), 21–27.
Gebre, E. H. (2018). Young adults' understanding and use of data: Insights for fostering secondary school students' data literacy. Canadian Journal of Science, Mathematics and Technology Education, 18(4), 330–341. https://doi.org/10.1007/s42330-018-0034-z
Gencer, R., & Altun, A. (2021). Dijitalleşme, bilgi hiyerarşisini değiştirdi mi? (VEBB: veri, enformasyon, bilgi ve bilgelik) [Did digitalization change the knowledge hierarchy? (DIKW: data, information, knowledge and wisdom)]. Diyalektolog – Uluslararası Sosyal Bilimler Dergisi, (27). https://doi.org/10.29228/diyalektolog.52392
Goodwin, L. D. (1999). The role of factor analysis in the estimation of construct validity. Measurement in Physical Education and Exercise Science, 3(2), 85–100. https://doi.org/10.1207/s15327841mpee0302_2
Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2014). Multivariate data analysis (7th ed.). Prentice Hall.
Hoogland, I., Schildkamp, K., Van der Kleij, F., Heitink, M., Kippers, W., Veldkamp, B., & Dijkstra, A. M. (2016). Prerequisites for data-based decision making in the classroom: Research evidence and practical illustrations. Teaching and Teacher Education, 60, 377–386. https://doi.org/10.1016/j.tate.2016.07.012
Howard, M. C. (2016). A review of exploratory factor analysis decisions and overview of current practices: What we are doing and how can we improve? International Journal of Human–Computer Interaction, 32(1), 51–62. https://doi.org/10.1080/10447318.2015.1087664
Hu, L. T., & Bentler, P. M. (1998). Fit indices in covariance structure modeling: Sensitivity to underparameterized model misspecification. Psychological Methods, 3(4), 424–453. https://doi.org/10.1037/1082-989X.3.4.424
Hu, L. T., & Bentler, P. M. (1999). Cut-off criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6, 1–55. https://doi.org/10.1080/10705519909540118
İnan, S. (2021). Siyaset okuryazarlığı "yöneten birey olmak ve okullarda siyaset eğitimi mümkün mü?" [Political literacy "Is it possible to be a governing individual and to give political education in schools?"]. Yeni İnsan Yayınevi.
Jeffery, K. (2014, April). Data is the new oil [Conference presentation]. Best Practices for Data Management & Sharing, Joint Research Centre (JRC), Ispra, Italy.
Jöreskog, K. G. (1971). Statistical analysis of sets of congeneric tests. Psychometrika, 36(2), 109–133. https://doi.org/10.1007/BF02291393
Karaman, M. (2023). Keşfedici ve doğrulayıcı faktör analizi: Kavramsal bir çalışma [Exploratory and confirmatory factor analysis: A conceptual study]. Uluslararası İktisadi ve İdari Bilimler Dergisi, 9(1), 47–63. https://doi.org/10.29131/uiibd.1279602
Kelley, J. (2002). Knowledge nirvana: Achieving the competitive advantage through enterprise content management and optimizing team collaboration. Xulon Press.
Kim, J., Hong, L., Evans, S., Oyler-Rice, E., & Ali, I. (2023). Development and validation of a data literacy assessment scale. Proceedings of the Association for Information Science and Technology, 60(1), 620–624. https://doi.org/10.1002/pra2.827
Kippers, W. B., Poortman, C. L., Schildkamp, K., & Visscher, A. J. (2018). Data literacy: What do educators learn and struggle with during a data use intervention? Studies in Educational Evaluation, 56, 21–31. https://doi.org/10.1016/j.stueduc.2017.11.001
Kline, R. B. (2016). Principle and practice of structural equation modelling (4th ed.). Guilford Press.
Koltay, T. (2015). Data literacy: In search of a name and identity. Journal of Documentation, 71(2), 401–415. https://doi.org/10.1108/JD-02-2014-0026
Koyuncu, I., & Kılıç, A. (2019). The use of exploratory and confirmatory factor analyses: A document analysis. Education and Science, 44(198). https://doi.org/10.15390/EB.2019.7665
Lawshe, C. H. (1975). A quantitative approach to content validity. Personnel Psychology, 28(4), 563–575.
Liew, A. (2007). Understanding data, information, knowledge and their inter-relationships. Journal of Knowledge Management Practice, 8(2), 1–16.
López-Meneses, E., Sirignano, F. M., Vázquez-Cano, E., & Ramírez-Hurtado, J. M. (2020). University students' digital competence in three areas of the DigCom 2.1 model: A comparative study at three European universities. Australasian Journal of Educational Technology, 36(3), 69–88. https://doi.org/10.14742/ajet.5583
MacCallum, R. C., Browne, M. W., & Sugawara, H. M. (1996). Power analysis and determination of sample size for covariance structure modeling. Psychological Methods, 1(2), 130–149.
Mahmud, M. M., & Wong, S. F. (2022). Digital age: The importance of 21st century skills among the undergraduates. Frontiers in Education, 7, 950553. https://doi.org/10.3389/feduc.2022.950553
Mandinach, E. B., & Gummer, E. S. (2013). Building educators' data literacy: Differing perspectives. The Journal of Educational Research & Policy Studies, 13(2), 1–5.
Mandinach, E. B., & Gummer, E. S. (2016). What does it mean for teachers to be data literate: Laying out the skills, knowledge, and dispositions. Teaching and Teacher Education, 60, 366–376. https://doi.org/10.1016/j.tate.2016.07.011
Mandinach, E. B., & Schildkamp, K. (2021). Misconceptions about data-based decision making in education: An exploration of the literature. Studies in Educational Evaluation, 69, 100842. https://doi.org/10.1016/j.stueduc.2020.100842
Merriam-Webster. (2024). Data. In Merriam-Webster's collegiate dictionary (12th ed.). https://www.merriam-webster.com/dictionary/data
Nikkhah, M., Heravi-Karimooi, M., Montazeri, A., Rejeh, N., & Sharif Nia, H. (2018). Psychometric properties the Iranian version of older people's quality of life questionnaire (OPQOL). Health and Quality of Life Outcomes, 16, 1–10. https://doi.org/10.1186/s12955-018-0910-y
Öz, S., & Özdemir, A. (2022). Validity and reliability study on the development of data literacy scale for educators. International Journal of Contemporary Educational Research, 9(3), 649–661. https://doi.org/10.33200/ijcer.1079774
Qin, J., & D'Ignazio, J. (2010). Lessons learned from a two-year experience in science data literacy education. In Proceedings of the 31st Annual IATUL Conference. IATUL.
Reisoğlu, İ., & Çebi, A. (2020). How can the digital competences of pre-service teachers be developed? Examining a case study through the lens of DigComp and DigCompEdu. Computers & Education, 156, 103940. https://doi.org/10.1016/j.compedu.2020.103940
Rowe, S., Riggio, M., De Amicis, R., & Rowe, S. R. (2020). Teacher perceptions of training and pedagogical value of cross-reality and sensor data from smart buildings. Education Sciences, 10(9), 234. https://doi.org/10.3390/educsci10090234
Sarmento, R. P., & Costa, V. (2017). Comparative approaches to using R and Python for statistical data analysis. IGI Global. https://doi.org/10.4018/978-1-68318-016-6
Sarmento, R. P., & Costa, V. (2019). Confirmatory factor analysis – A case study [Preprint]. arXiv. https://doi.org/10.48550/arXiv.1905.05598
Schield, M. (2004). Information literacy, statistical literacy and data literacy. IASSIST Quarterly, 28(2/3), 6–11.
Schildkamp, K. (2019). Data-based decision-making for school improvement: Research insights and gaps. Educational Research, 61(3), 257–273. https://doi.org/10.1080/00131881.2019.1625716
Shreiner, T. L., & Guzdial, M. (2022). The information won't just sink in: Helping teachers provide technology-assisted data literacy instruction in social studies. British Journal of Educational Technology, 53(5), 1134–1158. https://doi.org/10.1111/bjet.13255
Small, H. (1973). Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science, 24(4), 265–269. https://doi.org/10.1002/asi.4630240406
Steiger, J. H. (2007). Understanding the limitations of global fit assessment in structural equation modeling. Personality and Individual Differences, 42(5), 893–898. https://doi.org/10.1016/j.paid.2006.09.017
The CERN Council. (2024a). Who we are: Our mission. https://www.home.cern/about/who-we-are/our-mission
The CERN Council. (2024b). Accelerators. https://home.cern/science/accelerators
Trantham, P. S., Sikorski, J., de Ayala, R. J., & Doll, B. (2021). An item response theory and Rasch analysis of the NUDKS: A data literacy scale. Educational Assessment, Evaluation and Accountability, 1–23. https://doi.org/10.1007/s11092-021-09372-w
Uyumaz, G., & Sırgancı, G. (2020). Doğrulayıcı faktör analizi için gerekli örneklem büyüklüğü kaç kişidir? [What is the required sample size for confirmatory factor analysis?]. OPUS International Journal of Society Researches, 16(32), 5302–5340. https://doi.org/10.26466/opus.826895
Vahey, P., Rafanan, K., Patton, C., Swan, K., van't Hooft, M., Kratcoski, A., & Stanford, T. (2012). A cross-disciplinary approach to teaching data literacy and proportionality. Educational Studies in Mathematics, 81, 179–205. https://doi.org/10.1007/s10649-012-9392-z
Vahey, P., Yarnall, L., Patton, C., Zalles, D., & Swan, K. (2006, April). Mathematizing middle school: Results from a cross-disciplinary study of data literacy [Paper presentation]. Annual Meeting of the American Educational Research Association, San Francisco, CA.
Veneziano, L., & Hooper, J. (1997). A method for quantifying content validity of health-related questionnaires. American Journal of Health Behavior, 21(1), 67–70.
Worthington, R. L., & Whittaker, T. A. (2006). Scale development research: A content analysis and recommendations for best practices. The Counseling Psychologist, 34(6), 806–838. https://doi.org/10.1177/0011000006288127

Copyright (c) 2026 Çağrı Demirtaş
This work is licensed under a Creative Commons Attribution 4.0 International License.
Downloads
Article Information
- Article Type Research Articles
- Submitted January 25, 2026
- Accepted February 22, 2026
- Published March 30, 2026
- Issue Vol. 5 No. 1 (2026): Pedagogical Perspective (March)
- Section Research Articles


