The Promise of MOOCs Revisited? Demographics of Learners Preparing for University
DOI:
https://doi.org/10.18608/jla.2023.7807Keywords:
distance education, massive open online courses, learning analytics, learning design, MOOCs, research paperAbstract
This paper leverages cluster analysis to provide insight into how traditionally underrepresented learners engage with entry-level massive open online courses (MOOCs) intended to lower the barrier to university enrolment, produced by a major research university in the United States. From an initial sample of 260,239 learners, we cluster analyze a subset of data from 29,083 participants who submitted an assignment in one of nine entry-level MOOC courses. Manhattan distance and Gower distance measures are computed based on engagement, achievement, and demographic data. To our knowledge, this marks one of the first such uses of Gower distance to cluster mixed-variable data to explore fairness and equity in the MOOC literature. The clusters are derived from CLARA and PAM algorithms, enriched by demographic data, with a particular focus on education level, as well as approximated socioeconomic status (SES) for a smaller subset of learners. Results indicate that learners without a college degree are more likely to be high-performing compared to college-educated learners. Learners from lower SES backgrounds are just as likely to be successful as learners from middle and higher SES backgrounds. While MOOCs have struggled to improve access to learning, more fair and equitable outcomes for traditionally underrepresented learners are possible.
References
Anderson, A., Huttenlocher, D., Kleinberg, J., & Leskovec, J. (2014). Engaging with massive online courses. Proceedings of the 23rd International Conference on World Wide Web (WWW ’14), 7–11 April 2014, Seoul, Republic of Korea (pp. 687–698). ACM Press. https://doi.org/10.1145/2566486.2568042
Arora, S., Goel, M., Sabitha, A. S., & Mehrotra, D. (2017). Learner groups in massive open online courses. American Journal of Distance Education, 31(2), 80–97. https://doi.org/10.1080/08923647.2017.1300461
Brooks, C., Thompson, C., & Teasley, S. (2015). Who you are or what you do: Comparing the predictive power of demographics vs. activity patterns in massive open online courses (MOOCs). Proceedings of the 2nd ACM Conference on Learning @ Scale (L@S 2015), 14–18 March 2015, Vancouver, BC, Canada (pp. 245–248). ACM Press. https://doi.org/10.1145/2724660.2728668
Carnegie Classifications of Institutions of Higher Education. (2017). Basic classification description. https://carnegieclassifications.iu.edu/classification_descriptions/basic.php
Charrad, M., Ghazzali, N., Boiteau, V., & Niknafs, A. (2014). NbClust: An R package for determining the relevant number of clusters in a data set. Journal of Statistical Software, 61(6), 1–36. https://doi.org/10.18637/jss.v061.i06
Chen, B., Håklev, S., Harrison, L., Najafi, H., & Rolheiser, C. (2015). How do MOOC learners’ intentions relate to their behaviors and overall outcomes? Proceedings of the American Educational Research Association Annual Conference (AERA 2015), 16–20 April 2015, Chicago, IL, USA. https://www.researchgate.net/publication/316605290_How_do_MOOC_learners'_intentions_relate_to_their_behaviors_and_overall_outcomes#fullTextFileContent
Chetty, R., Friedman, J. N., Saez, E., Turner, N., & Yagan, D. (2017). Mobility report cards: The role of colleges in intergenerational mobility (No. w23618). National Bureau of Economic Research. https://doi.org/10.3386/w23618
Deng, R., Benckendorff, P., & Gannaway, D. (2017). Understanding learning and teaching in MOOCs from the perspectives of students and instructors: A review of literature from 2014 to 2016. Proceedings of the 5th European MOOCs Stakeholders Summit (EMOOCs 2017), 22–26 May 2017, Madrid, Spain (pp. 176–181). Springer. https://doi.org/10.1007/978-3-319-59044-8_20
Deng, R., Benckendorff, P., & Gannaway, D. (2019). Progress and new directions for teaching and learning in MOOCs. Computers & Education, 129, 48–60. https://doi.org/10.1016/j.compedu.2018.10.019
Dillahunt, T. R., Wang, B. Z., & Teasley, S. (2014). Democratizing higher education: Exploring MOOC use among those who cannot afford a formal education. The International Review of Research in Open and Distributed Learning, 15(5), 177–196. https://doi.org/10.19173/irrodl.v15i5.1841
Dowle, M., Srinivasan, A., Gorecki, J., Chirico, M., Stetsenko, P., Short, T., Lianoglou, S., Antoyan, E., Bonsch, M., Parsonage, H., Ritchie, S., Ren, K., Tan, X., Saporta, R., Seiskari, O., Dong, X., Lang, M., Iwasaki, W., Wenchel, S., ... Schwen, B. (2022). Extension of ‘data.frame.’ https://cran.r-project.org/web/packages/data.table/data.table.pdf
Ebbert, D., & Dutke, S. (2020). Patterns in students’ usage of lecture recordings: A cluster analysis of self-report data. Research in Learning Technology, 28. https://doi.org/10.25304/rlt.v28.2258
Engle, D., Mankoff, C., & Carbrey, J. (2015). Coursera’s introductory human physiology course: Factors that characterize successful completion of a MOOC. The International Review of Research in Open and Distributed Learning, 16(2), 46–68. https://doi.org/10.19173/irrodl.v16i2.2010
Escobari, M., Seyal, I., & Meaney, M. J. (2019). Realism about reskilling: Upgrading the career prospects of America’s low-wage workers. The Brookings Institution. https://www.brookings.edu/research/realism-about-reskilling/
Ferguson, R., & Clow, D. (2015). Examining engagement: Analysing learner subpopulations in massive open online courses (MOOCs). Proceedings of the 5th International Conference on Learning Analytics and Knowledge (LAK ʼ15), 16–20 March 2015, Poughkeepsie, NY, USA (pp. 51–58). ACM Press. https://doi.org/10.1145/2723576.2723606
Ferguson, R., Clow, D., Beale, R., Cooper, A. J., Morris, N., Bayne, S., & Woodgate, A. (2015). Moving through MOOCS: Pedagogy, learning design and patterns of engagement. Proceedings of the 10th European Conference on Technology Enhanced Learning: Design for Teaching and Learning in a Networked World (EC-TEL 2015), 15–17 September 2015, Toledo, Spain (pp. 70–84). Lecture Notes in Computer Science, vol. 9307. Springer. https://doi.org/10.1007/978-3-319-24258-3_6
Ganelin, D., & Chuang, I. (2019). IP geolocation underestimates regressive economic patterns in MOOC usage. Proceedings of the 2019 11th International Conference on Education Technology and Computers (ICETC 2019), 28–31 October 2019, Amsterdam, Netherlands (pp. 268–272). ACM Press. https://doi.org/10.1145/3369255.3369301
Gardner, J., & Brooks, C. (2018). Student success prediction in MOOCs. User Modeling and User-Adapted Interaction, 28, 127–203. https://doi.org/10.1007/s11257-018-9203-z
Goldberg, L. R., Bell, E., King, C., O’Mara, C., McInerney, F., Robinson, A., & Vickers, J. (2015). Relationship between participants’ level of education and engagement in their completion of the Understanding Dementia massive open online course. BMC medical education, 15, 60. https://doi.org/10.1186/s12909-015-0344-z
Gower, J. C. (1971). A general coefficient of similarity and some of its properties. Biometrics, 27(4), 857–871. https://doi.org/10.2307/2528823
Greene, J. A., Oswald, C. A., & Pomerantz, J. (2015). Predictors of retention and achievement in a massive open online course. American Educational Research Journal, 52(5), 925–955. https://doi.org/10.3102/0002831215584621
Guo, P. J., & Reinecke, K. (2014). Demographic differences in how students navigate through MOOCs. Proceedings of the 1st ACM Conference on Learning @ Scale (L@S 2014), 4–5 March 2014, Atlanta, GA, USA (pp. 21–30). ACM Press. https://doi.org/10.1145/2556325.2566247
Hansen, J. D., & Reich, J. (2015). Democratizing education? Examining access and usage patterns in massive open online courses. Science, 350(6265), 1245–1248. https://doi.org/10.1126/science.aab3782
Jang, M., & Vorderstrasse, A. (2019). Socioeconomic status and racial or ethnic differences in participation: Web-based survey. JMIR Research Protocols, 8(4), e11865. https://www.researchprotocols.org/2019/4/e11865/
Joksimović, S., Kovanović, V., & Dawson, S. (2019). The journey of learning analytics. HERDSA Review of Higher Education, 6, 37–63. https://www.herdsa.org.au/herdsa-review-higher-education-vol-6/37-63
Joksimović, S., Poquet, O., Kovanović, V., Dowell, N., Mills, C., Gašević, D., Dawson, S., Graesser, A. C., & Brooks, C. (2018). How do we model learning at scale? A systematic review of research on MOOCs. Review of Educational Research, 88(1), 43–86. https://doi.org/10.3102/0034654317740335
Jordan, K. (2014). Initial trends in enrolment and completion of massive open online courses. The International Review of Research in Open and Distributed Learning, 15(1), 133–160. https://doi.org/10.19173/irrodl.v15i1.1651
Kassambara, A. (2017). Practical guide to cluster analysis in R: Unsupervised machine learning. CreateSpace Independent Publishing Platform.
Kassambara, A., & Mundt, F. (2016). Package ‘factoextra’: Extract and visualize the results of multivariate data analyses. https://cran.microsoft.com/snapshot/2016-11-30/web/packages/factoextra/factoextra.pdf
Kaufman, L., & Rousseeuw, P. J. (1990). Finding groups in data: An introduction to cluster analysis. John Wiley & Sons. https://onlinelibrary.wiley.com/doi/book/10.1002/9780470316801
Khalil, M., & Ebner, M. (2017). Clustering patterns of engagement in massive open online courses (MOOCs): The use of learning analytics to reveal student categories. Journal of Computing in Higher Education, 29(1), 114–132. https://doi.org/10.1007/s12528-016-9126-9
Kizilcec, R. F., & Halawa, S. (2015). Attrition and achievement gaps in online learning. Proceedings of the 2nd ACM Conference on Learning @ Scale (L@S 2015), 14–18 March 2015, Vancouver, BC, Canada (pp. 57–66). ACM Press. https://doi.org/10.1145/2724660.2724680
Kizilcec, R. F., Piech, C., & Schneider, E. (2013). Deconstructing disengagement: Analyzing learner subpopulations in massive open online courses. Proceedings of the 3rd International Conference on Learning Analytics and Knowledge (LAK ’13), 8–12 April 2013, Leuven, Belgium (pp. 170–179). ACM Press. https://doi.org/10.1145/2460296.2460330
Kizilcec, R. F., & Schneider, E. (2015). Motivation as a lens to understand online learners: Toward data-driven design with the OLEI scale. ACM Transactions on Computer–Human Interaction, 22(2), 1–24. https://doi.org/10.1145/2699735
Kovanović, V., Joksimović, S., Gašević, D., Owers, J., Scott, A.-M., & Woodgate, A. (2016). Profiling MOOC course returners: How does student behaviour change between two course enrollments? Proceedings of the 3rd ACM Conference on Learning @ Scale (L@S 2016), 25–28 April 2016, Edinburgh, Scotland (pp. 269–272). ACM Press. https://doi.org/10.1145/2876034.2893431
Kravvaris, D., Kermanidis, K. L., & Ntanis, G. (2016). How MOOCs link with social media. Journal of the Knowledge Economy, 7, 461–487. https://doi.org/10.1007/s13132-014-0219-2
Lambert, S. R. (2020). Do MOOCs contribute to student equity and social inclusion? A systematic review 2014–18. Computers & Education, 145, 103693. https://doi.org/10.1016/j.compedu.2019.103693
Li, Q., & Baker, R. (2018). The different relationships between engagement and outcomes across participant subgroups in massive open online courses. Computers & Education, 127, 41–65. https://doi.org/10.1016/j.compedu.2018.08.005
Lodder, P. (2014). To impute or not impute: That’s the question. In G. J. Mellenbergh & H. J. Adèr (Eds.), Advising on research methods: Selected topics 2013. Johannes van Kessel Publishing. https://www.paultwin.com/wp-content/uploads/Lodder_1140873_Paper_Imputation.pdf
Long, J. S. (1997). Regression models for categorical and limited dependent variables. Sage Publications. https://us.sagepub.com/en-us/nam/regression-models-for-categorical-and-limited-dependent-variables/book6071
Loohach, R., & Garg, K. (2012). Effect of distance functions on k-means clustering algorithm. International Journal of Computer Applications, 49(6), 7–9. https://doi.org/10.5120/7629-0698
Maechler, M., Rousseeuw, P., Struyf, A., Hubert, M., Hornik, K., Studer, M., Roudier, P., Gonzalez, J., Kozlowski, K., Schubert, E., & Murphy, K. (2022). “Finding groups in data”: Cluster analysis extended. http://mirrors.ucr.ac.cr/CRAN/web/packages/cluster/cluster.pdf
Martin, D. (2016). Clustering mixed data types in R. https://dpmartin42.github.io/posts/r/cluster-mixed-types
Meaney, M. (2021). Essays on the design of inclusive learning in Massive Open Online Courses, and implications for educational futures [Unpublished doctoral dissertation]. University of Cambridge. https://doi.org/10.17863/CAM.76128
Meaney, M. J., & Fikes, T. (2019). Early-adopter iteration bias and research-praxis bias in virtual learning environments. Companion Proceedings of the 9th International Conference on Learning Analytics and Knowledge (LAK ’19), 4–8 March 2019, Tempe, Arizona (pp. 513–519). Society for Learning Analytics Research (SoLAR). https://www.solaresearch.org/wp-content/uploads/2019/08/LAK19_Companion_Proceedings.pdf
Meaney, M., & Fikes, T. (2022). Adding a demographic lens to cluster analysis of participants in entry-level massive open online courses (MOOCs). Proceedings of the 9th ACM Conference on Learning @ Scale (L@S 2022), 1–3 June 2022, New York, NY, USA (pp. 355–359). ACM Press. https://doi.org/10.1145/3491140.3528306
R Core Team. (2019). R: A language and environment for statistical computing. The R Foundation for Statistical Computing. https://www.R-project.org/
Ramesh, A., Goldwasser, D., Huang, B., Daume III, H., & Getoor, L. (2014). Learning latent engagement patterns of students in online courses. Proceedings of the 28th Conference on Artificial Intelligence (AAAI-18), 27–31 July 2014, Québec City, Québec, Canada (pp. 1272–1278). AAAI Press. https://doi.org/10.1609/aaai.v28i1.8920
Reich, J., & Ruipérez-Valiente, J. A. (2019). The MOOC pivot. Science, 363(6423), 130–131. https://doi.org/10.1126/science.aav7958
Rohs, M., & Ganz, M. (2015). MOOCs and the claim of education for all: A disillusion by empirical data. The International Review of Research in Open and Distributed Learning, 16(6), 1–19. https://doi.org/10.19173/irrodl.v16i6.2033
Savje, F. (2021). Distances. GitHub, Inc. https://github.com/fsavje/distances
Schubert, E., & Rousseeuw, P. J. (2019). Faster k-medoids clustering: Improving the PAM, CLARA, and CLARANS algorithms. Proceedings of the 12th International Conference on Similarity Search and Applications (SISAP 2019), 2–4 October 2019, Newark, NJ, USA (pp. 171–187). Springer. https://doi.org/10.1007/978-3-030-32047-8_16
Si, Y., & Reiter, J. P. (2013). Nonparametric Bayesian multiple imputation for incomplete categorical variables in large-scale assessment surveys. Journal of Educational and Behavioral Statistics, 38(5), 499–521. https://doi.org/10.3102/1076998613480394
Stack Exchange. (2014, April 26). How should I interpret GAP statistic? [Online forum post]. https://stats.stackexchange.com/questions/95290/how-should-i-interpret-gap-statistic/210704
Stich, A. E., & Reeves, T. D. (2017). Massive open online courses and underserved students in the USA. The Internet and Higher Education, 32, 58–71. https://doi.org/10.1016/j.iheduc.2016.09.001
Tibshirani, R. (2013). Clustering 1: K-means, K-medoids. https://www.stat.cmu.edu/~ryantibs/datamining/lectures/04-clus1.pdf
Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411–423. https://doi.org/10.1111/1467-9868.00293
Torres-Reyna, O. (2012). Getting started in Logit and ordered Logit regression. Princeton University. http://dss.princeton.edu/training/Logit.pdf
Wang, Y., Fikes, T. G., & Pettyjohn, P. (2018). Open scale courses: Exploring access and opportunity for less-educated learners. Proceedings of the 2018 Learning with MOOCS Conference (LWMOOCS ’18), 26–28 September 2018, Madrid, Spain (pp. 102–105). IEEE. https://doi.org/10.1109/LWMOOCS.2018.8534667
Wickham, H. (2022). Easily install and load the ‘tidyverse’ [R package tidyverse version 1.3.2]. Comprehensive R Archive Network (CRAN). https://cran.r-project.org/web/packages/tidyverse/index.html
Zamecnik, A., Kovanović, V., Joksimović, S., & Liu, L. (2022). Exploring non-traditional learner motivations and characteristics in online learning: A learner profile study. Computers and Education: Artificial Intelligence, 3, 100051. https://doi.org/10.1016/j.caeai.2022.100051
Zhang, Q., Bonafini, F. C., Lockee, B. B., Jablokow, K. W., & Hu, X. (2019). Exploring demographics and students’ motivation as predictors of completion of a massive open online course. The International Review of Research in Open and Distributed Learning, 20(2), 140–161. https://doi.org/10.19173/irrodl.v20i2.3730
Zhenghao, C., Alcorn, B., Christensen, G., Eriksson, N., Koller, D., & Emanuel, E. J. (2015, September 22). Who’s benefiting from MOOCs, and why. Harvard Business Review. https://hbr.org/2015/09/whos-benefiting-from-moocs-and-why
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Journal of Learning Analytics
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
TEST