Using cluster analysis for data mining in educational technology research

Pavlo D. Antonenko, Serkan Toy, Dale S. Niederhauser

Research output: Contribution to journalArticlepeer-review

64 Scopus citations


Cluster analysis is a group of statistical methods that has great potential for analyzing the vast amounts of web server-log data to understand student learning from hyperlinked information resources. In this methodological paper we provide an introduction to cluster analysis for educational technology researchers and illustrate its use through two examples of mining click-stream server-log data that reflects student use of online learning environments. Cluster analysis can be used to help researchers develop profiles that are grounded in learner activity-like sequence for accessing tasks and information, or time spent engaged in a given activity or examining resources-during a learning session. The examples in this paper illustrate the use of a hierarchical clustering method (Ward's clustering) and a non-hierarchical clustering method (k-Means clustering) to analyze characteristics of learning behavior while learners engage in a problem-solving activity in an online learning environment. A discussion of advantages and limitations of using cluster analysis as a data mining technique in educational technology research concludes the article.

Original languageEnglish (US)
Pages (from-to)383-398
Number of pages16
JournalEducational Technology Research and Development
Issue number3
StatePublished - Jun 2012
Externally publishedYes


  • Cluster analysis
  • Data mining
  • Learner profiles
  • Online learning environments
  • Server logs

ASJC Scopus subject areas

  • Education


Dive into the research topics of 'Using cluster analysis for data mining in educational technology research'. Together they form a unique fingerprint.

Cite this