Evaluating the user’s experience, adaptivity and learning outcomes of a fuzzy-based intelligent tutoring system for computer programming for academic students in Greece

Chrysafiadi, Konstantina; Virvou, Maria; Tsihrintzis, George A.; Hatzilygeroudis, Ioannis

doi:10.1007/s10639-022-11444-3

Open access
Published: 17 November 2022

Volume 28, pages 6453–6483, (2023)

Education and Information Technologies

Konstantina Chrysafiadi ORCID: orcid.org/0000-0001-8096-1407¹,
Maria Virvou¹,
George A. Tsihrintzis¹ &
…
Ioannis Hatzilygeroudis²

Abstract

In recent years, the enhancement of digital learning with Artificial Intelligence has attracted considerable research, as it enables individualized education that is independent of place and time. This is particularly the case for computer science as a tutoring domain, which is growing and changing rapidly, so that learners frequently need update courses. In this paper, we present a thorough evaluation of a fuzzy-based intelligent tutoring system (ITS) that teaches computer programming. The evaluation concerns multiple aspects of the ITS. The evaluation criteria are: (i) context, (ii) effectiveness, (iii) efficiency, (iv) accuracy, (v) usability and satisfaction, and (vi) engagement and motivation. Students of an undergraduate program in Informatics at the University of Piraeus in Greece participated in the evaluation process. The evaluation method included questionnaires, analysis of log files and experiments. In addition, t-tests were conducted to verify the validity of the evaluation results. The evaluation results are very positive and show that the fuzzy mechanism incorporated into the presented ITS enhances the system with Artificial Intelligence and, through this, increases the learners’ satisfaction and their learning and mastering of new knowledge, improves the recommendation accuracy of the system and the efficacy of interactions, and contributes positively to the learners’ engagement in the learning process.

1 Introduction

Education has benefitted from advances in computer technology. In particular, individual students can participate in a lesson from wherever they are and whenever they can, receiving learning material tailored to their needs. This has been achieved to a large extent through the development of advanced software for computer-assisted learning, such as Intelligent Tutoring Systems (ITSs) (Sáiz-Manzanares et al., 2021; Cho & Kim, 2021; Urdaneta-Ponte et al., 2021; Alonso-Secades et al., 2022). ITSs constitute a special kind of educational software that aims to model the cognitive state and the learning needs of individual students and to provide a personalized learning experience (Akyuz, 2020; Chrysafiadi et al., 2022). They incorporate Artificial Intelligence, which enhances the learning process by tailoring it to each individual learner’s needs (Sotiropoulos et al., 2019; Tsihrintzis et al., 2019, 2021; Virvou et al., 2020). They model the students’ characteristics and needs and imitate the way that a human tutor thinks and reacts during the teaching process (Chrysafiadi & Virvou, 2013a; Clancey & Hoffman, 2021; Khazanchi & Khazanchi, 2021). This is particularly significant in the case of computer science education, in which the learners have heterogeneous backgrounds, characteristics and needs. Moreover, according to Nesbit et al. (2014), ITSs have a significant advantage over teacher-led classroom instruction and over computer-based instruction that is not based on intelligent techniques.

The main aim of an ITS is to provide a student-oriented learning process that helps learners acquire knowledge and accomplish the learning goal (Polson & Richardson, 2013; Erümit & Çetin, 2020; Paladines & Ramírez, 2020). To achieve this, it has to be able to (i) recognize the learner’s knowledge level, misconceptions and learning needs, (ii) provide lessons and feedback that are tailored to each individual learner’s needs, and (iii) create positive feelings in the student and motivate her/him to participate in the learning process (Graesser et al., 2018). Therefore, the success of an ITS depends on several factors (Kulik & Fletcher, 2016; Mousavinasab et al., 2021; Feng et al., 2021). Consequently, the evaluation of an ITS has to include usability evaluation (Chughtai et al., 2015; Chrysafiadi & Virvou, 2021a; Wang et al., 2021), learning outcomes evaluation (Hosseini et al., 2020; Rebolledo-Mendez et al., 2022; Binh & Trung, 2021; Chrysafiadi & Virvou, 2021b), and evaluation of the validity of the system’s student modeling and recommendations (Chrysafiadi & Virvou, 2013b; Sosnovsky & Brusilovsky, 2015; Effenberger & Pelánek, 2021).

In view of the above, in this paper we present a thorough evaluation of a fuzzy-based ITS that teaches computer programming. The aim is to examine how useful and effective the system is in terms of the learning process and how the educational process benefits from it. Therefore, this research seeks to answer the following questions:

How helpful is the system in the learning process?
Does the system contribute to the acquisition of new knowledge?
How efficient is the system with respect to the number of interactions needed to achieve the learning goal?
How accurate are the system’s recommendations?
How usable and pleasant is the system?
How does the system affect the students’ engagement in the learning process?

To answer the above questions, a thorough evaluation of the system was conducted. For the evaluation, we combined two frameworks that were developed for evaluating educational software: the CIAO! framework (Jones et al., 1999) and the evaluation framework proposed by Lynch and Ghergulescu (2016). In this way, we are able to assess multiple aspects of the tutoring system, including its intelligent features as well as the necessary educational aspects. The evaluation process was based on the participation of 140 learners who attended an undergraduate program in Informatics at the University of Piraeus, Greece. Questionnaires and experiments were used for the evaluation.

The remainder of this paper is organized as follows. In Section 2, we present background knowledge about ITS evaluation. In Section 3, we present the theoretical framework and the methodology of the research. In Section 4, we present the fuzzy-based ITS that was evaluated. In Section 5, we describe the evaluation method, testbed and results. In Section 6, we discuss the evaluation results and present the implications of the research. Finally, in Section 7, we draw conclusions from this work.

2 Related work

The evaluation of an ITS is crucial to its acceptance and contribution to the learning process. The evaluation criteria of most ITSs include usability, learners’ performance and learning outputs. However, a thorough evaluation should include additional criteria, like accuracy, precision, sensitivity, adaptivity, reliability, recognition rate, usability, and mean square error (MSE) (Lampropoulou et al., 2010; Mousavinasab et al., 2021). Furthermore, the most common techniques for ITS evaluation are observations, questionnaires, and experiments. According to Greer and Mark (2016), experiments are ideal for ITS evaluation because they enable researchers to examine relationships between teaching interventions and student-related teaching outcomes, and to obtain quantitative measures of the significance of such relationships.
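As a reminder of what some of the criteria listed above measure, the following minimal sketch computes accuracy, precision, sensitivity and MSE from their standard textbook definitions. It does not reproduce any of the cited evaluations; the function names and the sample data are hypothetical and chosen only for illustration.

```python
def classification_metrics(predicted, actual):
    """Accuracy, precision and sensitivity (recall) for boolean labels."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)          # true positives
    tn = sum(1 for p, a in pairs if not p and not a)  # true negatives
    fp = sum(1 for p, a in pairs if p and not a)      # false positives
    fn = sum(1 for p, a in pairs if not p and a)      # false negatives
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,
    }


def mean_squared_error(predicted, actual):
    """Mean of the squared differences between predicted and actual values."""
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical data: did the system recommend revising a concept (predicted)
# versus did the learner actually need the revision (actual)?
recommended = [True, True, False, False, True]
needed = [True, False, False, True, True]
print(classification_metrics(recommended, needed))

# Hypothetical data: system-estimated grades versus teacher-assigned grades.
print(mean_squared_error([7.0, 5.0, 9.0], [8.0, 5.0, 7.0]))
```

Which of these measures is appropriate depends on what the ITS predicts: the classification measures suit yes/no recommendations, while MSE suits numeric estimates such as grades.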

The recent literature describes a variety of ITSs that have been evaluated through experiments and questionnaires. Wambsganss et al. (2020) used a questionnaire with 38 items to evaluate the usability, usefulness, adaptivity and effectiveness of an adaptive dialog-based tutoring system for argumentation skills. Similarly, Wang et al. (2021) used questionnaires to evaluate the usability of an affective mobile tutoring system and the satisfaction of its users. On the other hand, an experimental evaluation, which involved administering a pre-test and a post-test and comparing their results, was used to assess how a tutoring system that teaches algebra contributes to the students’ performance (VanLehn et al., 2020). A similar experimental evaluation was used by Singh et al. (2022) to evaluate a custom-tailored tutoring system called SeisTutor. In particular, the pre-test and post-test method and questionnaires were used to evaluate the system according to the four phases of the Kirkpatrick model (Kirkpatrick, 1994), which are: (i) evaluation of reaction, (ii) evaluation of learning, (iii) evaluation of behaviour, and (iv) evaluation of results. However, this model was created to evaluate traditional tutoring systems and programs. It does not evaluate specific characteristics of an adaptive e-learning tutoring system, such as accuracy of recommendations, usability, usefulness and interactions. Furthermore, Eryılmaz and Adabashi (2020) present an experimental study that evaluates the effectiveness of an intelligent tutoring system which embeds artificial intelligence methods to support higher student academic performance. They compared the developed tutoring system with other versions of it and used a t-test to compare the academic performance of the students who used each system.
Moreover, a pilot study was conducted to evaluate the ability of an intelligent team tutoring system to provide feedback that positively influences team behaviour and improves team task performance (Ostrander et al., 2020). Two groups of 16 participants took part in the study, which included measuring performance and comparing it through a statistical t-test, as well as a self-assessment survey conducted through a questionnaire. Also, Kochmar et al. (2020) conducted an experiment to measure the students’ learning gain and to check whether it is improved by a tutoring system that uses machine learning to provide automated personalised feedback. Another experiment, which concerned the use of an intelligent tutoring system called WinITS by students of Hanoi National University, was described by Binh and Trung (2021). The aim of the experiment was to evaluate the learning effectiveness of a proposed student model that is based on learning styles. After using the tutoring system, the participants completed a final test that evaluated their performance and the time they needed to finish the test. The results were compared with the corresponding results of a group of students who did not use WinITS. In addition, the students who used WinITS completed a questionnaire to evaluate the effect of the system’s adaptation to students.

Taking the above into account, we conclude that the most commonly used methods for evaluating an ITS are questionnaires and experiments. Performance is the most frequently evaluated metric in the experiments. Other commonly evaluated metrics are users’ satisfaction and the system’s usability. Furthermore, experiments usually measure performance through a pre-test and a post-test and use the statistical t-test to compare the measurements of groups that used different versions of the evaluated system. However, the literature offers no widely approved evaluation framework and technique for the assessment of an ITS, especially since ITSs need to be evaluated with respect to their intelligent features as well as their educational effectiveness and usability. Therefore, after a thorough investigation of the literature, we decided to evaluate the fuzzy-based ITS following well-known and accepted evaluation methodologies: the CIAO! framework (Jones et al., 1999) and the evaluation framework proposed by Lynch and Ghergulescu (2016). We chose these frameworks because the CIAO! framework was developed especially for the evaluation of general educational aspects of computer-assisted learning systems, while the Lynch and Ghergulescu framework concerns the evaluation of adaptive and intelligent learning systems. Therefore, the combination of these two evaluation frameworks is well suited to a thorough evaluation that assesses multiple aspects of the ITS.
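The group comparison described above can be sketched as follows. This is only an illustration under stated assumptions: the reviewed studies do not say which t-test variant they used, so Welch's unequal-variance form is chosen here for concreteness, the helper name `welch_t` is hypothetical, and the scores are invented rather than taken from any study.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t statistic, approximate degrees of freedom) for Welch's t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error of each group mean (sample variance uses n - 1).
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    t = (mean(sample_a) - mean(sample_b)) / sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


# Invented post-test scores for two groups that used different system versions.
adaptive_version = [8.0, 9.0, 7.0, 8.0, 9.0, 8.0]
baseline_version = [6.0, 7.0, 6.0, 5.0, 7.0, 6.0]

t_stat, dof = welch_t(adaptive_version, baseline_version)
print(f"t = {t_stat:.3f}, df = {dof:.1f}")
```

The resulting statistic is then compared against the critical value of the t distribution for the computed degrees of freedom, or converted to a p-value with a statistics library, to decide whether the difference between the groups is significant.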

3 Theoretical framework and methodology

The fuzzy-based tutoring system embeds intelligent techniques for supporting the learning process. Therefore, its evaluation has to include both aspects that concern educational software in general and aspects that concern its intelligent operation. To achieve this, we combined two evaluation frameworks: the CIAO! framework (Jones et al., 1999), which evaluates general aspects of a computer-assisted learning (CAL) system, and the evaluation framework proposed by Lynch and Ghergulescu (2016), which evaluates aspects of adaptive and intelligent learning systems.

According to the CIAO! framework, the following three dimensions of a CAL system have to be evaluated:

1. The CAL aim and its context of use. This dimension is assessed through questionnaires, interviews and analysis of policy documents.
2. Interactions: Data that concern the learners’ interaction with the CAL system. These data are gathered, measured, and analyzed through observations, audio and/or video recording, interaction recording and log files.
3. Attitudes and outcomes: Learning outcomes, students’ performance and changes in students’ perceptions and attitudes. For the evaluation of this dimension, questionnaires, interviews, and tests are used.

According to the evaluation framework of Lynch and Ghergulescu, the following four criteria have to be assessed:

1. Learning and training: It concerns factors related to effectiveness, such as learning outcomes, knowledge acquisition and learning improvements, and factors related to efficiency, such as the number and duration of interactions needed to achieve the learning goal.
2. System: It concerns factors related to the accuracy of the student model and of the system’s recommendations, such as how accurate the system’s grading is compared with grading by human teachers, and how accurate its predictions and feedback are.
3. User experience: It concerns the system’s usability and the learners’ satisfaction.
4. Affective: It concerns the learners’ motivation and engagement in the learning process.

The combination of these two frameworks spans the generic aspects that should be evaluated in educational software that has the features of an Intelligent Tutoring System. From the combination of these evaluation frameworks, six evaluation criteria have arisen, namely (i) context, (ii) effectiveness, (iii) efficiency, (iv) accuracy, (v) usability and satisfaction, and (vi) engagement and motivation. In this way, we are able to assess multiple aspects of the tutoring system, including its intelligent features as well as the necessary educational aspects. Table 1 presents the criteria of our evaluation model, how they are mapped to the CIAO! and the Lynch and Ghergulescu evaluation frameworks, their metrics and the method that was chosen to evaluate them.
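One way to picture how the six criteria relate to the two frameworks is the small lookup structure sketched below. It records only the links that can be read directly from the descriptions in this section (context from the first CIAO! dimension, the other five from the four Lynch and Ghergulescu criteria); it is an assumption-laden simplification, not a reproduction of Table 1, and it omits any further mapping to the CIAO! interactions and outcomes dimensions as well as the metrics and methods. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Criterion:
    name: str       # one of the six criteria of the combined evaluation model
    framework: str  # framework the criterion is read from in this section
    origin: str     # dimension or criterion of that framework


# Links as read from the prose above; a simplification, not Table 1 itself.
CRITERIA = (
    Criterion("context", "CIAO!", "aim and context of use"),
    Criterion("effectiveness", "Lynch and Ghergulescu", "learning and training"),
    Criterion("efficiency", "Lynch and Ghergulescu", "learning and training"),
    Criterion("accuracy", "Lynch and Ghergulescu", "system"),
    Criterion("usability and satisfaction", "Lynch and Ghergulescu", "user experience"),
    Criterion("engagement and motivation", "Lynch and Ghergulescu", "affective"),
)


def by_framework(criteria):
    """Group criterion names by the framework they are read from."""
    grouped = {}
    for criterion in criteria:
        grouped.setdefault(criterion.framework, []).append(criterion.name)
    return grouped


for framework, names in by_framework(CRITERIA).items():
    print(f"{framework}: {', '.join(names)}")
```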

Table 1 Fuzzy sets: linguistic values and trapezoidal membership functions
