Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Classroom network structure learning engagement and parallel temporal attention LSTM based knowledge tracing

  • Zhaoyu Shou,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing

    Affiliations School of Information and Communication, Guilin University of Electronic Technology, Guilin, China, Guangxi Wireless Broadband Communication and Signal Processing Key Laboratory, Guilin University of Electronic Technology, Guilin, China

  • Yihong Li,

    Roles Conceptualization, Data curation, Methodology, Software, Visualization, Writing – original draft

    Affiliation School of Information and Communication, Guilin University of Electronic Technology, Guilin, China

  • Dongxu Li ,

    Roles Formal analysis, Investigation, Validation

    ldx@mails.guet.edu.cn

    Affiliation School of Information and Communication, Guilin University of Electronic Technology, Guilin, China

  • Jianwen Mo,

    Roles Validation

    Affiliation School of Information and Communication, Guilin University of Electronic Technology, Guilin, China

  • Huibing Zhang

    Roles Validation

    Affiliation School of Computer and Information Security, Guilin University of Electronic Technology, Guilin, China

Abstract

In order to accurately assess the students’ learning process and the cognitive state of knowledge points in smart classroom. A classroom network structure learning engagement and parallel temporal attention LSTM based knowledge tracing model (CL-PTKT) is proposed in this paper. First, a classroom network is constructed based on the information of student ID, seating relationship, and head-up/head-down state obtained from the smart classroom video. Second, a learning engagement model is established by utilizing the head-up/head-down state of students and the structural characteristics of the classroom network. Finally, in this paper innovatively proposes a parallel temporal attention LSTM feature tracking algorithm based on the learning engagement model and the knowledge-exercise data. It can fully considers the potential associated attributes of the knowledge-knowledge, knowledge-exercise and knowledge-learning engagement. And accurately characterizes the knowledge state during the knowledge point’s lecture time. To provide effective support for teachers to make seat adjustments and accurate interventions for in the teaching process. This paper conducts extensive experiments under four real datasets. The algorithm in this paper shows optimal performance in all four evaluation metrics compared to the state-of-the-art knowledge tracing models.

Introduction

Improving learning effect is the eternal theme of education, and measuring learning outcomes can be analyzed in terms of cognitive engagement, affective engagement, and behavioral engagement. In the classification of educational objectives, Bloom separates cognition into a module, which indicates that cognition has an important place in education. Meanwhile, the research on quantitatively assessing learners’ knowledge of knowledge points is becoming increasingly important in the field of smart education [1]. A large number of researchers have analyzed the cognitive state of students in terms of the knowledge tracing model. Masaki Uto et al. [2] proposed to identify the extent of students’ knowledge based on their answers of questions with certain characteristics such as difficulty and differentiation. This study provides new ideas for cognitive assessment of students. To address the problem of inadequate extraction of dependency features of knowledge points in knowledge tracing models, Haowen Yang et al. [3] proposed a novel quantitative relationship neural network for explainable cognitive diagnosis model (QRCDM). It utilizes explicit and implicit correlations between exercises and relating knowledge concepts to calculate students’ mistakes and guesses. In this way, it improves the utilization of attribute relations and enhances the explainability of the model. For the lack of contextual feature extraction for exercise-response in existing models. Cui et al. [4] proposed a multi-relational transformer for knowledge tracing (MRT-KT). Modeling fine-grained interactions between exercises and responses.

However, most existing knowledge tracking models are limited to analyzing students’ knowledge state based on their historical test information on online education platforms. The analysis was based primarily on student test data. At the same time, offline teaching methods are still the dominant mode of education at present. In a smart classroom, students’ cognitive states are influenced by multiple and complex factors. These factors include learning engagement in the learning process, interaction and cooperation among students [5]. Liu et al. [6] suggested that there is an interactive relationship between affective and cognitive inputs, which is important for predicting academic performance. And students’ affective engagement can be mapped to students’ performance of learning behaviors in the classroom. Therefore, existing knowledge tracing models when applied to complex smart classroom environments, it will expose the limitations of its single-dimensional data analysis. Therefore, taking into account the characteristics of students’ learning engagement in smart classroom, it can characterize students’ cognitive state more accurately. And can be better understanding and promote of students’ learning processes.

The learning engagement reflects the degree of attention and participation in learning tasks by students. It reflects the learning state of students to a certain extent [7]. Delgado et al. [8] used deep convolutional neural networks and head pose estimation methods to analyze students’ attentional state (pitch and yaw angle). To obtain students’ learning engagement. Meanwhile, students are easily influenced by the students around them during classroom learning. And this implicit impact is often reflected in students’ classroom learning status and academic performance. Yang et al. [9] used questionnaires and case study methodology, It was shown through a two-way repeated measures analysis of variance (ANOVA) that adjusting students’ seating affects students’ learning engagement in the classroom. Putnik et al. [10] found a significant correlation between students’ academic performance and social network structural characteristics. By analyzing mutual support relationships between students, van Rijsewijk et al.[11] found an association between academic achievement and an unbalanced distribution of classroom helping relationships. Gutierrez et al.[12] found that effective seating arrangements could promote social interaction and classroom participation. And thus improved student learning outcomes. Li et al.[13] proposed a model of emotional transmission determined by students’ facial expressions of one kind. Experimental analysis showed the existence of learning emotions transmitted from student to student in classroom networks. This would affected the individual’s learning.

To address the difficulty of existing knowledge tracing models to accurately characterize students’ knowledge state in complex smart classroom. This paper is based on classroom network structure learning engagement and knowledge point test data. A knowledge tracing model based on classroom network structure learning engagement and parallel temporal attention LSTM (CL-PTKT). The parallel temporal attention LSTM algorithm is used for feature tracking. It can fully consider the potential associated attributes of the knowledge–knowledge, knowledge–exercise and knowledge–learning engagement. To achieve accurate characterization of learners’ cognitive state under smart classroom.

This paper consists of the following main contributions:

  1. A learning engagement model utilizing students’ head-up/head-down state and the classroom network structure characteristics is proposed.
  2. A parallel temporal attention LSTM feature tracking algorithm is proposed, which integrates classroom network structure learning engagement and knowledge–exercise data to analyze cognitive state so that the limitations of single-dimensional analysis can be avoided. Meanwhile, the extraction of key feature information is enhanced by the temporal attention mechanism, which can characterize the learner’s cognitive state more accurately.
  3. A large number of experiments were carried out on four real datasets, and the experimental results proved that the CL-PTKT has certain advantages over the state-of-the-art knowledge tracing algorithms.

The rest of the paper unfolds as follows: The next section, “Related Works” delves into knowledge tracing model. The “Relevant definitions” section then explains the relevant definitions of the algorithms in this paper. ”CL-PTKT model” describes the proposed method in detail. ”Experimental results and analysis” shows the experimental results and reliability of the algorithm in this paper. Finally, this paper summarizes in the ”Conclusions and future work” section and highlight the future research directions.

Related work

The existing knowledge tracing models fall into two categories: statistics-based and deep learning-based. The more classical statistical knowledge tracing model is the BKT-based model proposed by Corbett et al. [1416], which modeled a learner’s potential knowledge state as a set of binary variables that represent whether or not a knowledge point is grasped. Liu et al. [17] hypothesized that as learners master more number of knowledge points. The higher the probability that it will be answered correctly. Statistical-based knowledge tracing models can make the prediction results interpretable. However, they couldn’t accurately express the knowledge competencies corresponding to multiple knowledge points when interpreting them. And couldn’t dynamically assess students’ knowledge competencies. Therefore, in order to track students’ knowledge ability in real time, this paper explores deep learning models with powerful fitting capabilities based on big data of classroom students’ behaviors.

To address the limitations brought about by traditional methods. And to adequately deal with the complex relationship of nonlinear interaction between learners and exercises. A large number of researchers had devoted themselves to the study of deep learning-based knowledge tracing models. Piech et al. [18] proposed a DKT model with Recurrent Neural Networks (RNN) as the main body for knowledge tracking of students. It proved for the first time the effectiveness and potential of deep learning in knowledge tracing tasks. And solved the problem that the traditional model could not learn the features of knowledge relations autonomously. Zhang et al. [19] attempted to apply Dynamic Key-Value Pair Memory Networks (DKVMN) to knowledge tracking. It integrated knowledge relationships to obtain more accurately predict knowledge state prediction than the DKT model.

In order to address the problem of forgetfulness that exists in students. Im et al. [20] reacted the Forgetting-aware Linear Bias (FoLiBi) and applied it to a knowledge tracing model based on a contrastive learning framework(CL4KT). Meanwhile, Abdelrahman et al. [21] summarized that sequence modeling KT models could fully exploit the correlation features between test data. It improved prediction accuracy to some extent. However, they also suggested that knowledge tracing algorithm decisions should take into account the correlation with students’ classroom behavioral performance, learning engagement, and curriculum design. Therefore, such methods analyzed knowledge state with single data and lack of consideration of other factors affecting students’ knowledge state. It would decrease the interpretability of the model.

To address the weak ability of existing models to model knowledge point relationships. A large number of researchers had applied attentional mechanisms to knowledge tracing models. Pandey et al. [22] were the first to apply attentional mechanisms to knowledge tracing model. It determined the importance of different historical interactions to current student knowledge state by the scaled dot-product attention mechanism. Ghosh et al. [23] proposed attentive knowledge tracing model (AKT). The attention weights were computed using exponential decay and a context-aware relative distance measure. Choi et al. [24] categorized a series of students’ interactions into question and response embedding sequences based on the students. To address the problem that existing KT models ignore the existence of discrepancies between exercises and prior knowledge. Mao et al. [25] proposed a fine-grained knowledge tracing model (FGKT). It captured correlations between exercises and historical interactions by designing an effective attention mechanism. Huang et al. [26] proposed to increase the time consumed by the student testing process. And proposed a time-distance attention mechanism to predict learners’ knowledge state. Attentive KT model predicted a student’s current state by assigning different weights to different historical interactions. It improved interpretability. However, such methods were mainly based on a larger model of the attention mechanism constructed by encoders and decoders. It required a lot of data training to get better results. It is not applicable to small datasets in smart classrooms.

Meanwhile, in order to incorporate more features to accurately characterize students’ knowledge state. Wang et al. [27] proposed to obtain low-dimensional embeddings for channels with different features using stacked autoencoders. To strengthen the intrinsic links between learning characteristics. Kim et al. [28] reconstructed input features in conjunction with the difficulty of knowledge test questions. And to explore the multiple facets. Liu et al. [29] explored students’ practice records and the text content of the corresponding exercises. They proposed a general Exercise-Enhanced Recurrent Neural Network (EERNN) framework. And extends EERNN into an interpretable exercise-aware knowledge trace (EKT) in conjunction with the concept of knowledge. To predict students’ knowledge state. Yin et al. [30] used a masked language model (MLM) for a pre-training task to learn the representation of practice texts. It improved the performance of the EERNN model further. Text-Aware KT models made some headway in the field of knowledge tracing. However, because of the lack of consistency between datasets. And text markup is difficult. It is difficult to apply it widely.

In recent years, the continuous development of graph neural networks. Graph neural networks can adaptively extract spatial information between data. It is automatically updated by edge weights to obtain the node state. Shun Mao et al [31] proposed improved graph attention networks (GATs). It aggregated the knowledge structure features between knowledge concepts and test questions to obtain valid input. Su et al [32] proposed to incorporate features of the test and knowledge concepts. It solved the problem of multiple knowledge points and gained more predictive performance. To address test data sparsity and multiple knowledge points. Yang et al. [33] proposed a Graph-based Interaction model for Knowledge Tracing (GIKT). It modeled the interactions between exercises and knowledge points. And integrated exercise-knowledge point correlation via embedded propagation using graph convolutional networks (GCN). However, graph-based KT models need to presuppose the existence of data correlations and limited scope of use.

Based on the above KT models analysis, this paper provides an overview of relevant research models, as show in Table 1.

In summary, most of existing KT models have been studied mainly for online education platforms. And it is difficult to accurately characterize the knowledge state of students’ in complex smart classroom. The reality that knowledge state is often complex and ambiguous. Students are susceptible to the influence of students around them in smart classroom, so it is challenging to accurately characterize students’ knowledge state. In summary, This paper is based on classroom network structure learning engagement and knowledge-exercise data. Using parallel temporal attention LSTM to track students’ accumulation, forgetting and learning basis of knowledge points. Mining the correlation properties of student learning with knowledge–knowledge, previous-Learning engagement and post learning engagement. Achieving accurate knowledge diagnosis.

Relevant definitions

Marker Definition

According to the course data, the set of learners of a course is noted as , here p is the number of learners. The set of knowledge points of the course is represented as . with a total of k knowledge points. The exercise set for the course is represented as . with a total of m test questions. The containment relationship between questions and knowledge points is represented as matrix . where , represented as whether question em contains knowledge point kpk or not. The test of learner u is recorded as the vector .

Modeling learning engagement based on classroom network structure

Smart classroom learning engagement.

Students’ heads-up attention state corresponds to their learning engagement in smart classrooms. A higher heads-up rate indicates student focus in class. Therefore, through video monitoring systems that monitor students’ heads-up rate and correlate them with learning engagement in smart classroom. It can allows teachers to better observe and assess students’ behaviors and understand their learning state. According to Shou et al. [34] proposed to analyze the learner’s head pose assessment in the smart classroom video. To obtain the heads-up rate of learner during the time duration of learning k knowledge point as Eq. (1).

(1)

Here, m represents the number of image frames extracted within the knowledge point, and t represents the number of heads-up frames. In this paper, the heads-up rate within the knowledge point duration obtained from the learner’s head pose assessment model. It is used as the learning engagement in the smart classroom.

Classroom network structure learning engagement.

In smart classroom, students with seats in close neighborhood are more likely to influence each other [35]. In this paper, the nearest-neighbor nodes around a node are selected as learning influence factors based on the Nearest-Neighbor Effective Distance Criterion (NEDC) [13]. The solid box nodes in Fig 1 indicate the NEDC range near-neighbor nodes that have a learning influence on node .

thumbnail
Fig 1. Effective distance for near-neighbor relationships.

https://doi.org/10.1371/journal.pone.0320303.g001

The seat relationship data is used to construct the entitled and undirected seat relationship network that shown in Fig 2. Here U represents the set of student nodes and represents the set of edges. This is expressed as follows:

(2)

Here, represents the weight corresponding to edge . This is specified as the inverse

of the Euclidean distance between the and seat coordinates .

Taking learner as an example to calculate its learning engagement based on the classroom network structure as shown in Fig 1. According to Fig 1 to obtain the nodes and edge weight set within the influence of immediate neighbors as .

(3)

learner combines the learning engagement and node-to-node weight of the nodes within NEDC range the knowledge point k lecture time. To obtain the learning engagement of learner based on the classroom network structure.

(4)

Here, n represents the number of nearest neighbor influencing nodes and represents the own learning engagement.

CL-PTKT model

According to the above definition, the CL-PTKT model consists of the following main steps: First, according to the actual conditions of the dataset, the classroom network structure learning engagement and knowledge point test sequences were feature coded separately. Second, the feature encoding vectors are inputted into a parallel temporal attention LSTM network to track the accumulation of knowledge points, forgetting and learning bases, so as to obtain synchronized feature tensor and fuse them. Finally, the fused feature tensor is inputted into the multilayer perceptron network(MLP) for nonlinear mapping to obtain the knowledge diagnosis of students’ temporal knowledge points.The CL-PTKT model architecture is shown in Fig 3. (This paper’s algorithm source code and all datasets available at: https://github.com/LL-stars/Code-Dataset).

Feature tracking based on temporal attention LSTM

LSTM feature tracking.

LSTM is mainly used to model sequence of time-series data, which can effectively mine the temporal and semantic information in the data [36]. The LSTM structure is shown in Fig 4.

In this paper, the student’s question records are divided into sequences with a fixed length l as the interval. If the student’s question records are less than the length l, then complement 0. So as to get the set X of all students’ question sequences. LSTM is used to capture the long dependencies in the data of test questions X taken by the learners. As well as the long dependencies in the classroom network structure learning engagement Net within the learning hours of the knowledge points involved in the test questions. Here, X is shown in Eq. (5) and Net is shown in Eq. (6).

(5)(6)

Before inputting into the LSTM, the knowledge point test questions and the classroom network structure learning engagement sequence data are feature coded separately. Here, the one-hot encoding of a particular knowledge point test question sequence data is x.

(7)

Here, represents the vector of test question-knowledge point relationships for the test question with serial number l taken by the learner. Here, 0 represents that the test question does not involve the knowledge point corresponding to the subscript. 1 represents that it involves the knowledge point corresponding to the subscript.

The sequence of knowledge point test questions corresponds to the classroom network structure learning engagement input coded as net:

(8)

Here, represents the classroom network structure learning engagement. It contains test questions l involved all the knowledge points of the classroom network structure learning engagement.

According to the feature coding of the learners’ temporal test data. And feature coding of classroom network structure learning engagement within the learning hours of the knowledge point corresponding to the test data. The input data are linearly transformed using LSTM. And then the results are passed to the activation function and mapped to the fixed range (0,1) to control the flow of information. Three gating units, the forgetting gate , the input gate and the output gate , realize the functions of selectively forgetting the information of the previous moment, selectively memorizing the information of the current moment and selecting the information as the output of the current moment. The calculation is shown in Eq 9:

(9)

Similarly the input data and are linearly transformed. The result is passed to the Tanh function. The final result is the information to be input into the current LSTM cell’s state vector . The information to be input into the current LSTM cell is shown in Eq. (10).

(10)

The state of the previous moment and the pending input information of the current moment are selectively memorized and inputted, respectively, with the results of the two being summed to obtain the memory cell vector to maintain the memory of the LSTM cell in real time. Eventually, the sequence state vector is obtained by multiplying the output gate output vector and the memory cell vector after the activation function Tanh. This is shown in Eq. (11).

(11)

The feature tensor and are obtained by tracking both the knowledge point test data features and the classroom network structure learning engagement features using parallel temporal attention LSTM networks.

Temporal attention mechanismt.

The attention mechanism in deep learning facilitates the neural network model to focus on the information more critical to the current task in the input information. It can effectively reduce the problem of the LSTM’s loss of capturing the long sequence information and the shortage of extracting the short sequence feature information. So as to improve the model’s performance and generalization ability [37]. Therefore, this paper proposed the temporal attention mechanism to reassign the weights to the temporal knowledge points and learning engagement feature information from LSTM output. It can better extract the feature information related to the current knowledge points and improve the model prediction accuracy. The specific calculation of the temporal attention mechanism is shown in Eq. (12).

(12)

Here, the visualization of the temporal attention mechanism to reassign weights to the temporal knowledge point test data feature information is shown in Fig 5.

thumbnail
Fig 5. Visualization of the temporal attention mechanism for the weight reassignment of the knowledge point test feature information.

https://doi.org/10.1371/journal.pone.0320303.g005

Similarly, the visualization of the temporal attention mechanism to reassign weights to the temporal classroom network structure learning engagement feature information is shown in Fig 6.

thumbnail
Fig 6. Visualization of the temporal attention mechanism for the weight reassignment of the classroom network structure learning engagement feature information.

https://doi.org/10.1371/journal.pone.0320303.g006

Eventually the features and of the parallel temporal attention LSTM outputs are added to fuse them for obtaining the feature tensor h. The calculation is shown in Eq. (13).

(13)

MLP-based nonlinear mapping

MLP networks have nonlinear fitting ability to capture complex data patterns, and the main role is to map the input tensor nonlinearly. To extract the correlation information between features and map it to the output space [38]. The MLP network used in this paper is shown in Fig 7.

The final output of the fusion tensor through the MLP network is calculated as shown in Eq.

(14)

Here, p represents the model prediction, and H represents the feature fusion tensor of the output of the parallel temporal attention LSTM.

The output layer takes the learners’ real answer as the target. It iteratively updates the model parameters using the cross-entropy loss function. As shown in Eq. (15):

(15)

Here, xlj represents the actual answer to knowledge point j involved in the l exercise, and plj represents the predicted value of the model prediction for the knowledge point j involved in exercise l.

Algorithm 1 illustrates the basic steps of CL-PTKT:

Experimental results and analysis

Dataset

To validate the performance of the proposed CL-PTKT model, the experiments are conducted on four real datasets. They are Assistment0910, Assistment2017 [39], Eedi [40] and SCD(Smart Classroom Dataset) datasets, respectively.

The ASSISTment0910 dataset was collected by the ASSISTments online learning system during the 2009-2010 school year and contains records of unanswered and repeated exercises. It contains 4,217 students, 401,756 knowledge interactions, 123 knowledge points and 26,688 exercises.

The ASSISTment2017 dataset was collected by the ASSISTments online learning system during the 2017–2018 academic year and contains records of unanswered and repeated exercises. It contains 1,709 students, 942,816 knowledge interactions, 102 knowledge points and 3,162 exercises.

The Eedi dataset was released by the NeuralPS2020 Education Challenge and consists of four task datasets. In this paper, Task 3 and Task 4 parts of this dataset are selected for the knowledge tracing task. It contains 4,918 students, 1,048,575 knowledge interactions, and 948 exercises. Since there is no official information on which knowledge points are included in the exercises, each exercise is treated as a knowledge point in this paper.

The SCD dataset was a smart classroom dataset consisting of historical behavioral data of students attending the C Programming course in the year 2022. It contains 8 classroom videos teaching the process of the knowledge points, 58 students, 11,312 knowledge interactions data, 37 knowledge points and 45 exercises.

Because the Assistment0910, Assistment2017, and Eedi datasets do not contain information about the classroom network of the learner. They are supplemented with a value of 0 in these three datasets to ensure the wholeness of the model and to minimize the impact on the datasets. The statistics of the four datasets are shown in Table 2.

Evaluation indicators and baseline model

In order to evaluate the performance of the CL-PTKT model proposed in this paper, AUC, ACC, MAE and RMSE are used as evaluation metrics, which are also commonly used in knowledge tracing models.

In order to validate the advantages of the CL-PTKT model. Five baseline models are used for comparison experiments with the CL-PTKT model on four datasets, as shown in Table 3.

Experimental environment and hyper-parameter settings

The experimental environment of this paper is shown in Table 4.

In the CL-PTKT model, the learning rate is set to 0.001. The length of the input feature sequence l is set to 100. The number of one-sided LSTM layers is set to 1. The embedding dimension of the hidden layer is set to 200. The batch size of the model is set to 64, and the gradient descent optimization is carried out using the Adam optimizer.

Results

The ROC curves for each model on the Assistment0910, Assistment2017, Eedi and SCD datasets as shown in Fig 8. The ROC curves for each model confronting the e-learning platform dataset using only the knowledge point test question data as shown in Fig 8a–c. The analysis shows that the AUC value of the CL-PTKT model is better than that of the baseline model. Even if they are online platform datasets using only the knowledge point test data. Fig 8d represents the diagnostic performance of the model on the SCD dataset. The CL-PTKT model that incorporates learners’ classroom network structured learning engagement exhibits better performance.

thumbnail
Fig 8. ROC curves for the CL-PTKT model and the baseline model on the dataset. (a–d) represent the Assistment0910, Assistment2017, Eedi, and SCD datasets, respectively.

https://doi.org/10.1371/journal.pone.0320303.g008

The performance metrics of the CL-PTKT model on the Assistment0910, Assistment2017, Eedi, and SCD datasets with other baseline models are shown in Table 5 and analyzed as follows:

thumbnail
Table 5. Performance metrics of all models on the dataset

https://doi.org/10.1371/journal.pone.0313772.t005

  1. (1) The method proposed in this paper achieves the best performance in most of the evaluation metrics. In the SCD dataset, most of the evaluation metrics are better than the baseline model, and that demonstrates the effectiveness of integrating the classroom network structure learning engagement.
  2. (2) Comparing with the RNN-based DKT model and the DKVMN based on different activation functions to compute the accumulation and forgetting of knowledge points. CL-PTKT has a remarkable improvement. It proves the superiority of computing accumulation and forgetting based on the dynamic control gates of LSTM.
  3. (3) AKT mainly uses monotonic attention mechanism for contextual feature extraction, and QRCDM mainly uses the feature extraction method of α cross-validation idea. CL-PTKT enhances the feature extraction by using the temporal attention mechanism. The results are better than those of AKT, QRCDM. It proves that the temporal attention mechanism can better pay attention to the important feature information. As well as enhance the model prediction accuracy.
  4. (4) CL4KT-FoLiBi embeds the forgetting linear deviation mechanism to simulate students’ forgetting behavior, but ignores the fact that in reality the students’ knowledge process is ambiguous and complex. The CL-PTKT takes the aspect of enhanced feature extraction and achieves better results.
  5. (5) According to Table 5, the CL-PTKT model shows superior performance in all three online learning platform datasets and one smart classroom dataset. In this case, the distribution and number of the four datasets are different, as to prove that the CL-PTKT model has stronger robustness.
  6. (6) The RMSE values on the Assistment2017 dataset, the CL-PTKT model underperforms comparing with some baseline model. After experimentation, because of the long average length of student test sequences in the Assistment2017 dataset. The poor performance of the RMSE value CL-PTKT on the Eedi dataset. This is because the dataset contains many knowledge points.

Ablation experiment

Effects of temporal attention mechanisms.

In order to verify the advancement of the proposed temporal attention mechanism. It can enhance the model to capture learner’s feature information and improve the model’s prediction ability. This paper conducts the ablation experiments of the temporal attention mechanism on four datasets. The CL-PKT represents the model without the addition of the temporal attention mechanism. The results of the experiments are shown in Fig 9.

thumbnail
Fig 9. Performance metrics of CL-PTKT and CL-PKT models on four datasets. (a–d) represent the Auc, Acc, MAE, and RMSE values of the model with or without the temporal attention mechanism, respectively.

https://doi.org/10.1371/journal.pone.0320303.g009

According to Fig 9 and the analysis of the experimental results, CL-PTKT outperforms CL-PKT on all four datasets. It proves that CL-PTKT has better stability and robustness.

In summary, adding the temporal attention mechanism to LSTM can enhance the feature extraction ability of the model. It effectively improves the stability and prediction ability of the model.

Classroom network structure learning engagement ablation experiment.

In order to illustrate that in smart classrooms with complex environments, fusing learners’ classroom network structure learning engagement model(CL-PTKT) has better predictive performance. This paper conducts ablation experiments on the SCD dataset. The results of the experiment are shown in Fig 10. Here, PTKT represents without integrating the learner’s classroom network structure learning engagement, and L-PTKT represents integrating its own learning engagement model.

thumbnail
Fig 10. Comparison of ROC curves for ablation experiments on the SCD dataset for the CL-PTKT model.

https://doi.org/10.1371/journal.pone.0320303.g010

thumbnail
Table 6. Performance metrics of CL-PTKT model based on different learning engagement

https://doi.org/10.1371/journal.pone.0313772.t006

thumbnail
Table 7. Comparison of the accuracy of the PTKT, L-PTKT and CL-PTKT modelss

https://doi.org/10.1371/journal.pone.0313772.t007

The performance metrics of using different classroom network structure learning engagement models on the SCD dataset are shown in Table 6.

Meanwhile, this paper conducts experimental validation on some students within 37 knowledge point lecture time. The results of the comparison of the accuracy of the knowledge state diagnosis results output from the PTKT, L-PTKT and CL-PTKT models are shown in Table 7.

From the analysis of Fig 10, Tables 6 and 7, the performance of the CL-PTKT model incorporating classroom network features outperformed that without the incorporation of classroom network features. It also outperforms the model that incorporates the learner’s own learning engagement features. The overall results indicate that in smart classroom with multiple learners, a single dimension of test question data cannot accurately predict student performance. In addition, students’ learning engagement does not only depend on themselves, but also the learners’ learning engagement will be influenced by the nearest-neighbor nodes based on the seating relationship network. These influencing factors play a direct role in learners’ grasp of knowledge points during the learning process. It is ultimately reflected in the results of the knowledge point test.

Analysis of results

In order to verify the effectiveness of the model’s predicted outputs for knowledge state diagnosis. This paper uses learners’ classroom test scores as targets on the SCD dataset for classification validation. this paper uses learners’ classroom test scores as targets on the SCD dataset for classification validation. Some of this student knowledge state diagnostic results from the predicted output of the CL-PTKT model are shown in Table 8.

thumbnail
Table 8. Diagnostic results of students’ knowledge state within each knowledge point’s lecture time

https://doi.org/10.1371/journal.pone.0313772.t008

In order to assess the reliability of the resulting knowledge state diagnostic results, the cognitive state levels of the pyramid proposed by Bloom [41] were relied upon (memorization, comprehension, application, analysis, evaluation, and innovation). According to the correlation between test scores and cognitive levels proposed by [4246] et al. The learners’ knowledge diagnostic results are separated into intervals by the duration of the knowledge points learned in the course. The results are categorized as Memory ([0,40), F), Comprehension ([40,65), E), Application ([65,80), D), Analysis ([80,90), C), Evaluation ([90,97), B), and Innovation ([97,100], A), as shown in Table 9.

thumbnail
Table 9. Diagnostic results of students’ knowledge state within each knowledge point’s lecture time

https://doi.org/10.1371/journal.pone.0313772.t009

In the end, the results of Table 8 are combined to analyze the knowledge diagnostic results of these six students within these 37 lecture times of knowledge points. Also comparing each student’s performance on the classroom final test, as shown in Table 10.

thumbnail
Table 10. Students’ comprehensive cognitive state and final examination scores

https://doi.org/10.1371/journal.pone.0313772.t010

Classroom tests are effective in assessing students’ cognitive state. Cognitive state affect students’ learning outcomes. When students are in an environment of positive learning engagement, they are more aware of the knowledge points and their assessment scores are relatively higher. According to Table 9, the results obtained from the cognitive state diagnostic model proposed in this paper correspond to their test scores. The students with higher test scores are basically in a higher level of cognitive state. It indicates that the proposed cognitive state diagnostic model has good reliability.

Discussion

Our proposed CL-PTKT model can be used as an auxiliary teaching module of the smart classroom teaching platform to help teachers improve the teaching process design and improve teaching efficiency. By observing the students’ knowledge tracking results predicted by the model, teachers can have a deeper understanding of students’ current knowledge status, and then intervene in time. Students can know what knowledge they have mastered well enough according to the knowledge tracking results, so as to clarify their learning objectives and improve their academic performance.

The above experimental results show that in the smart classroom scenario, the integration of learning engagement based on classroom network structure can effectively improve the accuracy of the model in predicting students ’knowledge state. Similarly, in the field of online education, students ’learning engagement can be obtained by collecting information such as the frequency and duration of interaction between students and knowledge points, so as to predict students ’knowledge status more accurately. Therefore, this model can also be considered to be applied to online education platforms to track students’ knowledge status, and provide stronger interpretability for teaching recommendation systems according to the knowledge tracking results.

Conclusions and future work

In this paper, a classroom network structure learning engagement and parallel temporal attention LSTM based knowledge tracing model (CL-PTKT). Aiming to address the inability to accurately characterize student knowledge state in complex smart classroom, and the problem that existing KT models only consider historical test behavior data of students on online educational platforms. The CL-PTKT model is effective in characterizing students’ knowledge state even with limited interaction data in smart classroom, as well as shows superior performance even in the face of large datasets online. The experimental results show that the prediction accuracy of the learning engagement degree model(CL-PTKT) of the fusion classroom network structure is better than the benchmarking model(PTKT, L-PTKT). Moreover, the use of parallel temporal attention LSTM helps to enhance the extraction of feature information, which further improves the accuracy of model prediction. Extensive experiments are conducted on four real datasets. It shows that the CL-PTKT model exhibited better diagnostic performance compared to five baseline models (DKT, DKVMN, AKT, QRCDM, CL4KT-FoLiBi) under different conditions.

The CL-PTKT model can effectively address the problem of diagnosing students’ knowledge state in complex smart classrooms, and enhance the design of the education platform system. Characterizing students’ knowledge state by incorporating classroom network structure learning engagement. It can help teachers make timely adjustments to student seating during the teaching process, and appropriate interventions. Improving students’ attitudes towards learning and learning engagement. This promotes effective and enhanced student learning. This research can be applied to online platforms and offline smart classroom platforms. but existing image processing techniques applied to the smart classroom in a large scene have problems such as the difficulty of recognizing students in the back row, therefore, the classroom network structure data in this study can not be fully automated, and it is necessary to add a manual proofreading link, which increases a certain amount of workload.

In future research, the effects of different classroom social relationships and network characteristics on student cognition will be considered, so as to achieve a more accurately characterize cognitive state and improve the predictive performance of the model. In addition, this paper plans to deploy the algorithm into embedded devices for smart classrooms. To provide effective support for teachers to make precise interventions on students during the teaching process. So as to continuously improve the students’ learning effect.

References

  1. 1. Xing Y, Liu P. Research on the changes of user cognitive structure before and after exploratory search. Library Inf Work. 2021:65(22);74.
  2. 2. Uto M, Ueno M. Item response theory for peer assessment. IEEE Trans Learn Technol. 2015;9(2):157–70.
  3. 3. Yang H, Qi T, Li J, et al. A novel quantitative relationship neural network for explainable cognitive diagnosis model. Knowl-Based Syst. 2022;250(C):109156.
  4. 4. Cui J, Chen Z, Zhou A, et al. Fine-grained interaction modeling with multi-relational transformer for knowledge tracing. ACM Trans Inf Syst. 2023,41(4):1–26.
  5. 5. Lu K, Yang H H, Shi Y, et al. Examining the key influencing factors on college students’ higher-order thinking skills in the smart classroom environment. Int J Educ Technol Higher Educ. 2021;18:1–13.
  6. 6. Liu S, Liu S, Liu Z, et al. Automated detection of emotional and cognitive engagement in MOOC discussions to predict learning achievement. Comput Educ. 2022;181:104461.
  7. 7. Maamin M, Maat SM, Iksan ZH. The influence of student engagement on mathematical achievement among secondary school students. Mathematics. 2021:10(1);41.
  8. 8. Delgado K, Origgi J M, Hasanpoor T, et al. Student engagement dataset. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 3628–36.
  9. 9. Yang X, Zhou X, Hu J. Students’ preferences for seating arrangements and their engagement in cooperative learning activities in college English blended learning classrooms in higher education. Higher Educ Res Dev. 2021;41(1):1–16.
  10. 10. Putnik G, Costa E, Alves C, et al. Analysing the correlation between social network analysis measures and performance of students in social network-based engineering education. Int J Technol Design Educ. 2016,26:413–37.
  11. 11. van Rijsewijk, Oldenburg B, Snijders , et al. A description of classroom help networks, individual network position, and their associations with academic achievement. PLoS ONE. 2018:13(12);e0208173.
  12. 12. Gutierrez A. The effects of various classroom seating arrangements on English learners’ academic achievement. University Honors Program Senior Projects 31. 2022. https://neiudc.neiu.edu/uhp-projects/31
  13. 13. Li J, Shi D, Tumnark P, et al. A system for real-time intervention in negative emotional contagion in a smart classroom deployed under edge computing service infrastructure. Peer-to-Peer Netw Appl. 2020;13(5):1706–19.
  14. 14. Corbett AT, Anderson JR. Knowledge tracing: modeling the acquisition of procedural knowledge. User Model User-adapted Interact. 1994;4:253–78.
  15. 15. Wang Y, Heffernan N. Extending knowledge tracing to allow partial credit: using continuous versus binary nodes. In Artificial Intelligence in Education: 16th International Conference, AIED 2013, Memphis, TN, USA, July 9–13, 2013. Proceedings 16. Berlin, Heidelberg: Springer, 2013.
  16. 16. Zhang K, Yao Y. A three learning states Bayesian knowledge tracing model. Knowledge-Based Syst. 2018;148:189–201.
  17. 17. Liu Q, Wu R, Chen E. Fuzzy cognitive diagnosis for modelling examinee performance. ACM Trans Intel Syst Technol. 2018;9(4):1–26.
  18. 18. Piech C, Bassen J, Huang J. Deep knowledge tracing. Adv Neural Inf Proces Syst. 2015;28.
  19. 19. Zhang J, Shi X, King I et al. Dynamic key-value memory networks for knowledge tracing. In: Proceedings of the 26th International Conference on World Wide Web, 2017, pp. 765–74.
  20. 20. Im Y, Choi E, Kook H et al. Forgetting-aware Linear Bias for Attentive Knowledge Tracing. In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023, pp. 3958–62.
  21. 21. Abdelrahman G, Wang Q, Nunes B. Knowledge tracing: a survey. ACM Comput Surv. 2023;55(11):1–37.
  22. 22. Pandey S, Karypis G. A self-attentive model for knowledge tracing. Educ Data Mining. 2019.
  23. 23. Ghosh A, Heffernan N, Lan AS. Context-aware attentive knowledge tracing. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2020, pp. 2330–9.
  24. 24. Choi Y, Lee Y, Cho J et al. Towards an appropriate query, key, and value computation for knowledge tracing. In: Proceedings of the Seventh ACM Conference on Learning @ Scale, 2020, pp. 341–4.
  25. 25. Mao S, Zhan J, Wang Y. Improving knowledge tracing via considering two types of actual differences from exercises and prior knowledge. IEEE Trans Learn Technol. 2023;16(3):324–38.
  26. 26. Huang T, Hu S, Yang H. Response speed enhanced fine-grained knowledge tracing: a multi-task learning perspective. Expert Syst Appl. 2024;238:122107.
  27. 27. Wang Z, Hou Y, Zeng C. Multiple learning features–enhanced knowledge tracing based on learner–resource response channels. Sustainability. 2023:15(12);9427.
  28. 28. Kim J, Koo S, Lim H. A multi-faceted exploration incorporating question difficulty in knowledge tracing for English proficiency assessment. Electronics. 2023:12(19);4171.
  29. 29. Liu Q, Huang Z, Yin Y. Ekt: exercise-aware knowledge tracing for student performance prediction. IEEE Trans Knowl Data Eng. 2019;33(1):100–15.
  30. 30. Yin Y, Liu Q, Huang Z et al. Quesnet: a unified representation for heterogeneous test questions. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019, pp. 1328–36.
  31. 31. Mao S, Zhan J, Li J et al. Knowledge structure-aware graph-attention networks for knowledge tracing. In: International Conference on Knowledge Science, Engineering and Management. Cham: Springer, 2022; pp. 309–21.
  32. 32. Su Y, Liu Q, Liu Q et al. Exercise-enhanced sequential modeling for student performance prediction. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2018, 32(1).
  33. 33. Yang Y, Shen J, Qu Y et al. GIKT: a graph-based interaction model for knowledge tracing. In: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14–-18, 2020, Proceedings, Part I. Springer International Publishing, 2021; pp. 299–315.
  34. 34. Shou Z, Tang M, Wen H. Key student nodes mining in the in-class social network based on combined weighted GRA-TOPSIS method. Int J Inf Commun Technol Educ. 2023;19(1):1–19.
  35. 35. Rani S, Kumar M. Influential node detection and ranking with fusion of heterogeneous social media information. IEEE Trans Comput Soc Syst. 2023;10(4):1852–74.
  36. 36. Hu J, Wang X, Zhang Y. Time series prediction method based on variant LSTM recurrent neural network. Neural Proces Lett. 2020;52:1485–500.
  37. 37. Niu Z, Zhong G, Yu H. A review on the attention mechanism of deep learning. Neurocomputing. 2021;452:48–62.
  38. 38. Tolstikhin I O, Houlsby N, Kolesnikov A. MLP-mixer: an all-MLP architecture for vision. Adv Neural Inf Proces Syst, 2021;34:24261–72.
  39. 39. Feng M, Heffernan NT. Informing teachers live about student learning: Reporting in the assistment system. Technol Instr Cognit Learn. 2006;3(1/2):63.
  40. 40. Jiang Y, Foret P, Yak S. NeurIPS 2020 competition: predicting generalization in deep learning. arXiv. 2012. https://arxiv.org/abs/2012.07976.
  41. 41. Bloom BS, Krathwohl DR. Taxonomy of educational objectives: the classification of educational goals. The Classification of Educational Goals. Book 1, Cognitive Domain. Longman 2020.
  42. 42. Furst EJ. Bloom’s taxonomy of educational objectives for the cognitive domain: philosophical and educational issues. Rev Educ Res. 1981;51(4):441–53.
  43. 43. Huitt W. Bloom et al.’s taxonomy of the cognitive domain. Educ Psychol Interact. 2011;22:1-4.
  44. 44. Adams N E. Bloom’s taxonomy of cognitive learning objectives. J Med Libr Assoc. 2015:103(3);152.
  45. 45. Tikhomirova T, Malykh A, Malykh S. Predicting academic achievement with cognitive abilities: cross-sectional study across school education. Behav Sci. 2020:10(10);158.
  46. 46. Sedaghat M, Abedin A, Hejazi E. Motivation, cognitive engagement, and academic achievement. Procedia. 2011;15:2406–10.