Machine learning based approach to exam cheating detection

doi:10.1371/journal.pone.0254340

Fig 1.

Both sequences of scores consist of the same values, but in different order.

The steady progression of grades of Student 2 makes a score of 95 on the final exam seem plausible. On the other hand, the pattern of grades for Student 1 makes a grade of 95 on the final exam highly unexpected.

More »

Expand

Fig 2.

1D kernel density estimate of a Gaussian distribution using various bandwidth values.

More »

Expand

Table 1.

The neural network architecture for the proposed algorithm.

More »

Expand

Fig 3.

Representative samples of the simulated datasets used in our experiments.

The datasets capture different scenarios for the distribution of the grades. (a) A representative sample of Dataset 1 grades. The dataset consists of 91% normal and 9% anomalous grades. The normal grades consist of three quarters homogeneous grades and one quarter increasing grades. The anomalous grades rise sharply—by 35 points—during the final exam. (b) A representative sample of Dataset 2 grades. The dataset is similar to Dataset 1. However, the anomalous grades rise less sharply—by 20 points—during the final exam. As a result, the outliers are harder to identify. (c) A representative sample of Dataset 3 grades. The dataset is similar to Dataset 2. However, around 10% of the normal grades are increasing at an incremental pace so that the difference between the average prior and final exam scores are same as in the anomalous instances. As a result, it is even more challenging to identify the outlier scores. (d) A representative sample of Dataset 4 grades. The dataset is designed to simulate a scenario when the final exam is easy and everyone receives a relatively high grade. The normal final exam scores 10 points higher than the average on prior assessments. The anomalous final exam scores 25 points higher than the average preceding scores.

More »

Expand

Table 2.

The mean and standard deviation TPR for the anomaly detection algorithms.

The results represent experiments on four datasets based on 20 simulated experiments. The proposed method (NewAlgo) produces the best overall results.

More »

Expand

Table 3.

The mean and standard deviation of FPR for the anomaly detection algorithms.

The results represent experiments on four datasets based on 20 simulated experiments.

More »

Expand

Table 4.

The mean TPR for the anomaly detection algorithms.

The results represent experiments on four datasets based on 20 simulated experiments and the class size of 220. The proposed method (NewAlgo) produces the best overall results.

More »

Expand

Table 5.

The scores of the (true) cheating cases and the outlier cases determined by the detection methods in DS2 dataset.

More »

Expand

Table 6.

The true positive and false positive rates of the anomaly detection algorithms.

The results represent experiments on a single real-life dataset. The proposed method (NewAlgo) produces the best overall results.

More »

Expand