Fig 1.
Example of an AST extracted from the f() function.
Fig 2.
Two functions for summation using integer and float data types.
Fig 3.
Two functions for summation using integer and float data types.
Fig 4.
Baf IR for the Integer_Sum() and Float_Sum() functions.
Fig 5.
Code fragment validate() and its optimized Jimple IR.
Fig 6.
Code fragments with different loop statements and their Jimple IR.
Fig 7.
Jimple block program dependence graph example.
Table 1.
Code representation techniques used in the literature.
Fig 8.
Architectural workflow of the proposed approach.
Fig 9.
The process of extracting syntactic and semantic features.
Table 2.
Selected AST non-terminal nodes.
Table 3.
Selected Baf instructions.
Table 4.
Selected Jimple and Block PDG features.
Table 5.
Dataset details.
Fig 10.
Performance comparison of fifteen classifiers utilizing various feature combinations on the BigCloneBench dataset.
Fig 11.
Performance of the top five classifiers with linearly combined features on four BigCloneBench datasets.
Fig 12.
Performance of the top five classifiers with multiplicatively combined features on four BigCloneBench datasets.
Fig 13.
Performance of the top five classifiers with distance-combined features on four BigCloneBench datasets.
Fig 14.
Performance of the XGBoost, LightGBM, Random Forest, and Rotation Forest classifiers with different feature sizes using the distance combination approach on the BigCloneBench dataset.
Fig 15.
Performance of the XGBoost, LightGBM, Random Forest, and Rotation Forest classifiers with different feature sizes using the multiplicative combination approach on the BigCloneBench dataset.
Fig 16.
Performance of the XGBoost, LightGBM, Random Forest, and Rotation Forest classifiers with different feature sizes using the linear combination approach on the BigCloneBench dataset.
Table 6.
Results of detected semantic clones using the proposed technique.
Fig 17.
Recall values of the proposed method compared against selected methods with respect to different clone types.
Fig 18.
F1-score values of the proposed method compared against selected methods with respect to different clone types.