Figures
Abstract
Defects in printed circuit boards (PCBs) are being detected using computer vision-based techniques. Defect-free PCBs are essential for the reliability of consumer electronics. However, deep learning-based methods often struggle with imbalanced defect distributions and limited generalization. To address these challenges, we propose ViT-Mamba, a hybrid framework that combines Vision Transformers with a Mamba-inspired attention mechanism for global feature extraction and precise defect segmentation. We further introduce an artificial defect generation module that systematically creates six types of PCB defects to improve robustness. A multiscale hierarchical refinement strategy is employed to enhance feature representation for accurate segmentation. Experiments on a public PCB defect dataset show that ViT-Mamba outperforms existing methods, achieving a mean Average Precision (mAP) of 99.69%.
Citation: Niaz A, Umraiz M, Soomro S, Choi KN (2025) Vision transformer and Mamba-attention fusion for high-precision PCB defect detection. PLoS One 20(9): e0331175. https://doi.org/10.1371/journal.pone.0331175
Editor: Azim Uddin, Zhejiang University, CHINA
Received: May 14, 2025; Accepted: August 12, 2025; Published: September 25, 2025
Copyright: © 2025 Niaz et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript.
Funding: This work was supported by the Ministry of Science and Information and Communication Technology (ICT) and National IT Industry Promotion Agency (NIPA) through the High Performance Computing (HPC) Support Project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Defects are unexpected patterns that may emerge across various data modalities, including tabular, visual, textual, and time-series data. They may occur at any time and in any domain, often requiring automated detection for security, access control, or quality assurance. Automated systems are designed to flag such deviations, enabling timely intervention and response [1]. PCB defect detection is vital for maintaining the quality and reliability of consumer electronics, as even minor defects can lead to significant functional failures and economic losses [20]. In 2024, PCBs had an estimated market value of approximately $80 billion. Conventional manual inspection techniques are highly susceptible to environmental factors, leading to decreased defect detection accuracy. Recent advancements in computer vision and deep learning have significantly improved anomaly detection across various domains, including image aesthetics assessment [2], network anomaly detection [3,4], and 3D reconstruction [5]. Transformer-based architectures, such as PIPformers for image inpainting [6] and EAPT for image processing [7], have demonstrated superior feature extraction capabilities, enabling precise identification of complex patterns [8,9]. In medical imaging, models like Ecsu-Net [10] have been effective in segmentation and classification, showcasing the potential of deep learning for intricate defect detection tasks. Similarly, multimodal cascaded CNNs have been successfully applied to waste classification [11], highlighting the versatility of convolutional and transformer-based models for anomaly detection.
Detecting small PCB defects is challenging due to their subtle visual characteristics, often leading to visual strain and misclassification. To overcome these issues, researchers have increasingly adopted machine learning techniques, which have significantly advanced PCB defect detection in recent years. Despite the development of various computer vision-based automated methods, accurately identifying defects remains complex and continues to evolve [12,13]. Wang et al. [14] developed an automated algorithm capable of identifying 2 mm pinhole defects within ten seconds. Yuk et al. [15] combined accelerated robust features with a random forest algorithm to enhance defect detection by focusing on feature density in defect-prone areas. Gaidhane et al. [16] employed similarity metrics to detect PCB surface defects, demonstrating effectiveness in identifying and localizing anomalies even in complex component layouts. While other machine learning-based approaches [17,18] have been proposed, they often lack real-time processing capabilities and rely on manual feature extraction, limiting their adaptability to diverse defect types.
Fully supervised detection methods [19–21] are widely used for detecting small targets, leveraging well-established networks such as YOLO [22,23] and TDD-Net [24]. While these approaches achieve high accuracy, their ability to generalize remains limited, making it difficult for the model to recognize PCB defect types not included in the training data. Additionally, training such models demands large-scale annotated datasets, which involve significant labor costs. The imbalanced distribution of defects in real-world PCB datasets further hampers the performance of fully supervised models.
Despite significant advancements in PCB defect detection, existing methods often struggle with limited generalization, computational inefficiency, and imbalanced defect distributions. Fully supervised approaches require extensive labeled datasets, making them costly and labor-intensive, while unsupervised methods frequently lack precision in anomaly localization. Additionally, CNN-based models, though effective in extracting spatial features, often fail to capture global dependencies, which are crucial for complex defect patterns.
To address these challenges, we propose ViT-Mamba, a hybrid framework that combines ViT-based feature extraction with a Mamba-inspired attention mechanism for precise segmentation and anomaly detection. Unlike traditional CNN-based architectures, our method effectively captures long-range dependencies, enhancing its ability to detect subtle and irregular PCB defects. Moreover, our artificial defect creation module systematically augments training data, improving robustness against real-world variations. The multi-scale hierarchical refinement in our decoder further refines feature representations, ensuring accurate anomaly segmentation while maintaining computational efficiency. The key contributions of this work are as follows:
- Artificial Defect Creation Module: Systematically generates diverse PCB defects, mitigating data imbalance issues, and ensuring diverse and realistic training data.
- ViT-Mamba Hybrid Architecture: Combines Vision Transformers for global feature extraction with a Mamba-inspired decoder to improve segmentation precision.
- Multi-Scale Hierarchical Refinement: Utilizes multi-level convolutional features and Mamba attention gates to refine spatial representations, ensuring precise and computationally efficient anomaly segmentation.
The subsequent sections of this paper are organized as follows: the proposed method is first introduced, followed by the experiments and results. Next, the limitations and potential directions for future research are discussed, and finally, the study is concluded.
Related work
PCB defect detection methods generally fall into two main categories: traditional machine vision techniques and deep learning-based approaches.
Traditional methods typically involve image preprocessing, segmentation, feature extraction, and classification. Moganti et al. [25] surveyed early inspection algorithms that relied on techniques such as image subtraction and template matching. These approaches use handcrafted features—shape, texture, color—to detect issues like missing holes, open circuits, and shorts. For example, Raihan and Ce [26] applied image subtraction using OpenCV, while Raj and Sajeena [27] utilized general-purpose image processing techniques for defect detection. However, such methods are limited by low adaptability to complex defect types and varying conditions, and they require extensive manual feature engineering [28]
Deep learning has significantly improved defect detection by enabling end-to-end learning from data. Convolutional Neural Networks (CNNs), in particular, have shown strong performance. Chen et al. [28] reviewed CNN-based models like AlexNet, VGG, and ResNet, highlighting their high accuracy. Li and Guo [29] implemented Faster R-CNN with a VGG-16 backbone and data augmentation. Hu and Wang [30] enhanced Faster R-CNN using a Feature Pyramid Network (FPN) and generative adversarial region proposals, improving small defect detection.
Single-stage detectors such as YOLO and SSD offer fast inference, making them suitable for real-time inspection. Xin et al. [31] optimized YOLOv4 with mosaic augmentation and Mish activation. Zhang et al. [32] proposed a lightweight detector using dual attention and a path-aggregation FPN, and Wu et al. [33] introduced GSC YOLOv5 with Ghost Convolution and attention modules to enhance speed and accuracy.
Transformer-based models have recently gained attention for their ability to capture long-range dependencies. An et al. [34] proposed LPViT using label smoothing and patch correlation. Chen et al. [35] replaced CNNs with a Swin Transformer in a YOLO-like architecture. Yang et al. [36] integrated SwinV2TDD into YOLOv7, combining local and global features for improved performance.
Despite progress, challenges remain. Deep models often require large datasets, significant computational resources, and lack interpretability. Traditional methods, while lightweight, fail to generalize across diverse defect types. Recent trends involve hybrid models combining CNNs and Transformers [37], with future directions including GAN-based data augmentation, transfer learning for small datasets, and explainable AI solutions [28].
In summary, the evolution from traditional to deep learning approaches has advanced PCB defect detection. CNNs offer a balance between speed and accuracy, while Transformers improve performance in complex scenarios. Building on this, our proposed method combines mathematically generated artificial defects with a ViT-Mamba architecture. By simulating six realistic defect types, we address data imbalance, and through Vision Transformers and Mamba-inspired attention mechanisms, we enable accurate segmentation and localization—providing a robust, efficient solution for diverse PCB defect detection tasks.
Proposed method
The proposed method consists of two main modules: artificial defects creation and ViT-Mamba. The overall pipeline of the proposed PCB anomaly detection framework is illustrated in Fig 1.
A defect-free PCB undergoes artificial anomaly creation to generate synthetic defects for training. The ViT-Mamba architecture processes these images using a ViT encoder for feature extraction, multi-scale convolutional layers for hierarchical representation, and a Mamba-inspired decoder with attention gates.
Artificial defects creation
Defect detection in PCBs is inherently challenging due to the limited availability of labeled data for rare defect types and the imbalance in defect categories. To address these challenges, we propose a structured framework for artificially generating six distinct PCB defect types: Missing Hole, Mouse Bite, Open Circuit, Short, Spur, and Spurious Copper. The artificial creation of these defects is governed by mathematically formulated rules to ensure realism, alignment with real-world manufacturing anomalies, and compatibility with diverse PCB designs. Additionally, defects are induced into the dataset using a randomization strategy to enhance variability and robustness.
1. Missing hole.
Condition: Missing holes occur when predefined vias or mounting holes are not drilled properly during manufacturing. These defects disrupt the electrical connections between PCB layers or impede mechanical assembly.
Mathematical Formulation:
Constraint:
Explanation: The predefined hole positions are derived from PCB design files, such as Gerber drill files, which specify the coordinates of critical vias, mounting holes, or component connections. The defect is simulated by applying a circular mask to remove specific hole regions, ensuring the defect aligns with realistic manufacturing errors. Missing hole defects are introduced randomly into selected hole positions.
2. Mouse bite.
Condition: Mouse bite defects are irregular notches along conductive traces caused by incomplete etching or mechanical stress. These defects can weaken the trace and lead to electrical instability.
Mathematical Formulation:
Here, T is the set of conductive trace points, ε is the max notch distance, and defines the notch shape.
Explanation: Mouse bite defects are applied by introducing notches along the conductive traces of the PCB. The trace points T are randomly selected, and the size, shape, and orientation of the notches are varied to simulate real-world imperfections. These defects mimic failures in the etching process and mechanical handling errors.
3. Open circuit.
Condition: Open circuit defects occur when breaks along conductive traces interrupt the electrical continuity of a circuit, often due to incomplete etching or mechanical damage.
Mathematical Formulation:
Here, T is all trace points, is the broken subset, and
limits break length.
Explanation: Open circuit defects are simulated by removing segments of conductive traces. The location and length of the break are randomized to replicate diverse failure scenarios. These defects disrupt the continuity of the circuit, providing an effective training dataset for detecting trace interruptions.
4. Short.
Condition: Short circuit defects are unintended connections between two conductive elements, often caused by excess solder, copper residues, or manufacturing errors.
Mathematical Formulation:
Here:
: Sets of points for the two conductive elements being shorted.
: threshold for short placement.
Explanation: Short circuits are introduced by artificially connecting two nearby conductive elements. The placement and geometry of the short are randomized to reflect realistic manufacturing or assembly errors. This defect simulates unintended bridging, a critical anomaly in PCB functionality.
5. Spur.
Condition: Spur defects are thin, unintended protrusions extending from conductive traces, often caused by over-etching or material deposition errors.
Mathematical Formulation:
Here:
Explanation: Spur defects are applied as protrusions along the traces, varying in size and shape. The placement of these defects is controlled to ensure alignment with real-world patterns observed in faulty PCBs.
6. Spurious copper.
Condition: Spurious copper defects occur as random copper patches left on the PCB due to incomplete etching or material deposition.
Mathematical Formulation:
Here:
- C: Set of randomly placed copper patch points.
is a random binary mask.
Explanation: Spurious copper defects are simulated by randomly placing irregular copper patches across the PCB. The randomness ensures variation in patch size, shape, and placement, making the dataset more comprehensive.
Random induction of defects.
A randomization strategy introduces defects into defect-free PCB images by generating binary masks that define the size, location, and orientation of each defect. The artificially defective PCB image, denoted as Ia, is generated by combining the original defect-free image I, the defect-only image M, and the binary mask Ma. The generation formula is defined as follows:
where represents the inverse of the binary mask, marking the non-defective regions,
denotes element-wise multiplication. This ensures seamless integration of defects while preserving the intact areas of the PCB.
To further enhance variability, the defects and their corresponding masks undergo a series of random transformations, including rotation, translation, and scaling. Rotation applies a random angle to the defects and masks, translation shifts them randomly along the x- and y-axes, and scaling adjusts their size using a random factor. This diversifies the dataset, improving model robustness for anomaly detection and segmentation. Figs 2 and 3 illustrate the defect creation pipeline and generated synthetic masks.
ViT-Mamba
This section presents ViT-Mamba, a novel architecture for image anomaly detection and segmentation. The proposed method integrates the global feature extraction capabilities of ViTs in the encoder with a Mamba-inspired decoder that utilizes attention mechanisms for precise segmentation and anomaly localization. The framework is designed to address challenges in industrial and medical imaging, where accurate anomaly detection and pixel-level segmentation are crucial.
Encoder: Vision transformer-based feature extraction.
The encoder leverages the Vision Transformer (ViT) to capture global dependencies and contextual information by processing images as non-overlapping patches. The input image is divided into patches of size
, forming a sequence of flattened patches
, where
is the number of patches. The embeddings of these patches are then processed through a multi-layer transformer.
The transformer computes self-attention as follows:
where ,
, and
are the query, key, and value matrices, respectively, and dk is the dimensionality of the keys.
To adapt the ViT for segmentation tasks, the encoded feature maps are passed through lightweight convolutional layers to generate hierarchical feature representations at multiple scales. These feature maps are denoted as , where b1 corresponds to shallow features and b5 contains the most semantic information. The multi-scale features are then forwarded to the decoder.
Decoder: Mamba-inspired attention-gated design.
The decoder employs a Mamba-inspired design that incorporates attention gates and efficient upsampling mechanisms to produce high-resolution segmentation maps. The decoder comprises three key components:
Mamba attention gates.
Mamba Attention gates refine the skip connections from the encoder by suppressing irrelevant features and enhancing the most relevant ones. Given the decoder’s intermediate features and the encoder’s skip connection
, the attention gate outputs a refined feature map
:
where and
are learnable weight matrices, and ψ is a sigmoid activation function. This mechanism improves the focus on anomalous regions. The Mamba Attention Gate mechanism is illustrated in Fig 4.
Encoder () and decoder (
) features are weighted, combined, and passed through ReLU and sigmoid activations to compute attention weights (ψ). These weights refine
, producing the final feature map (
) for improved segmentation.
Efficient upsampling.
The decoder progressively upsamples the feature maps using bilinear interpolation followed by convolutional layers. Each upsampling stage integrates refined skip connections from the encoder to recover fine-grained details.
Multi-scale refinement.
The decoder processes the hierarchical features starting from the deepest map b5. At each stage, the upsampled feature is concatenated with the corresponding skip connection and passed through a decoding block:
where is the upsampled feature,
is the refined skip connection, and
denotes concatenation.
Table 1 summarizes the ViT-Mamba architecture, detailing the encoder, Mamba-based decoder, layer types, output shapes, and key design components for anomaly detection and segmentation.
Loss calculation.
The proposed ViT-Mamba framework optimizes a hybrid loss function designed to address both segmentation accuracy and anomaly detection performance. The total loss, , combines pixel-wise segmentation loss, anomaly detection loss, and focal loss to handle challenges such as class imbalance and hard-to-detect anomalies. The loss is defined as:
where ,
, and
are weighting factors that balance the contribution of each loss component.
The segmentation loss, , ensures accurate pixel-wise predictions for both normal and anomalous regions. This loss penalizes the difference between the predicted segmentation mask
and the ground truth
. Common choices for
include the Dice loss or binary cross-entropy (BCE) loss, defined as:
where N is the total number of pixels, and yi and represent the ground truth and predicted probabilities, respectively, for pixel i.
The anomaly detection loss, , guides the model to distinguish anomalous regions from normal regions. This loss emphasizes the detection of rare and subtle anomalies, penalizing false positives and false negatives. For binary anomaly detection,
can also be formulated using BCE loss.
To address class imbalance and improve the detection of hard-to-classify regions, focal loss, , is incorporated into the total loss. Focal loss dynamically reduces the weight of well-classified examples, allowing the model to focus on harder examples. It is defined as:
where pt is the predicted probability for the true class, is a balancing factor, and
is the focusing parameter. For multi-class segmentation, this loss can be extended by summing over all classes.
The weighting factors ,
, and
allow fine-tuning of the relative importance of segmentation accuracy, anomaly detection performance, and hard example emphasis. These weights are determined empirically based on the specific characteristics of the dataset and the task requirements.
The hybrid loss function ensures that the model effectively learns to segment and localize anomalies while addressing challenges like class imbalance and subtle anomaly patterns, making the ViT-Mamba framework robust and efficient for anomaly detection tasks.
The ViT-Mamba framework integrates a Vision Transformer encoder for global context awareness, enhancing robustness to anomalies of varying sizes. Attention gates in the decoder refine skip connections for precise segmentation, while efficient upsampling and multi-scale refinement ensure computational efficiency. Hierarchical features enable accurate anomaly localization, making ViT-Mamba a powerful and scalable solution for image anomaly detection and segmentation. Algorithm 1 outlines the framework’s detailed steps.
Algorithm 1. ViT-Mamba framework for PCB defect detection.
1: Input: Image , ViT parameters
, Decoder parameters
, Loss weights
2: Output: Predicted segmentation mask
3: 1. Preprocessing & Data Augmentation
4: Introduce artificial defects (e.g., missing hole, spur) and generate binary masks Ma
5: 2. Feature Extraction via ViT
6: Divide I into patches, obtain embeddings Xp, and extract hierarchical features
via ViT
7: 3. Decoding via Mamba-Attention
8: for each feature do
9: Refine skip connection with attention gates:
10:
11: Upsample, concatenate, and pass through decoding blocks
12: end for
13: 4. Segmentation Output
14: Apply convolution to obtain
15: 5. Loss Calculation
16: Compute loss:
17: 6. Optimization
18: Update via backpropagation
return Predicted segmentation mask
Experiments and results
Dataset and implementation details
We utilize the publicly available PCB defect dataset[38] released by the Open Lab on Human-Robot Interaction at Peking University. This dataset comprises 1,386 images containing six types of PCB defects, namely missing hole, mouse bite, open circuit, short, spur, and spurious copper. These defects are introduced into PCB images, making the dataset suitable for tasks such as defect detection, classification, and registration. Fig 5 presents examples from the PCB defect dataset, illustrating one image per defect category with annotated bounding boxes.
During the experimental evaluation, the model was trained for 1000 epochs. The batch size was set to 8, and the learning rate was initialized at 0.0001. To address potential overfitting due to the limited size of the training dataset, data augmentation was employed by applying random rotations within the range of −45 to 45 degrees. The proposed network was implemented using PyTorch, and all experiments were conducted on a workstation equipped with an NVIDIA GeForce RTX 3090Ti GPU.
Comparison with state-of-the-art methods
Table 2 presents a comparative analysis of various supervised and unsupervised methods for PCB defect detection, evaluated across six defect types: Missing Hole, Mouse Bite, Open Circuit, Short, Spur, and Spurious Copper. The results are reported in terms of Average Precision (AP) for each defect type, along with the mean Average Precision (mAP) for overall performance comparison.
Among the SVM-based supervised approaches, Chaudhary et al. achieved the highest mAP of 91.7%, significantly outperforming Zhang et al. and Li et al., whose mAP scores were 56.2% and 40.4%, respectively. The Neural Network-based supervised approaches demonstrated higher accuracy, with Li et al. and Huang et al. both achieving an mAP of 97.7% and 97.5%, respectively. Other deep learning-based methods, such as Lim et al. and Chen et al., reported mAP values exceeding 99.0%, indicating strong performance in supervised learning settings.
In contrast, unsupervised methods varied significantly in performance. While Roth et al. achieved a relatively lower mAP of 73.77%, other methods such as Zeng et al. (93.27%) and Changlin et al. (99.29%) showed strong results. The proposed method, ViT-Mamba, outperformed all existing techniques, achieving the highest mAP of 99.69%, with near-perfect AP values across all defect categories. These results highlight the effectiveness of ViT-Mamba in accurately detecting PCB defects without requiring extensive supervised training. The radar chart in Fig 6 illustrates the AP scores of four top-performing PCB defect detection methods (Changlin et al., ViT Mamba, Zeng et al., and Chaudhary et al.) across six defect categories. Each axis represents a different defect type, and the area covered by each method indicates its overall detection capability. ViT Mamba and Changlin et al. exhibit consistently high performance across all defect types, while Zeng et al. shows slightly lower accuracy in detecting open circuits. The chart highlights variations in method effectiveness, offering an intuitive comparison of strengths and weaknesses in PCB defect detection.
The significant improvement in performance demonstrates the potential of ViT-Mamba as a robust and scalable solution for real-world PCB defect detection. The results indicate that transformer-based architectures, particularly ViT-Mamba, can generalize well to diverse defect types while maintaining superior detection accuracy.
Ablation studies
To assess the contribution of each component in the proposed ViT-Mamba framework, we conducted ablation experiments, as per Table 3, on the PCB defect dataset. The study isolates the impact of three core modules: (1) the Vision Transformer (ViT) encoder, (2) Mamba-inspired attention gates, and (3) the multiscale hierarchical refinement strategy. In each experiment, one component was removed or replaced, while the rest of the model remained unchanged.
The full ViT-Mamba model achieved the highest accuracy. Removing the artificial defect generation module significantly degraded performance, emphasizing its role in addressing data imbalance. Replacing Mamba attention with simple skip connections reduced segmentation precision, and removing multiscale refinement affected the model’s ability to recover fine-grained details. Substituting the ViT encoder with a ResNet backbone also resulted in lower performance, indicating the importance of modeling global dependencies.
These results confirm the effectiveness of each proposed module and validate the overall architecture design of ViT-Mamba for high-precision PCB defect detection.
Discussion
The ViT-Mamba framework, while highly accurate, could face challenges in computational efficiency and real-time deployment, especially on edge devices. Its reliance on ViTs increases processing demands compared to CNNs, and its generalization to real-world PCB images with diverse lighting, resolutions, and textures remains untested. Industrial variations like noise, reflections, and occlusions could impact robustness, requiring further validation. The artificial defect creation module, though useful, may not fully capture all real-world defect patterns, affecting adaptability.
Future research should focus on optimizing computational efficiency through lightweight transformers, model compression (pruning, quantization), and edge AI implementations. Unsupervised and self-supervised learning can enhance domain generalization, while few-shot and zero-shot learning could improve adaptability to novel defects. Multi-modal imaging (thermal, X-ray) may further refine defect detection, and adversarial training can boost robustness against industrial variations. Lastly, explainable AI (XAI) will improve model transparency and trust, supporting broader adoption in automated PCB quality inspection for consumer electronics manufacturing.
Conclusion
This study proposed ViT-Mamba, a hybrid Vision Transformer (ViT) and Mamba-inspired framework for PCB defect detection. By integrating global feature extraction with attention-driven segmentation, the model effectively detects subtle and irregular defects while maintaining computational efficiency. The inclusion of an Artificial Defect Creation Module further enhances robustness by diversifying training data. Experimental results show that ViT-Mamba outperforms existing methods, achieving a mean Average Precision (mAP) of 99.69%, demonstrating its effectiveness for real-world PCB inspection. Future work can focus on optimizing computational efficiency and enhancing generalization to unseen defects. ViT-Mamba contributes to advancing deep learning for industrial anomaly detection, offering a scalable and accurate solution for PCB quality inspection.
References
- 1. Niaz A, Ul Amin S, Soomro S, Zia H, Nam Choi K. Spatially aware fusion in 3D convolutional autoencoders for video anomaly detection. IEEE Access. 2024;12:104770–84.
- 2. Yang B, Zhu C, Li FWB, Wei T, Liang X, Wang Q. IAACS: image aesthetic assessment through color composition and space formation. Virtual Reality & Intelligent Hardware. 2023;5(1):42–56.
- 3. Tian X, Wu Z, Cao J, Chen S, Dong X. ILIDViz: an incremental learning-based visual analysis system for network anomaly detection. Virtual Reality & Intelligent Hardware. 2023;5(6):471–89.
- 4. Wei H, Li Q, Pan J, Chen J, Zhang Y, Qi L, et al. SPSNet: semantic-guided perspective shift network for robust person re-identification in drone imagery. Vis Comput. 2024;41(8):5563–82.
- 5. Zhu X, Yao X, Zhang J, Zhu M, You L, Yang X, et al. TMSDNet: Transformer with multi-scale dense network for single and multi-view 3D reconstruction. Comput Animation Virtual. 2023;35(1).
- 6. Lee J, Kang H. PIPformers: Patch based inpainting with vision transformers for generalize paintings. Computer Animation Virtual. 2024;35(3).
- 7. Lin X, Sun S, Huang W, Sheng B, Li P, Feng DD. EAPT: Efficient Attention Pyramid Transformer for Image Processing. IEEE Trans Multimedia. 2023;25:50–61.
- 8. Wang J, Du H, Zeng Y. PDFormer: Efficient Vision Transformer for Photovoltaic Defect Detection. IEEE Trans Consumer Electron. 2025;71(2):6602–11.
- 9. Gao H, Jiang W, Ran Q, Wang Y. Vision-Language Interaction via Contrastive Learning for Surface Anomaly Detection in Consumer Electronics Manufacturing. IEEE Trans Consumer Electron. 2024;70(3):6119–30.
- 10. Nazir A, Cheema MN, Sheng B, Li P, Li H, Xue G, et al. ECSU-Net: An Embedded Clustering Sliced U-Net Coupled With Fusing Strategy for Efficient Intervertebral Disc Segmentation and Classification. IEEE Trans Image Process. 2022;31:880–93. pmid:34951844
- 11. Li J, Chen J, Sheng B, Li P, Yang P, Feng DD, et al. Automatic Detection and Classification System of Domestic Waste via Multimodel Cascaded Convolutional Neural Network. IEEE Trans Ind Inf. 2022;18(1):163–73.
- 12. Tong K, Wu Y, Zhou F. Recent advances in small object detection based on deep learning: A review. Image and Vision Computing. 2020;97:103910.
- 13.
Fupei Wu, Xianmin Zhang, Yongcong Kuan, Zhenzhen He. An AOI algorithm for PCB based on feature extraction. In: 2008 7th World Congress on Intelligent Control and Automation. 2008. https://doi.org/10.1109/wcica.2008.4592931
- 14. Wang W-C, Chen S-L, Chen L-B, Chang W-J. A machine vision based automatic optical inspection system for measuring drilling quality of printed circuit boards. IEEE Access. 2017;5:10817–33.
- 15. Yuk EH, Park SH, Park C-S, Baek J-G. Feature-Learning-Based Printed Circuit Board Inspection via Speeded-Up Robust Features and Random Forest. Appl Sci. 2018;8(6):932.
- 16. Gaidhane VH, Hote YV, Singh V. An efficient similarity measure approach for PCB surface defect detection. Pattern Anal Applic. 2017;21(1):277–89.
- 17. Tsai D, Hsieh Y. Machine Vision-Based Positioning and Inspection Using Expectation–Maximization Technique. IEEE Trans Instrum Meas. 2017;66(11):2858–68.
- 18. Liu Z, Qu B. Machine vision based online detection of PCB defect. Microprocessors Microsystems. 2021;82:103807.
- 19.
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY. Ssd: Single shot multibox detector. In: Computer Vision–ECCV 2016 : 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I. 2016;21–37.
- 20.
Zhang Q, Liu H. Multi-scale defect detection of printed circuit board based on feature pyramid network. In: 2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA). 2021;911–4. https://doi.org/10.1109/icaica52286.2021.9498174
- 21. Zhong Z, Ma Z. A Novel Defect Detection Algorithm for Flexible Integrated Circuit Package Substrates. IEEE Trans Ind Electron. 2022;69(2):2117–26.
- 22. Zhang Y, Zhang W, Yu J, He L, Chen J, He Y. Complete and accurate holly fruits counting using YOLOX object detection. Comput Electronics Agriculture. 2022;198:107062.
- 23. Tong K, Wu Y. I-YOLO: a novel single-stage framework for small object detection. Vis Comput. 2024;40(12):8927–44.
- 24. Ding R, Dai L, Li G, Liu H. TDD-net: a tiny defect detection network for printed circuit boards. CAAI Trans on Intel Tech. 2019;4(2):110–6.
- 25. Moganti M, Ercal F, Dagli CH, Tsunekawa S. Automatic PCB Inspection Algorithms: A Survey. Computer Vision and Image Understanding. 1996;63(2):287–313.
- 26.
Raihan F, Ce W. PCB defect detection USING OPENCV with image subtraction method. In: 2017 International Conference on Information Management and Technology (ICIMTech). 2017;204–9. https://doi.org/10.1109/icimtech.2017.8273538
- 27.
Raj A, Sajeena A. Defects detection in PCB using image processing for industrial applications. In: 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT). 2018;1077–9.
- 28. Ling Q, Isa NAM. Printed Circuit Board Defect Detection Methods Based on Image Processing, Machine Learning and Deep Learning: A Survey. IEEE Access. 2023;11:15921–44.
- 29.
Li YT, Guo JI. A VGG-16 based faster RCNN model for PCB error inspection in industrial AOI applications. In: 2018 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW). 2018;1–2.
- 30. Hu B, Wang J. Detection of PCB Surface Defects With Improved Faster-RCNN and Feature Pyramid Network. IEEE Access. 2020;8:108335–45.
- 31. Xin H, Chen Z, Wang B. PCB Electronic Component Defect Detection Method based on Improved YOLOv4 Algorithm. J Phys: Conf Ser. 2021;1827(1):012167.
- 32. Zhang Y, Xie F, Huang L, Shi J, Yang J, Li Z. A lightweight one-stage defect detection network for small object based on dual attention mechanism and PAFPN. Front Physics. 2021;9:708097.
- 33. Wu L, Zhang L, Zhou Q. Printed Circuit Board Quality Detection Method Integrating Lightweight Network and Dual Attention Mechanism. IEEE Access. 2022;10:87617–29.
- 34. An K, Zhang Y. LPViT: A Transformer Based Model for PCB Image Classification and Defect Detection. IEEE Access. 2022;10:42542–53.
- 35. Chen W, Huang Z, Mu Q, Sun Y. PCB Defect Detection Method Based on Transformer-YOLO. IEEE Access. 2022;10:129480–9.
- 36. Yang Y, Kang H. An Enhanced Detection Method of PCB Defect Based on Improved YOLOv7. Electronics. 2023;12(9):2120.
- 37. Zou Z, Chen K, Shi Z, Guo Y, Ye J. Object Detection in 20 Years: A Survey. Proc IEEE. 2023;111(3):257–76.
- 38. Huang W, Wei P. A PCB dataset for defects detection and classification. arXiv preprint. 2019.
- 39. Zhang C, Shi W, Li X, Zhang H, Liu H. Improved bare PCB defect detection approach based on deep feature learning. J Eng. 2018;2018(16):1415–20.
- 40. Li Y, Li S. Defect detection of bare printed circuit boards based on gradient direction information entropy and uniform local binary patterns. CW. 2017;43(4):145–51.
- 41.
Chaudhary V, Dave IR, Upla KP. Automatic visual inspection of printed circuit board for defect detection and classification. In: 2017 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET). 2017;732–7. https://doi.org/10.1109/wispnet.2017.8299858
- 42. Li M, Yao N, Liu S, Li S, Zhao Y, Kong SG. Multisensor Image Fusion for Automated Detection of Defects in Printed Circuit Boards. IEEE Sensors J. 2021;21(20):23390–9.
- 43. Huang W, Wei P, Zhang M, Liu H. HRIPCB: a challenging dataset for PCB defects detection and classification. J Eng. 2020;2020(13):303–9.
- 44. Lim J, Lim J, Baskaran VM, Wang X. A deep context learning based PCB defect detection model with anomalous trend alarming system. Result Eng. 2023;17:100968.
- 45. Hu X, Kong D, Liu X, Zhang J, Zhang D. Printed circuit board (PCB) surface micro defect detection model based on residual network with novel attention mechanism. Comput Materi Cont. 2024;78(1).
- 46. Chen X, Wu Y, He X, Ming W. A comprehensive review of deep learning-based PCB defect detection. IEEE Access. 2023;11:139017–38.
- 47. Zeng N, Wu P, Wang Z, Li H, Liu W, Liu X. A Small-Sized Object Detection Oriented Multi-Scale Feature Fusion Approach With Application to Defect Detection. IEEE Trans Instrum Meas. 2022;71:1–14.
- 48.
Roth K, Pemula L, Zepeda J, Schölkopf B, Brox T, Gehler P. Towards total recall in industrial anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition; 2022. p. 14318–28.
- 49. Chen C, Wu Q, Zhang J, Xia H, Lin P, Wang Y. Y. U 2 D 2 PCB: uncertainty-aware unsupervised defect detection on PCB images using reconstructive and discriminative models. IEEE Transactions on Instrumentation and Measurement. 2024.