Objective: In order to better adapt to clinical applications, this paper proposes a cross-validation decision-making fusion method of Vision Transformer and DenseNet161.
Methods: The dataset is the most critical acetic acid image for clinical diagnosis, and the SR areas are processed by a specific method. Then, the Vision Transformer and DenseNet161 models are trained by the fivefold cross-validation method, and the fivefold prediction results corresponding to the two models are fused by different weights. Finally, the five fused results are averaged to obtain the category with the highest probability.
Results: The results show that the fusion method in this paper reaches an accuracy rate of 68% for the four classifications of cervical lesions.
Conclusions: It is more suitable for clinical environments, effectively reducing the missed detection rate and ensuring the life and health of patients.
Copyright © 2022 Ping Li et al.