Article contents
Swin Transformer–Driven Cervical Cell Classification with Explainable AI and Web-Based Screening
Abstract
Accurate interpretation of cervical cytology images is essential for effective cervical cancer screening, yet manual assessment is time-consuming and subject to observer variability. This paper presents a transformer-based deep learning framework for automated cervical cell classification using Pap smear images. We conduct a systematic evaluation of modern attention-driven architectures, including MaxViT, Swin Transformer, EfficientFormer, and HorNet, under a unified preprocessing and training pipeline designed to handle staining variability and class imbalance. To enhance model transparency and clinical trust, explainable AI is integrated via Grad-CAM, enabling visual localization of cytomorphological regions that drive model decisions. Experiments on the Herlev and SIPaKMeD datasets demonstrate that the proposed Swin Transformer achieves superior and consistent performance, reaching 99.27% accuracy on Herlev and 98.82% accuracy on SIPaKMeD, with high MCC and PR-AUC values. In addition, a lightweight web-based application is developed to support dataset selection, real-time inference, confidence reporting, and visual explanation. The results confirm that hierarchical transformer architectures can deliver accurate, interpretable, and deployable solutions for computer-aided cervical cancer screening.
Article information
Journal
Journal of Medical and Health Studies
Volume (Issue)
7 (5)
Pages
25-35
Published
Copyright
Copyright (c) 2026 https://creativecommons.org/licenses/by/4.0/
Open access

This work is licensed under a Creative Commons Attribution 4.0 International License.

Aims & scope
Call for Papers
Article Processing Charges
Publications Ethics
Google Scholar Citations
Recruitment