Research Article

Explainable Transformer-Based Skin Lesion Classification from Clinical Images

Authors

  • Ahmed Ali Linkon Department of Computer Science, Westcliff University, Irvine, CA 92614, USA
  • Mostafizur Rahman Shakil Department of Engineering Management, Westcliff University, Irvine, CA 92614, USA
  • Shahriar Ahmed School of Business, International American University, 3440 Wilshire Blvd STE 1000, Los Angeles, CA 90010, USA
  • Md Rashel Miah Department of Business Administration, Westcliff University, Irvine, CA 92614, USA
  • Asif Hassan Malik Department of Chemistry, York College, The City University of New York (CUNY), Jamaica, NY 11451, USA

Abstract

Early and reliable detection of skin cancer is critical for reducing disease burden and improving patient outcomes, yet large-scale screening remains constrained by limited specialist availability and heterogeneous image acquisition conditions. This paper presents an efficient transformer-based framework for automated multiclass skin lesion classification, centered on an EfficientViT architecture designed to balance representational capacity and computational efficiency. The proposed approach is evaluated against lightweight transformer and CNN baselines, including DeiT-Tiny, Axial Attention Transformer, Swin Transformer-Tiny, and EfficientNetV2-S, using the PAD-UFES-20 dataset comprising 2,298 smartphone-acquired clinical images across six lesion categories. Experimental results show that EfficientViT achieves superior performance, reaching 99.40% accuracy and 99.78% PR-AUC, indicating robust discrimination under real-world acquisition variability. To enhance transparency and support clinical interpretability, Grad-CAM visual explanations are integrated to highlight lesion-relevant regions driving model predictions. Overall, the results demonstrate that EfficientViT provides an accurate and interpretable solution for practical skin lesion screening using consumer-grade images.

Article information

Journal

Journal of Medical and Health Studies

Volume (Issue)

7 (5)

Pages

46-55

Published

2026-03-08

How to Cite

Ahmed Ali Linkon, Mostafizur Rahman Shakil, Shahriar Ahmed, Md Rashel Miah, & Asif Hassan Malik. (2026). Explainable Transformer-Based Skin Lesion Classification from Clinical Images . Journal of Medical and Health Studies, 7(5), 46-55. https://doi.org/10.32996/jmhs.2026.7.5.7

Downloads

Views

26

Downloads

4

Keywords:

Skin cancer detection, skin lesion classification, deep learning, Vision Transformer, Grad-CAM, PR-AUC, Explainable Artificial Intelligences.