This work critically evaluates the faithfulness and localization reliability of Grad-CAM for lung cancer CT classification across CNN and Vision Transformer architectures. Results reveal strong model-dependent variability and reduced reliability for transformer-based models, highlighting key limitations of saliency-based XAI in medical imaging.

