LymphAware

🧬 LymphAware: Domain-Aware Bias Disruption for Reliable Lymphoma Cancer AI Diagnosis

Teerapong Panboonyuen
College of Computing, Khon Kaen University
πŸŽ“ Supported by the Talent Scholarship for Exceptional Ability

Peer-Reviewed & Accepted at IEEE Access (February 2026) πŸŽ‰
DOI: 10.1109/ACCESS.2026.3667575

LymphAware Architecture

πŸš€ Overview

LymphAware is a domain-aware bias disruption framework designed to improve the reliability, robustness, and clinical relevance of AI systems for lymphoma histopathology diagnosis.

Modern medical AI models often achieve high accuracy by exploiting non-biological shortcuts β€” such as stain color, scanner signatures, or slide artifacts β€” instead of true pathological morphology. While effective in-domain, these shortcuts lead to fragile performance under cross-center variability, which is unacceptable for clinical deployment.

LymphAware explicitly addresses this challenge by separating morphology-relevant signals from shortcut-driven acquisition factors, enabling models to β€œthink more like pathologists.” πŸ§ πŸ”¬


✨ Key Innovations

πŸ”Ή Tri-Path Morphology Purification Architecture

πŸ”Ή Artifact-Shift Counterfactual Training

πŸ”Ή Domain-Aware Robustness Without Explicit Labels


πŸ“Š Qualitative Results β€” Shortcut Suppression

Qualitative Results

Models trained without LymphAware rely heavily on stain tone, background artifacts, and acquisition noise. With LymphAware, attention shifts toward diagnostically meaningful lymphoid morphology.


πŸ“ˆ Cross-Center Performance

Performance Tables

Across five independent medical centers:

βœ… Higher AUC βœ… Lower false positive rates βœ… Reduced variance across backbones βœ… Stronger causal consistency metrics


πŸ† Acceptance Evidence

IEEE Acceptance

This work has been peer-reviewed and accepted for publication in IEEE Access, highlighting its contribution to reliable medical AI research.


πŸ“– Official Publication

πŸ† IEEE Access (Early Access, February 2026)

πŸ”— Official IEEE Xplore Link
https://ieeexplore.ieee.org/document/11408775

πŸ“Œ DOI:
10.1109/ACCESS.2026.3667575


🧠 Why LymphAware Matters

Medical AI systems must be:

LymphAware moves the field closer to trustworthy computational pathology by addressing shortcut bias at the representation level, rather than relying solely on dataset curation or domain labels.


πŸš€ Training LymphAware

We provide a clean, reproducible PyTorch pipeline located in the src/ directory for training LymphAware across multi-center lymphoma datasets.

The framework is backbone-agnostic and supports:


πŸ“‚ Project Structure

LymphAware/
β”‚
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ train_lymphaware.py
β”‚   β”œβ”€β”€ models/
β”‚   β”œβ”€β”€ datasets/
β”‚   β”œβ”€β”€ losses/
β”‚   └── utils/
β”‚
β”œβ”€β”€ data/
β”‚   β”œβ”€β”€ CenterA/
β”‚   β”œβ”€β”€ CenterB/
β”‚   β”œβ”€β”€ CenterC/
β”‚   β”œβ”€β”€ CenterD/
β”‚   └── CenterE/
β”‚
└── outputs/

Each center directory should contain class folders:

CenterA/
    CLL/
    FL/
    MCL/

βš™οΈ Installation

git clone https://github.com/kaopanboonyuen/LymphAware.git
cd LymphAware

conda create -n lymphaware python=3.10
conda activate lymphaware

pip install -r requirements.txt

▢️ Training Example

Train on a specific center (e.g., Center A):

python src/train_lymphaware.py \
    --train_dir data/CenterA/train \
    --val_dir data/CenterA/test \
    --backbone resnet50 \
    --epochs 100 \
    --batch_size 16 \
    --lr 3e-4

πŸ”¬ Training with Vision Transformer (Best Performance)

python src/train_lymphaware.py \
    --train_dir data/CenterA/train \
    --val_dir data/CenterA/test \
    --backbone vit_large_patch16_224 \
    --epochs 100

πŸ’Ύ Outputs

Training artifacts will be saved to:

outputs/
    best_model.pth

The script automatically:

βœ… Tracks validation AUC βœ… Computes False Positive Rate (FPR) βœ… Saves the best checkpoint βœ… Supports GPU acceleration


πŸ§ͺ Multi-Center Reproduction (Centers A–E)

To reproduce the paper results:

  1. Train a model per center
  2. Evaluate cross-domain performance
  3. Average metrics across runs

Example loop:

for CENTER in CenterA CenterB CenterC CenterD CenterE
do
  python src/train_lymphaware.py \
      --train_dir data/${CENTER}/train \
      --val_dir data/${CENTER}/test \
      --backbone vit_large_patch16_224
done

⭐ Research Tips (From the Paper)

For best performance reported in IEEE Access:


🧠 Why This Training Matters

Unlike standard pipelines, LymphAware training:

The model learns cancer morphology β€” not acquisition artifacts.


If you find this work useful, please ⭐ star the repository.


πŸ™ Acknowledgement

This research is supported by:

πŸŽ“ Talent Scholarship for Exceptional Ability 🏫 College of Computing, Khon Kaen University


🌟 Final Note

LymphAware learns the cancer β€” not the confounders.

By enforcing morphology-grounded representations and suppressing shortcut bias, we aim to build AI systems that clinicians can truly trust.


⭐ If you find this project useful, please consider starring the repository!


πŸ“š BibTeX Citation

@article{panboonyuen2026lymphaware,
  author    = {Teerapong Panboonyuen},
  title     = {LymphAware: Domain-Aware Bias Disruption for Reliable Lymphoma Cancer AI Diagnosis},
  journal   = {IEEE Access},
  year      = {2026},
  pages     = {1--1},
  doi       = {10.1109/ACCESS.2026.3667575},
  publisher = {IEEE}
}

If you use this work in your research, please cite the official IEEE version via the DOI above.