Cổng tri thức PTIT

Trang chủ

Giới thiệu

AI Cộng đồng

Kho tri thức

Tin tức

Liên hệ

Bài báo quốc tế

Kho tri thức

Bài báo quốc tế

Enhancing Medical Image Classification with Noise-Injected Multi-Head Attention

Nguyễn Năng Hùng Vân

Vision Transformers (ViTs) have shown strong potential in medical image classification due to their ability to model long-range dependencies. Despite this advantage, the deterministic nature of the standard Multi-Head Attention (MHA) mechanism can lead to overfit ting and reduced robustness, especially when working with noisy and heterogeneous medical datasets. To address this issue, we introduce a modified attention mechanism called Noise-Injected Multi-Head Atten tion (NIMHA), which integrates controlled Gaussian noise into the key and value projections of MHA. This stochastic regularization approach enhances feature learning and model generalization while maintaining computational efficiency and compatibility with existing ViT architec tures. We evaluate NIMHA on two public datasets: Brain Tumor MRI and CT Kidney. Experimental results show that ViTs with NIMHA out perform baseline ViTs in classification accuracy, particularly on the more complex Brain Tumor MRI dataset. In addition, models with NIMHA exhibit more stable training behavior and faster convergence. Attention map analysis further reveals that the proposed method promotes a more distributed focus, improving the model’s ability to generalize to diverse clinical data. These findings suggest that incorporating noise-based reg ularization into attention mechanisms is a practical strategy to enhance the robustness and reliability of ViT-based models for medical imaging tasks.

Xuất bản trên:

Enhancing Medical Image Classification with Noise-Injected Multi-Head Attention

Ngày đăng:

2025

DOI:

Nhà xuất bản:

Địa điểm:

Từ khoá:

Medical Image Classification, Multi-Head Attention, Noise Injection, Regularization Techniques, Vision Transformers.

Bài báo liên quan

The inheritance and development of journalistic illustration from the Indochina School Of Fine Arts (1945–1975) to online journalism in VietNam: Transformation and influence

Hà Thị Hồng Ngân

Optimizing Mixed-Resolution ADC Allocation Under Bit-Budget Constraints in LDPC-Coded Massive MIMO

Đặng Ngọc Hùng

GT-FID: A Graph-Temporal Fusion Network for Host-Based Intrusion Detection from System Call Sequences

Đỗ Phúc Hảo

Resilient Edge Computing: An Elixir-BEAM Architecture for IoT Gas Leakage Detection

TRIET NGUYEN

Mind the Gap: On the Practical Utility of SHAP for Deep Learning-Based Intrusion Detection

Đỗ Phúc Hảo

Optimizing Mixed-Resolution ADC Allocation Under Bit-Budget Constraints in LDPC-Coded Massive MIMO

Đặng Ngọc Hùng

Convex Hull-Based Coreset Selection for Identifying Differentially Expressed Genes in Pediatric Sepsis

Nguyễn Kiều Linh