Bài báo quốc tế
SiCLIP: An explainable multimodal framework for silicosis diagnosis
Lê Minh Duy
Silicosis is a serious occupational lung disease caused by exposure to crystalline silica dust and remains difficult to detect early in at-risk worker populations. In this paper, we introduce the Silicosis Diagnosis Dataset (SDD), which comprises chest X-ray images and structured patient-profile information, including harmful habits and clinical symptoms. To exploit this multimodal dataset, we propose SiCLIP, a multimodal retrieval framework based on CLIP-ViT for silicosis screening and binary classification on SDD. SiCLIP learns a shared embedding space for chest X-ray images and patient profiles and performs retrieval-based aggregation for prediction. On the internally evaluated SDD benchmark, SiCLIP achieves higher accuracy and F1-score than several strong image-only deep learning baselines and the compared multimodal VLM baseline. In addition, SiCLIP provides case-based interpretability by grounding predictions in retrieved similar cases, complemented by supportive saliency visualizations. These results suggest that multimodal retrieval is a promising approach for silicosis screening support in occupationally exposed populations, while external validation remains necessary before broader clinical deployment.
Xuất bản trên:
SiCLIP: An explainable multimodal framework for silicosis diagnosis
Ngày đăng:
2026
Nhà xuất bản:
Artificial Intelligence in Medicine
Địa điểm:
Từ khoá:
Silicosis detection; Vision-language models; Explainable framework
Bài báo liên quan
MIST: A Multilingual Dataset and Benchmark for Fine-Grained Audio Inpainting Tampering Localization
Vũ Sơn TùngNF-DCL: Enhancing video anomaly detection with synthetic normal features and Debiased Contrastive Learning
Nguyễn Thu NgaA Workflow-Oriented Architecture Integrating Large Language Models for Automated Multi-Platform Content Management
Nguyễn Tất Thắng