Bài báo quốc tế
Kho tri thức
/
Bài báo quốc tế
/
NF-DCL: Enhancing video anomaly detection with synthetic normal features and Debiased Contrastive Learning
NF-DCL: Enhancing video anomaly detection with synthetic normal features and Debiased Contrastive Learning
Nguyễn Thu Nga
Video anomaly detection (VAD) is a challenging task, as it involves detecting anomalies at the frame level in a video stream with video-level supervision. The majority of existing weakly-supervised VAD methodologies, which employ Multiple-Instance Learning (MIL) along with a top-
ranking loss, often exhibit high false alarm rates for normal frames within abnormal videos. This pitfall comes from two main factors: the limited discriminatory capacity of the top-
ranking loss and the inefficiency of MIL in aggregating information from normal video segments. In contrast, semi-supervised VAD approaches that rely solely on normal video data during training often enhance the prediction of normal frames within abnormal videos, but at the expense of accuracy in detecting abnormal frames. To enhance the advantages of both approaches, we propose a two-stage method, namely Normalizing Flow-Debiased Contrastive Learning (NF-DCL). In the first stage, a Normalizing Flow model is employed to learn the distribution of normal feature representations and generate diverse synthetic normal samples. In the second stage, these synthesized features guide a Debiased Contrastive Loss integrated with MIL and top-
ranking loss, enhancing feature discrimination while alleviating label ambiguity and biased learning. Extensive experiments on UCF-Crime, ShanghaiTech, and XD-Violence demonstrate that NF-DCL consistently improves performance and significantly reduces false alarms, outperforming existing state-of-the-art methods.
Xuất bản trên:
NF-DCL: Enhancing video anomaly detection with synthetic normal features and Debiased Contrastive Learning
Ngày đăng:
2026
Nhà xuất bản:
Computer Vision and Image Understanding
Địa điểm:
Từ khoá:
Video anomaly detection; Synthetic normal features; Contrastive learning; Multiple-instance learning; Weakly-supervised learning; Semi-supervised learning
Bài báo liên quan
MIST: A Multilingual Dataset and Benchmark for Fine-Grained Audio Inpainting Tampering Localization
Vũ Sơn TùngA Workflow-Oriented Architecture Integrating Large Language Models for Automated Multi-Platform Content Management
Nguyễn Tất ThắngDevelopment of an Offline RAG Chatbot for Answering Food Hygiene and Safety Questions Based on Vietnamese Legal Frameworks
Nguyễn Tất Thắng