Crowd anomaly detection in surveillance videos using hybrid models of autoencoder, GAN, and YOLOv8

Vishvaparathy, S.; Akmal Jahan, M. A. C.

Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/7893

Title:	Crowd anomaly detection in surveillance videos using hybrid models of autoencoder, GAN, and YOLOv8
Authors:	Vishvaparathy, S.; Akmal Jahan, M. A. C.
Keywords:	Crowd Anomaly Detection; Autoencoder; Generative Adversarial Network; YOLOv8; Hybrid Deep Learning; Surveillance; Real-Time Detection
Issue Date:	30-Oct-2025
Publisher:	Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.
Citation:	Conference Proceedings of 14th Annual Science Research Session – 2025 on “NEXT-GEN SOLUTIONS: Bridging Science and Sustainability” on October 30th 2025. Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai. p. 30.
Abstract:	Crowd anomaly detection is an essential task in computer vision applications such as public security monitoring and surveillance of crowded scenes. Manual monitoring is generally not feasible, which has created demand for automatic real-time systems. This work explores individual and hybrid deep learning approaches based on Convolutional Autoencoders (AE), Generative Adversarial Networks (GAN), and YOLOv8. Although YOLOv8 is not the latest iteration in the YOLO series, it remains worthwhile because of its balance between accuracy and speed and its support for several computer vision tasks (detection, segmentation, classification, etc.). Its easy-to-use ecosystem, with full documentation and a small API, makes it suitable for many applications. Even if newer releases include improvements in specific areas such as parameters or accuracy, YOLOv8 provides a solid, multi-task, well-supported solution that is easier to use in most scenarios. AEs are good at recovering motion patterns, GANs can provide anomaly scoring, and YOLOv8 offers more accurate object-level detection. However, none of them performs satisfactorily on complex events when used alone. To address this, a hybrid framework combining the three models is proposed, using decision-level fusion to raise accuracy and reduce false positives. Experimental results on the UCSD Ped2 and UMN datasets demonstrate that the proposed hybrid model outperforms the single models in terms of precision, recall, F1-score, and AUC. The proposed approach provides a scalable, robust, and real-time solution for cognitive surveillance systems.
URI:	http://ir.lib.seu.ac.lk/handle/123456789/7893
ISBN:	978-955-627-146-1
Appears in Collections:	14th Annual Science Research Session
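
The abstract names decision-level fusion of the AE, GAN, and YOLOv8 outputs but does not state the fusion rule. The following is a minimal sketch of one common form, a weighted average of per-frame scores followed by a threshold. It is an illustration under stated assumptions, not the authors' method: the `FrameScores` type, the `fuse_scores` and `is_anomalous` helpers, the equal weights, the 0.5 threshold, and the premise that each model's output has already been normalized to [0, 1] are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FrameScores:
    """Per-frame anomaly scores, assumed already normalized to [0, 1]."""
    ae: float    # assumed: normalized autoencoder reconstruction error
    gan: float   # assumed: normalized GAN-based anomaly score
    yolo: float  # assumed: object-level anomaly score derived from detections


def fuse_scores(scores: FrameScores, weights=(1.0, 1.0, 1.0)) -> float:
    """Combine the three model scores into one value by weighted averaging."""
    w_ae, w_gan, w_yolo = weights
    total = w_ae + w_gan + w_yolo
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted = w_ae * scores.ae + w_gan * scores.gan + w_yolo * scores.yolo
    return weighted / total


def is_anomalous(scores: FrameScores, threshold: float = 0.5,
                 weights=(1.0, 1.0, 1.0)) -> bool:
    """Flag a frame when the fused score reaches the (assumed) threshold."""
    return fuse_scores(scores, weights) >= threshold


if __name__ == "__main__":
    agree = FrameScores(ae=0.9, gan=0.8, yolo=0.7)      # all three models alarmed
    lone_spike = FrameScores(ae=0.9, gan=0.1, yolo=0.1)  # only the AE alarmed
    print(is_anomalous(agree), is_anomalous(lone_spike))
```

With equal weights, a frame on which only one model raises a high score stays below the threshold, which is one way a fused decision can reduce false positives compared with any single model. In practice the weights and threshold would be tuned on validation data.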

Files in This Item:

File	Description	Size	Format
ASRS2025-Original-53.pdf		23.86 kB	Adobe PDF
