Crowd anomaly detection in surveillance videos using hybrid models of autoencoder, GAN, and YOLOv8

Vishvaparathy, S.; Akmal Jahan, M. A. C.

dc.contributor.author	Vishvaparathy, S.
dc.contributor.author	Akmal Jahan, M. A. C.
dc.date.accessioned	2026-04-22T12:39:12Z
dc.date.available	2026-04-22T12:39:12Z
dc.date.issued	2025-10-30
dc.identifier.citation	Conference Proceedings of 14th Annual Science Research Session – 2025 on “NEXT-GEN SOLUTIONS: Bridging Science and Sustainability” on October 30th 2025. Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.. pp. 30.	en_US
dc.identifier.isbn	978-955-627-146-1
dc.identifier.uri	http://ir.lib.seu.ac.lk/handle/123456789/7893
dc.description.abstract	Crowd anomaly detection is an essential aspect in computer vision applications, such as public security monitoring and surveillance in crowded scenes. It is generally not feasible to monitor manually, and consequently, the demand for automatic real-time systems has emerged. Individual and hybrid deep learning approaches, such as Convolutional Autoencoders (AE), Generative Adversarial Networks (GAN), as well as YOLOv8 are presently explored. Although YOLOv8 is not the latest iteration in the series of YOLOs, it remains worth due to its excellent balance between accuracy, speed, and the ability to support several tasks of computer vision at once (detection, segmentation, classification, etc.). Its simple-to-use ecosystem with full documentation and a small API enables it to be used in many applications. Even if newer releases may include improvements in specific areas like parameters or accuracy, YOLOv8 provides a solid, multi-tasking, well-supported solution that is easier to use for most scenarios. On the other hand, AEs are good at recovering the motion patterns, and GANs can achieve anomaly scoring, while YOLOv8 has a more accurate object-level detection. However, none of them have satisfactory performance in complex events. To tackle this issue, a hybrid framework comprising the three models was proposed in this work using decision-level fusion to raise accuracy and reduce false positives. Experimental results on UCSD Ped2 and UMN datasets demonstrate that the proposed hybrid model performed better than single models in terms of precision, recall, F1- score, and AUC. The proposed approach provides a scalable, robust, and real-time solution for a cognitive surveillance system.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.	en_US
dc.subject	Crowd Anomaly Detection	en_US
dc.subject	Autoencoder	en_US
dc.subject	Generative Adversarial Network	en_US
dc.subject	YOLOv8	en_US
dc.subject	Hybrid Deep Learning	en_US
dc.subject	Surveillance	en_US
dc.subject	Real-Time Detection	en_US
dc.title	Crowd anomaly detection in surveillance videos using hybrid models of autoencoder, GAN, and YOLOv8	en_US
dc.type	Article	en_US