ANALYSIS OF MODERN ALGORITHMS FOR DETECTING AND RECOGNIZING OBJECTS FROM A VIDEO STREAM FOR REAL-TIME PARKING MANAGEMENT SYSTEMS
DOI:
https://doi.org/10.31891/2307-5732-2023-321-3-17-23Keywords:
neural network, architecture, YOLO, CNN, R-CNN, Mask R-CNNAbstract
One of the areas of artificial intelligence is computer vision, which uses deep learning to detect, recognize, and classify objects in images and videos. To make such systems more efficient, methods based on neural networks are often used. The development of deep learning technologies has made it possible to create more accurate and complex computer vision models. Deep learning methods used to recognize objects in video can include a region proposal as part of the system or use non-regional methods based on detector proposals. The article discusses modern, best-known algorithms for object recognition in video. The features of different architectural solutions of neural networks are described. A study of publications on the problems of video data analysis has indicated the priority of using algorithms based on the convolutional neural network architecture. The paper pays more attention to such architectural solutions as YOLO and Mask R-CNN. Performance, processing speed, and accuracy are compared. The results of the study show that YOLO is one of the most advanced real-time object detection systems that processes images at a speed of 45 to 150 frames per second and has an mAP of 63.4% on the MS-COCO test set, and, for example, Mini-YOLOv3 reaches an mAP of 52.1% at 67 frames per second. However, if we compare different versions and modifications of YOLO with other systems, we can say with certainty that YOLO makes more localization errors. Mask R-CNN is an extension of Faster R-CNN, where the object mask prediction and bounding box recognition are performed in parallel. Thus, it was found that Mask R-CNN is best suited for a parking management system that can track free parking spaces from a camera video stream. This neural network has a number of advantages compared to R-CNN, Fast R-CNN, and YOLO. The main advantages of Mask R-CNN are performance and accuracy.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 ДМИТРО МАРЧУК (Автор)

This work is licensed under a Creative Commons Attribution 4.0 International License.