THE ROLE OF COMPUTER VISION IN THE MODERN WORLD: ACHIEVEMENTS, CHALLENGES AND PROSPECTS

Authors

DOI:

https://doi.org/10.32782/IT/2024-2-10

Keywords:

artificial intelligence, computer vision, neural networks, artificial neural networks, convolutional neural networks, recurrent neural networks, pattern recognition

Abstract

This scientific article examines the essence of computer vision as a modern technology, its achievements, challenges and prospects. Computer vision plays an important role in various fields such as medicine, automotive, robotics and many others. The article examines the basic principles of computer vision, its application in various fields, comparative analysis of neural networks. The purpose of the study is to choose the topology of convolutional neural networks by conducting a comparative analysis of models of convolutional neural networks and their characteristics, which significantly affect the implementation of typical computer vision tasks. The methodology consists in a comparative analysis of the main classification methods based on wellknown convolutional neural networks, which are focused on the processing and recognition of image patterns, segmentation of text data, as well as sound streams. An analysis of existing research and publications on computer vision technologies, empirical research and evaluation of the effectiveness and accuracy of various approaches to computer vision has been carried out. The scientific novelty of the results obtained in the work consists in determining the main challenges facing the development of computer vision, in particular, the problem of processing large volumes of data, ensuring the accuracy and speed of algorithms, conducting a comprehensive analysis of modern achievements in the field of computer vision, taking into account the latest scientific research and technological development. Conclusions. Neural networks such as Artificial Neural Networks (ANN), Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) have their own advantages and disadvantages that may depend on specific tasks and data. Given this, it is advisable to use them depending on the problems or tasks that need to be solved, namely: ANNs (artificial neural networks) are useful for solving complex problems; CNNs (Convolutional Neural Networks) are the most effective way to solve computer vision problems; RNNs (recurrent neural networks) are capable of natural language processing, useful for time series forecasting.

References

Marta Beltrán, Miguel Calvo. A privacy threat model for identity verification based on facial recognition. 2023. URL: https://doi.org/10.1016/j.cose.2023.103324 (дата звернення: 18.04.2024).

Abhishek Gupta, Alagan Anpalagan, Ling Guan, Ahmed Shaharyar Khwaja. Deep learning for object detection and scene perception in self-driving cars: Survey, challenges, and open issues. 2021. https://doi.org/10.1016/j.array.2021.100057 (дата звернення: 18.04.2024).

Rajasshrie Pillai, Brijesh Sivathanu, Yogesh K. Dwivedi. Shopping intention at AI-powered automated retail stores (AIPA). Journal of Retailing and Consumer Services. 2020. URL: https://doi.org/10.1016/j.jretconser.2020.102207 (дата звернення: 18.04.2024).

Arshi Parvaiz, Muhammad Anwaar Khalid, Rukhsana Zafar, Huma Ameer, Muhammad Ali, Muhammad Moazam Fraz. Vision Transformers in medical computer vision–A contemplative retrospection. Engineering Applications of Artificial Intelligence. 2023. URL: https://doi.org/10.1016/j.engappai.2023.106126 (дата звернення: 19.04.2024).

Jittima Varagula, Panit-a-nong Kulproma, Toshio ITO. Object Detection Method in Traffic by On-Board Computer Vision with Time Delay Neural Network. Procedia Computer Science. 2017. URL: https://doi.org/10.1016/j.procs.2017.08.185 (дата звернення: 19.04.2024).

Phat Thai, Sameer Alam, Nimrod Lilith, Binh T. Nguyen. A computer vision framework using Convolutional Neural Networks for airport-airside surveillance. Transportation Research Part C: Emerging Technologies. 2022. URL: https://doi.org/10.1016/j.trc.2022.103590 (дата звернення: 19.04.2024).

V.G. Dhanya, A. Subeesh, N.L. Kushwaha, Dinesh Kumar Vishwakarma, T. Nagesh Kumar, G. Ritika, A.N. Singh. Deep learning based computer vision approaches for smart agricultural applications. 2022. URL: https://doi.org/10.1016/j.aiia.2022.09.007 (дата звернення: 26.04.2024).

Markus Friedrich, Theresa Gerber, Jonas Dumler, Frank Döpper. A system for automated tool wear monitoring and classification using computer vision. Procedia CIRP. 2023. URL: https://doi.org/10.1016/j.procir.2023.06.073 (дата звернення: 26.04.2024).

Phat Thai, Sameer Alam, Nimrod Lilith, Binh T. Nguyen. An overview of Human Action Recognition in sports based on Computer Vision. Journal Heliyon. 2022. URL: https://doi.org/10.1016/j.heliyon.2022.e09633 (дата звернення: 26.04.2024).

A. Nouriani, R. McGovern, R. Rajamani. Activity recognition using a combination of high gain observer and deep learning computer vision algorithms. Intelligent Systems with Applications. 2023. URL: https://doi.org/10.1016/j.iswa.2023.200213 (дата звернення: 02.05.2024).

Masooma Memon. ANN vs CNN vs RNN: Neural Networks Guide. 2022. URL: https://levity.ai/blog/neural-networks-cnn-ann-rnn (дата звернення: 02.05.2024).

Zahra Elhamraoui, Introduction to convolutional neural network. 2020. URL: https://medium.com/analytics-vidhya/introduction-to-convolutional-neural-network-6942c189a723 (дата звернення: 02.05.2024).

James Le The 5 Computer Vision Techniques That Will Change How You See The World. 2018.

URL: https://heartbeat.comet.ml/the-5-computer-vision-techniques-that-will-change-how-you-see-the-world-1ee19334354b (дата звернення: 02.05.2024).

Jingfei Chang, Yang Lu, Ping Xue, Yiqun Xu, Zhen Wei. Iterative clustering pruning for convolutional neural networks. Knowledge-Based Systems. 2023. URL: https://doi.org/10.1016/j.knosys.2023.110386 (дата звернення: 03.05.2024).

Vidushi Meel. ANN and CNN: Analyzing Differences and Similarities. URL: https://viso.ai/deep-learning/ann-and-cnn-analyzing-differences-and-similarities/ (дата звернення: 03.05.2024).

Recurrent Neural Network Tutorial (RNN). 2022. URL: https://www.datacamp.com/tutorial/tutorial-forrecurrent-neural-network (дата звернення: 03.05.2024).

Kristina Host, Marina Ivašić-Kos. An overview of Human Action Recognition in sports based on Computer Vision. 2022. URL: https://doi.org/10.1016/j.heliyon.2022.e09633 (дата звернення: 03.05.2024).

What is a recurrent network? URL: https://h2o.ai/wiki/recurrent-network/ (дата звернення: 03.05.2024).

Published

2024-07-31