Enhancing human computer interaction with coot optimization and deep learning for multi language identification

Affiliations
  • 1Candidate of Economic Sciences, Department of Economics and Management, Kazan Federal University, Elabuga Institute of KFU, Elabuga, 423604, Russia.
  • 2Moscow Aviation Institute (National Research University), Moscow, 125080, Russia.
  • 3Candidate of Economic Sciences, Department of Innovative Entrepreneurship, Bauman Moscow State Technical University, Moscow, 105005, Russia.
  • 4Candidate of Medical Sciences, Department of Prosthetic Dentistry, Sechenov First Moscow State Medical University, Moscow, 119991, Russia.
  • 5Candidate of Economic Sciences, Department of Management, Kuban State Agrarian University named after I.T. Trubilin, Krasnodar, 350044, Russia.
  • 6Department of AI Software, Kangwon National University, Samcheok, 25913, Republic of Korea. joshi@kangwon.ac.kr.
  • 7Department of Electronics, Information and Communication Engineering, Kangwon National University, Samcheok, 25913, Republic of Korea. wcho@kangwon.ac.kr.

|

Abstract

Human-Computer Interaction (HCI) is a multidisciplinary field focused on designing and utilizing computer technology, underlining the interaction interface between computers and humans. HCI aims to generate systems that allow consumers to relate to computers effectively, efficiently, and pleasantly. Multiple Spoken Language Identification (SLI) for HCI (MSLI for HCI) denotes the ability of a computer system to recognize and distinguish various spoken languages to enable more complete and handy interactions among consumers and technology. SLI utilizing deep learning (DL) involves using artificial neural networks (ANNs), a subset of DL models, to automatically detect and recognize the language spoken in an audio signal. DL techniques, particularly neural networks (NNs), have succeeded in various pattern detection tasks, including speech and language processing. This paper develops a novel Coot Optimizer Algorithm with a DL-Driven Multiple SLI and Detection (COADL-MSLID) technique for HCI applications. The COADL-MSLID approach aims to detect multiple spoken languages from the input audio regardless of gender, speaking style, and age. In the COADL-MSLID technique, the audio files are transformed into spectrogram images as a primary step. Besides, the COADL-MSLID technique employs the SqueezeNet model to produce feature vectors, and the COA is applied to the hyperparameter range of the SqueezeNet method. The COADL-MSLID technique exploits the SLID process’s convolutional autoencoder (CAE) model. To underline the importance of the COADL-MSLID technique, a series of experiments were conducted on the benchmark dataset. The experimentation validation of the COADL-MSLID technique exhibits a greater accuracy result of 98.33% over other techniques.

Related Concept Videos

JoVE Research Video for Improving Translational Accuracy 02:07

2.4K

Human-Computer Interaction (HCI) is a multidisciplinary field focused on designing and utilizing computer technology, underlining the interaction interface between computers and humans. HCI aims to generate systems that allow consumers to relate to computers effectively, efficiently, and pleasantly. Multiple Spoken Language Identification (SLI) for HCI (MSLI for HCI) denotes the ability of a computer system […]

JoVE Research Video for Force Classification 01:22

954

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,…

JoVE Research Video for Multi-input and Multi-variable systems 01:22

38

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence…

JoVE Research Video for Types of Errors: Detection and Minimization 01:12

933

Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or…

JoVE Research Video for Classification of Systems-II 01:31

80

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,

Discrete-time systems have input and output signals at specific intervals, defined at distinct instants by difference equations. An example…

JoVE Research Video for Machines: Problem Solving II 01:30

221

Machines are complex structures consisting of movable, pin-connected multi-force members that work together to transmit forces. Consider a lifting tong carrying a 100 kg load. It comprises movable sections DAF and CBG linked together with member AB.

Assuming the structure is symmetrical at point E, the force acting on member AB can be calculated.
First, a free-body diagram indicating the forces acting at point E is considered. The weight of the load that equals the mass times the acceleration…