Search Results

Results (27)

Search Parameters:

Keyword: Speech
Open Access Article
9 Pages, 3,765 KB

Integrating Speech and Gesture for Generating Reliable Robotic Task Configuration

Advances in Science, Technology and Engineering Systems Journal, Volume 9, Issue 4, Page # 51–59, 2024; DOI: 10.25046/aj090406
Abstract:

This paper presents a system that combines speech and pointing gestures along with four distinct hand gestures to precisely identify both the object of interest and parameters for robotic tasks. We utilized skeleton landmarks to detect pointing gestures and determine their direction, while a pre-trained model, trained on 21 hand landmarks from 2D images, was…

(This article belongs to the SP16 (Special Issue on Computing, Engineering and Multidisciplinary Sciences 2024) & Section Robotics (ROB))
Open Access Article
7 Pages, 1,119 KB

Bangla Speech Emotion Detection using Machine Learning Ensemble Methods

Advances in Science, Technology and Engineering Systems Journal, Volume 7, Issue 6, Page # 70–76, 2022; DOI: 10.25046/aj070608
Abstract:

Emotion is the most important component of being human, and very essential for everyday activities, such as the interaction between people, decision making, and learning. In order to adapt to the COVID-19 pandemic situation, most of the academic institutions relied on online video conferencing platforms to continue educational activities. Due to low bandwidth in many…

(This article belongs to Section Artificial Intelligence in Computer Science (CAI))
Open Access Article
11 Pages, 1,735 KB

Emotion Mining from Speech in Collaborative Learning

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 5, Page # 90–100, 2021; DOI: 10.25046/aj060512
Abstract:

Affective states, a dimension of attitude, have a critical role in the learning process. In the educational setting, affective states are commonly captured by self-report tools or based on sentiment analysis on asynchronous textual chats, discussions, or students’ journals. Drawbacks of such tools include: distracting the learning process, demanding time and commitment from students to…

(This article belongs to the SP11 (Special Issue on Innovation in Computing, Engineering Science & Technology 2021) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
14 Pages, 3,051 KB

An Alternative Approach for Thai Automatic Speech Recognition Based on the CNN-based Keyword Spotting with Real-World Application

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 4, Page # 278–291, 2021; DOI: 10.25046/aj060431
Abstract:

Automatic speech recognition (ASR) is a key technology for preventing an ongoing global coronavirus epidemic. Due to the limited corpus database and the morphological diversity of the Thai language, Thai speech recognition is still difficult. In this research, the automatic speech recognition model was built differently from the traditional Thai NLP systems by using…

(This article belongs to the SP11 (Special Issue on Innovation in Computing, Engineering Science & Technology 2021) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
15 Pages, 468 KB

A Model for the Application of Automatic Speech Recognition for Generating Lesson Summaries

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 2, Page # 526–540, 2021; DOI: 10.25046/aj060260
Abstract:

Automatic Speech Recognition (ASR) technology has the potential to improve the learning experience of students in the classroom. This article addresses some of the key theoretical areas identified in the pursuit of implementing a speech recognition system, capable of lesson summary generation in the educational setting. The article discusses: some of the applications of…

(This article belongs to the SP10 (Special Issue on Multidisciplinary Sciences and Engineering 2020-21) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
8 Pages, 1,459 KB

Deaf Chat: A Speech-to-Text Communication Aid for Hearing Deficiency

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 5, Page # 826–833, 2020; DOI: 10.25046/aj0505100
Abstract:

Hearing impairments have a negative impact in the lives of individuals living with them and those around such individuals. Different applications and technological tools have been developed to help reduce this negative impact. Most mobile applications that have been developed that use Speech-to-Text technology have been inconsistent such that they are not inclusive of all…

(This article belongs to the SP9 (Special Issue on Multidisciplinary Innovation in Engineering Science & Technology 2020) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
13 Pages, 1,647 KB

Distributed Microphone Arrays, Emerging Speech and Audio Signal Processing Platforms: A Review

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 331–343, 2020; DOI: 10.25046/aj050439
Abstract:

Given ubiquitous digital devices with recording capability, distributed microphone arrays are emerging recording tools for hands-free communications and spontaneous tele-conferencings. However, the analysis of signals recorded with diverse sampling rates, time delays, and qualities by distributed microphone arrays is not straightforward and entails important considerations. The crucial challenges include the unknown/changeable geometry of distributed arrays,…

(This article belongs to the SP9 (Special Issue on Multidisciplinary Innovation in Engineering Science & Technology 2020) & Section Telecommunications (TEL))
Open Access Article
9 Pages, 870 KB

Noise Cancellation Algorithm Based on Air- and Bone-Conducted Speech Signals by Considering an Unscented Transformation Method

Advances in Science, Technology and Engineering Systems Journal, Volume 4, Issue 2, Page # 305–313, 2019; DOI: 10.25046/aj040239
Abstract:

Noise control is essential when applying speech recognition in noisy environments such as factories. In this study, a signal processing for noise cancellation is proposed by using a noise-insensitive bone-conducted speech signal together with an air-conducted speech signal. The speech signal is generally expressed by a nonlinear model. The extended Kalman filter is very famous…

(This article belongs to the SP7 (Special Issue on Advancement in Engineering and Computer Science 2019) & Section Electronic Engineering (EEE))
Open Access Article
4 Pages, 1,336 KB

Difference in Speech Analysis Results by Coding

Advances in Science, Technology and Engineering Systems Journal, Volume 3, Issue 5, Page # 488–491, 2018; DOI: 10.25046/aj030555
Abstract:

Mental health disorder is becoming a social problem, and there is a need for technology that can easily check for states of stress and depression as a countermeasure. Conventional methods of diagnostic support and screening include self-administered psychological tests and use of biomarkers. However, there are problems such as burden on subjects, examination costs, dedicated…

(This article belongs to the SP6 (Special Issue on Recent Advances in Engineering Systems 2018-19) & Section Acoustics (ACO))
Open Access Article
9 Pages, 1,480 KB

Amplitude-Frequency Analysis of Emotional Speech Using Transfer Learning and Classification of Spectrogram Images

Advances in Science, Technology and Engineering Systems Journal, Volume 3, Issue 4, Page # 363–371, 2018; DOI: 10.25046/aj030437
Abstract:

Automatic speech emotion recognition (SER) techniques based on acoustic analysis show high confusion between certain emotional categories. This study used an indirect approach to provide insights into the amplitude-frequency characteristics of different emotions in order to support the development of future, more efficiently differentiating SER methods. The analysis was carried out by transforming short 1-second…

(This article belongs to the SP5 (Special Issue on Multidisciplinary Sciences and Engineering 2018) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
6 Pages, 885 KB

Emotional state recognition in speech signal

Advances in Science, Technology and Engineering Systems Journal, Volume 2, Issue 3, Page # 1654–1659, 2017; DOI: 10.25046/aj0203205
Abstract:

The matters regarding speech signal processing and analyzing in terms of emotional states recognition were presented in this paper. An experiment was conducted to perform both objective and subjective emotional states recognition tests for Polish language.

(This article belongs to the SP3 (Special issue on Recent Advances in Engineering Systems 2017) & Section Telecommunications (TEL))
Open Access Article
7 Pages, 2,708 KB

Analysis of Emotions and Movements of Asian and European Facial Expressions

Advances in Science, Technology and Engineering Systems Journal, Volume 9, Issue 1, Page # 42–48, 2024; DOI: 10.25046/aj090105
Abstract:

The aim of this study is to develop an advanced framework that not only recognizes the dominant facial emotion, but also contains modules for gesture recognition and text-to-speech recognition. Each module is meticulously designed and integrated into a unified system. The implemented models have been revised, with the results presented through graphical representations, providing prevalent emotions…

(This article belongs to the SP15 (Special Issue on Innovation in Computing, Engineering Science & Technology 2023) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
16 Pages, 1,654 KB

Machine Learning Algorithms for Real Time Blind Audio Source Separation with Natural Language Detection

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 5, Page # 125–140, 2021; DOI: 10.25046/aj060515
Abstract:

The Conv-TasNet and Demucs algorithms can differentiate between two mixed signals, such as music and speech, where the mixing operation proceeds without any supporting information. The Conv-TasNet algorithm uses a convolutional time-domain audio separation network, while the Demucs algorithm uses a new waveform-to-waveform model. The Demucs algorithm utilizes a procedure like the audio…

(This article belongs to the SP11 (Special Issue on Innovation in Computing, Engineering Science & Technology 2021) & Section Artificial Intelligence in Computer Science (CAI))
Open Access Article
11 Pages, 2,431 KB

The Design and Implementation of Intelligent English Learning Chabot based on Transfer Learning Technology

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 5, Page # 32–42, 2021; DOI: 10.25046/aj060505
Abstract:

Chatbot operates task-oriented customer services in special and open domains at different mobile devices. Its related products such as knowledge base Question-Answer System also benefit daily activities. Chatbot functions generally include automatic speech recognition (ASR), natural language understanding (NLU), dialogue management (DM), natural language generation (NLG) and speech synthesis (SS). In this paper, we proposed…

(This article belongs to the SP11 (Special Issue on Innovation in Computing, Engineering Science & Technology 2021) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
13 Pages, 1,145 KB

Dependency Head Annotation for Myanmar Dependency Treebank

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 6, Page # 788–800, 2020; DOI: 10.25046/aj050694
Abstract:

Complete manual annotation of a dependency treebank requires resources such as annotators and annotation tools, takes a long time, and has a high possibility of inconsistent annotations for free word order languages such as Myanmar. This paper describes a dependency head annotation scheme with Universal part-of-speech and Universal Dependencies for the Myanmar dependency treebank. Currently 22,810 sentences and 680,218…

(This article belongs to the SP10 (Special Issue on Multidisciplinary Sciences and Engineering 2020-21) & Section Information Systems in Computer Science (CIS))
Open Access Article
13 Pages, 2,182 KB

A Study on Intelligent Dialogue Agent for Older Adults’ Preventive Care – Towards Development of a Comprehensive Preventive Care System

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 6, Page # 09–21, 2020; DOI: 10.25046/aj050602
Abstract:

Preventive care approaches have attracted much attention in Japan, which is one of the world’s most super-aged societies. These approaches aim to decrease the number of people who require nursing care or other human support. Our research group has developed several kinds of preventive care systems, including a fall prevention system, a cognitive training system,…

(This article belongs to the SP9 (Special Issue on Multidisciplinary Innovation in Engineering Science & Technology 2020) & Section Psychiatry (PSY))
Open Access Article
8 Pages, 1,242 KB

Interactive Virtual Rehabilitation for Aphasic Arabic-Speaking Patients

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 5, Page # 1225–1232, 2020; DOI: 10.25046/aj0505148
Abstract:

Objective: Individuals with aphasia often experience significant problems in their daily lives and social participation. Technologies that address speech and language disorders deficit in merging between therapist’s major role and reinforcing the training between sessions at home. It also lacks the Arabic language attention; however, current systems are typically expensive and lack amusement. Moreover, cumulative…

(This article belongs to Section Biomedical Engineering (EBI))
Open Access Article
15 Pages, 290 KB

Advances in Optimisation Algorithms and Techniques for Deep Learning

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 5, Page # 563–577, 2020; DOI: 10.25046/aj050570
Abstract:

In the last decade, deep learning (DL) has witnessed excellent performances on a variety of problems, including speech recognition, object recognition, detection, and natural language processing (NLP) among many others. Of these applications, one common challenge is to obtain ideal parameters during the training of the deep neural networks (DNN). These typical parameters are obtained by…

(This article belongs to Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
8 Pages, 1,120 KB

Human-Robot Multilingual Verbal Communication – The Ontological knowledge and Learning-based Models

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 540–547, 2020; DOI: 10.25046/aj050464
Abstract:

In their verbal interactions, humans are often afforded with language barriers and communication problems and disabilities. This problem is even more serious in the fields of education and health care for children with special needs. The use of robotic agents, notably humanoids integrated within human groups, is a very important option to face these limitations.…

(This article belongs to the iraset-20 (Special Issue on Innovative Research in Applied Science, Engineering and Technology 2020) & Section Robotics (ROB))
Open Access Article
8 Pages, 805 KB

The Sound of Trust: Towards Modelling Computational Trust using Voice-only Cues at Zero-Acquaintance

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 469–476, 2020; DOI: 10.25046/aj050456
Abstract:

Trust is essential in many interdependent human relationships. Trustworthiness is measured via the effectiveness of the relationships involving human perception. The decision to trust others is often made quickly (even at zero acquaintance). Previous research has shown the significance of voice in perceived trustworthiness. However, the listeners’ characteristics were not considered. A system has yet…

(This article belongs to the SP9 (Special Issue on Multidisciplinary Innovation in Engineering Science & Technology 2020) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
11 Pages, 2,328 KB

Bilateral Communication Device for Deaf-Mute and Normal People

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 363–373, 2020; DOI: 10.25046/aj050442
Abstract:

Communication is a bilateral process and being understood by the person you are talking to is a must. Without the ability to talk nor hear, a person would endure such handicap. Given that hearing and speech are missing, many have ventured to open new communication methods for them through sign language. This bilateral communication device…

(This article belongs to the SP9 (Special Issue on Multidisciplinary Innovation in Engineering Science & Technology 2020) & Section Psychiatry (PSY))
Open Access Article
9 Pages, 1,350 KB

ROS Based Multimode Control of Wheeled Robot

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 2, Page # 688–696, 2020; DOI: 10.25046/aj050285
Abstract:

This research work mainly presents the design and development of a small-scaled wheeled robot, which can be controlled using multiple controlling interfaces using some new technological trends. Raspberry Pi 3 as the main controller, Python as the programming language integrated with the Robot Operating System (ROS) and Virtual Network Computing (VNC) for screen sharing are…

(This article belongs to the SP8 (Special Issue on Multidisciplinary Sciences and Engineering 2019-20) & Section Interdisciplinary Applications of Computer Science (CSI))
Open Access Article
5 Pages, 909 KB

Smart Ambulance: Speed Clearance in the Internet of Things paradigm using Voice Chat

Advances in Science, Technology and Engineering Systems Journal, Volume 4, Issue 6, Page # 280–284, 2019; DOI: 10.25046/aj040635
Abstract:

In recent years, researchers have focused on the development of many applications of information and communication that could enhance human life. Congestion and road traffic are among the main problems that ambulance transportation faces in providing fast healthcare services for patients. In this work, a tracking and data transfer system has…

(This article belongs to Section Network Engineering (ENW))
Open Access Article
9 Pages, 929 KB

Vowel Classification Based on Waveform Shapes

Advances in Science, Technology and Engineering Systems Journal, Volume 4, Issue 3, Page # 16–24, 2019; DOI: 10.25046/aj040303
Abstract:

Vowel classification is an essential part of speech recognition. In classical studies, this problem is mostly handled by using spectral domain features. In this study, a novel approach is proposed for vowel classification based on the visual features of speech waveforms. In sound vocalizing, the position of certain organs of the human vocal system such…

(This article belongs to Section Electronic Engineering (EEE))
Open Access Article
10 Pages, 701 KB

Machine Learning Applied to GRBAS Voice Quality Assessment

Advances in Science, Technology and Engineering Systems Journal, Volume 3, Issue 6, Page # 329–338, 2018; DOI: 10.25046/aj030641
Abstract:

Voice problems are routinely assessed in hospital voice clinics by speech and language therapists (SLTs) who are highly skilled in making audio-perceptual evaluations of voice quality. The evaluations are often presented numerically in the form of five-dimensional ‘GRBAS’ scores. Computerised voice quality assessment may be carried out using digital signal processing (DSP) techniques which process…

(This article belongs to the SP5 (Special Issue on Multidisciplinary Sciences and Engineering 2018) & Section Acoustics (ACO))
