Search Results - Advances in Science, Technology and Engineering Systems Journal

Integrating Speech and Gesture for Generating Reliable Robotic Task Configuration

by Shuvo Kumar Paul, Mircea Nicolescu and Monica Nicolescu

Advances in Science, Technology and Engineering Systems Journal, Volume 9, Issue 4, Page # 51–59, 2024; DOI: 10.25046/aj090406

Abstract:

This paper presents a system that combines speech and pointing gestures along with four distinct hand gestures to precisely identify both the object of interest and parameters for robotic tasks. We utilized skeleton landmarks to detect pointing gestures and determine their direction, while a pre-trained model, trained on 21 hand landmarks from 2D images, was…

Bangla Speech Emotion Detection using Machine Learning Ensemble Methods

by Roy D Gregori Ayon, Md. Sanaullah Rabbi, Umme Habiba and Maoyejatun Hasana

Advances in Science, Technology and Engineering Systems Journal, Volume 7, Issue 6, Page # 70–76, 2022; DOI: 10.25046/aj070608

Abstract:

Emotion is the most important component of being human, and very essential for everyday activities, such as the interaction between people, decision making, and learning. In order to adapt to the COVID-19 pandemic situation, most of the academic institutions relied on online video conferencing platforms to continue educational activities. Due to low bandwidth in many…

Read More

(This article belongs to Section Artificial Intelligence in Computer Science (CAI))

Emotion Mining from Speech in Collaborative Learning

by Nasrin Dehbozorgi, Mary Lou Maher and Mohsen Dorodchi

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 5, Page # 90–100, 2021; DOI: 10.25046/aj060512

Abstract:

Affective states, a dimension of attitude, have a critical role in the learning process. In the educational setting, affective states are commonly captured by self-report tools or based on sentiment analysis on asynchronous textual chats, discussions, or students’ journals. Drawbacks of such tools include: distracting the learning process, demanding time and commitment from students to…

An Alternative Approach for Thai Automatic Speech Recognition Based on the CNN-based Keyword Spotting with Real-World Application

by Kanjanapan Sukvichai and Chaitat Utintu

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 4, Page # 278–291, 2021; DOI: 10.25046/aj060431

Abstract:

An automatic speech recognition (ASR) is a key technology for preventing an ongoing global coronavirus epidemic. Due to the limited corpus database and the morphological diversity of the Thai language, Thai speech recognition is still difficult. In this research, the automatic speech recognition model was built differently from the traditional Thai NLP systems by using…

A Model for the Application of Automatic Speech Recognition for Generating Lesson Summaries

by Phillip Blunt and Bertram Haskins

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 2, Page # 526–540, 2021; DOI: 10.25046/aj060260

Abstract:

Automatic Speech Recognition (ASR) technology has the potential to improve the learning experience of students in the classroom. This article addresses some of the key theoretical areas identified in the pursuit of implementing a speech recognition system, capable of lesson summary generation in the educational setting. The article discusses: some of the applica- tions of…

Deaf Chat: A Speech-to-Text Communication Aid for Hearing Deficiency

by Mandlenkosi Shezi and Abejide Ade-Ibijola

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 5, Page # 826–833, 2020; DOI: 10.25046/aj0505100

Abstract:

Hearing impairments have a negative impact in the lives of individuals living with them and those around such individuals. Different applications and technological tools have been developed to help reduce this negative impact. Most mobile applications that have been developed that use Speech-to-Text technology have been inconsistent such that they are not inclusive of all…

Distributed Microphone Arrays, Emerging Speech and Audio Signal Processing Platforms: A Review

by Shahab Pasha, Jan Lundgren, Christian Ritz and Yuexian Zou

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 331–343, 2020; DOI: 10.25046/aj050439

Abstract:

Given ubiquitous digital devices with recording capability, distributed microphone arrays are emerging recording tools for hands-free communications and spontaneous tele-conferencings. However, the analysis of signals recorded with diverse sampling rates, time delays, and qualities by distributed microphone arrays is not straightforward and entails important considerations. The crucial challenges include the unknown/changeable geometry of distributed arrays,…

Noise Cancellation Algorithm Based on Air- and Bone-Conducted Speech Signals by Considering an Unscented Transformation Method

by Hisako Orimoto and Akira ikuta

Advances in Science, Technology and Engineering Systems Journal, Volume 4, Issue 2, Page # 305–313, 2019; DOI: 10.25046/aj040239

Abstract:

Noise control is essential when applying speech recognition in noisy environments such as factories. In this study, a signal processing for noise cancellation is proposed by using a noise-insensitive bone-conducted speech signal together with an air-conducted speech signal. The speech signal is generally expressed by a nonlinear model. The extended Kalman filter is very famous…

Difference in Speech Analysis Results by Coding

by Yasuhiro Omiya, Naoki Hagiwara, Takeshi Takano, Shuji Shinohara, Mitsuteru Nakamura, Masakazu Higuchi, Shunji Mitsuyoshi, Hiroyuki Toda and Shinichi Tokuno

Advances in Science, Technology and Engineering Systems Journal, Volume 3, Issue 5, Page # 488–491, 2018; DOI: 10.25046/aj030555

Abstract:

Mental health disorder is becoming a social problem, and there is a need for technology that can easily check for states of stress and depression as a countermeasure. Conventional methods of diagnostic support and screening include self-administered psychological tests and use of biomarkers. However, there are problems such as burden on subjects, examination costs, dedicated…

Read More

(This article belongs to the SP6 (Special Issue on Recent Advances in Engineering Systems 2018-19) & Section Acoustics (ACO))

Amplitude-Frequency Analysis of Emotional Speech Using Transfer Learning and Classification of Spectrogram Images

by Margaret Lech, Melissa Stolar, Robert Bolia and Michael Skinner

Advances in Science, Technology and Engineering Systems Journal, Volume 3, Issue 4, Page # 363–371, 2018; DOI: 10.25046/aj030437

Abstract:

Automatic speech emotion recognition (SER) techniques based on acoustic analysis show high confusion between certain emotional categories. This study used an indirect approach to provide insights into the amplitude-frequency characteristics of different emotions in order to support the development of future, more efficiently differentiating SER methods. The analysis was carried out by transforming short 1-second…

Emotional state recognition in speech signal

by Krystian Kapala, Dawid Krawczyk and Stefan Brachmanski

Advances in Science, Technology and Engineering Systems Journal, Volume 2, Issue 3, Page # 1654–1659, 2017; DOI: 10.25046/aj0203205

Abstract:

The matters regarding speech signal processing and analyzing in terms of emotional states recognition were presented in this paper. An experiment was conducted to perform both objective and subjective emotional states recognition tests for Polish language.

Read More

(This article belongs to the SP3 (Special issue on Recent Advances in Engineering Systems 2017) & Section Telecommunications (TEL))

Analysis of Emotions and Movements of Asian and European Facial Expressions

by Ajla Kulaglic, Zeynep Örpek, Berk Kayı and Samet Ozmen

Advances in Science, Technology and Engineering Systems Journal, Volume 9, Issue 1, Page # 42–48, 2024; DOI: 10.25046/aj090105

Abstract:

The aim of this study is to develop an advanced framework that not only recognize the dominant facial emotion, but also contains modules for gesture recognition and text-to-speech recognition. Each module is meticulously designed and integrated into unified system. The implemented models have been revised, with the results presented through graphical representations, providing prevalent emotions…

Machine Learning Algorithms for Real Time Blind Audio Source Separation with Natural Language Detection

by Arwa Alghamdi, Graham Healy and Hoda Abdelhafez

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 5, Page # 125–140, 2021; DOI: 10.25046/aj060515

Abstract:

The Conv-TasNet and Demucs algorithms, can differentiate between two mixed signals, such as music and speech, the mixing operation proceed without any support information. The network of convolutional time-domain audio separations is used in Conv-TasNet algorithm, while there is a new waveform-to-waveform model in Demucs algorithm. The Demucs algorithm utilizes a procedure like the audio…

The Design and Implementation of Intelligent English Learning Chabot based on Transfer Learning Technology

by Nuobei Shi, Qin Zeng and Raymond Shu Tak Lee

Advances in Science, Technology and Engineering Systems Journal, Volume 6, Issue 5, Page # 32–42, 2021; DOI: 10.25046/aj060505

Abstract:

Chatbot operates task-oriented customer services in special and open domains at different mobile devices. Its related products such as knowledge base Question-Answer System also benefit daily activities. Chatbot functions generally include automatic speech recognition (ASR), natural language understanding (NLU), dialogue management (DM), natural language generation (NLG) and speech synthesis (SS). In this paper, we proposed…

Dependency Head Annotation for Myanmar Dependency Treebank

by Hnin Thu Zar Aye and Win Pa Pa

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 6, Page # 788–800, 2020; DOI: 10.25046/aj050694

Abstract:

Complete manual annotation of dependency treebank needs resources like annotators and annotation tools and takes long time and has high possibility of inconsistent annotations for free word order languages such as Myanmar. This paper describes a dependency head annotation scheme with Universal part-of-speech and Universal Dependencies for Myanmar dependency treebank. Currently 22,810 sentences and 680,218…

A Study on Intelligent Dialogue Agent for Older Adults’ Preventive Care – Towards Development of a Comprehensive Preventive Care System

by Sho Hirose, Daisuke Kitakoshi, Akihiro Yamashita, Kentarou Suzuki and Masato Suzuki

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 6, Page # 09–21, 2020; DOI: 10.25046/aj050602

Abstract:

Preventive care approaches have attracted much attention in Japan, which is one of the world’s most super-aged societies. These approaches aim to decrease the number of people who require nursing care or other human support. Our research group has developed several kinds of preventive care systems, including a fall prevention system, a cognitive training system,…

Interactive Virtual Rehabilitation for Aphasic Arabic-Speaking Patients

by Sherif H. ElGohary, Aya Lithy, Shefaa Khamis, Aya Ali, Aya Alaa el-din and Hager Abd El-Azim

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 5, Page # 1225–1232, 2020; DOI: 10.25046/aj0505148

Abstract:

Objective: Individuals with aphasia often experience significant problems in their daily lives and social participation. Technologies that address speech and language disorders deficit in merging between therapist’s major role and reinforcing the training between sessions at home. It also lacks the Arabic language attention; however, current systems are typically expensive and lack amusement. Moreover, cumulative…

Read More

(This article belongs to Section Biomedical Engineering (EBI))

Advances in Optimisation Algorithms and Techniques for Deep Learning

by Chigozie Enyinna Nwankpa

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 5, Page # 563–577, 2020; DOI: 10.25046/aj050570

Abstract:

In the last decade, deep learning(DL) has witnessed excellent performances on a variety of problems, including speech recognition, object recognition, detection, and natural language processing (NLP) among many others. Of these applications, one common challenge is to obtain ideal parameters during the training of the deep neural networks (DNN). These typical parameters are obtained by…

Read More

(This article belongs to Section Interdisciplinary Applications of Computer Science (CSI))

Human-Robot Multilingual Verbal Communication – The Ontological knowledge and Learning-based Models

by Mohammed Qbadou, Intissar Salhi, Hanaâ El fazazi, Khalifa Mansouri, Michail Manios and Vassilis Kaburlasos

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 540–547, 2020; DOI: 10.25046/aj050464

Abstract:

In their verbal interactions, humans are often afforded with language barriers and communication problems and disabilities. This problem is even more serious in the fields of education and health care for children with special needs. The use of robotic agents, notably humanoids integrated within human groups, is a very important option to face these limitations.…

The Sound of Trust: Towards Modelling Computational Trust using Voice-only Cues at Zero-Acquaintance

by Deborah Ooi Yee Hui, Syaheerah Lebai Lutfi, Syibrah Naim, Zahid Akhtar, Ahmad Sufril Azlan Mohamed and Kamran Siddique

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 469–476, 2020; DOI: 10.25046/aj050456

Abstract:

Trust is essential in many interdependent human relationships. Trustworthiness is measured via the effectiveness of the relationships involving human perception. The decision to trust others is often made quickly (even at zero acquaintance). Previous research has shown the significance of voice in perceived trustworthiness. However, the listeners’ characteristics were not considered. A system has yet…

Bilateral Communication Device for Deaf-Mute and Normal People

by Raven Carlos Tabiongan

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 4, Page # 363–373, 2020; DOI: 10.25046/aj050442

Abstract:

Communication is a bilateral process and being understood by the person you are talking to is a must. Without the ability to talk nor hear, a person would endure such handicap. Given that hearing and speech are missing, many have ventured to open new communication methods for them through sign language. This bilateral communication device…

ROS Based Multimode Control of Wheeled Robot

by Rajesh Kannan Megalingam, Santosh Tantravahi, Hemanth Sai Surya Kumar Tammana, Nagasai Thokala, Hari Sudarshan Rahul Puram and Naveen Samudrala

Advances in Science, Technology and Engineering Systems Journal, Volume 5, Issue 2, Page # 688–696, 2020; DOI: 10.25046/aj050285

Abstract:

This research work mainly presents the design and development of a small-scaled wheeled robot, which can be controlled using multiple controlling interfaces using some new technological trends. Raspberry Pi 3 as the main controller, Python as the programming language integrated with the Robot Operating System (ROS) and Virtual Network Computing (VNC) for screen sharing are…

Smart Ambulance: Speed Clearance in the Internet of Things paradigm using Voice Chat

by Noor A.Hussein and Mohamed Ibrahim Shujaa

Advances in Science, Technology and Engineering Systems Journal, Volume 4, Issue 6, Page # 280–284, 2019; DOI: 10.25046/aj040635

Abstract:

In recent years, researchers have focused on the development of many applications of information and communication which could lead to enhance human life. The congestion and road traffic are one of the most problems facing the ambulance transportation to provide fast healthcare services for patients. In this work, a tracking and data transfer system has…

Read More

(This article belongs to Section Network Engineering (ENW))

Vowel Classification Based on Waveform Shapes

by Hakan Tora, Gursel Karacor and Baran Uslu

Advances in Science, Technology and Engineering Systems Journal, Volume 4, Issue 3, Page # 16–24, 2019; DOI: 10.25046/aj040303

Abstract:

Vowel classification is an essential part of speech recognition. In classical studies, this problem is mostly handled by using spectral domain features. In this study, a novel approach is proposed for vowel classification based on the visual features of speech waveforms. In sound vocalizing, the position of certain organs of the human vocal system such…

Read More

(This article belongs to Section Electronic Engineering (EEE))

Machine Learning Applied to GRBAS Voice Quality Assessment

by Zheng Xie, Chaitanya Gadepalli, Farideh Jalalinajafabadi, Barry M.G. Cheetham and Jarrod J. Homer

Advances in Science, Technology and Engineering Systems Journal, Volume 3, Issue 6, Page # 329–338, 2018; DOI: 10.25046/aj030641

Abstract:

Voice problems are routinely assessed in hospital voice clinics by speech and language therapists (SLTs) who are highly skilled in making audio-perceptual evaluations of voice quality. The evaluations are often presented numerically in the form of five-dimensional ‘GRBAS’ scores. Computerised voice quality assessment may be carried out using digital signal processing (DSP) techniques which process…

Read More

(This article belongs to the SP5 (Special Issue on Multidisciplinary Sciences and Engineering 2018) & Section Acoustics (ACO))

Results (27)