Related papers: Speaker Fluency Level Classification Using Machine…

Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency

English proficiency assessments have become a necessary metric for filtering and selecting prospective candidates for both academia and industry. With the rise in demand for such assessments, it has become increasingly necessary to have the…

Computation and Language · Computer Science 2021-12-01 Pakhi Bamdev, Manraj Singh Grover, Yaman Kumar Singla, Payman Vafaee, Mika Hama, Rajiv Ratn Shah

Deep Learning for Assessment of Oral Reading Fluency

Reading fluency assessment is a critical component of literacy programmes, serving to guide and monitor early education interventions. Given the resource intensive nature of the exercise when conducted by teachers, the development of…

Computation and Language · Computer Science 2024-06-04 Mithilesh Vaidya, Binaya Kumar Sahoo, Preeti Rao

Automated evaluation of children's speech fluency for low-resource languages

Assessment of children's speaking fluency in education is well researched for majority languages, but remains highly challenging for low resource languages. This paper proposes a system to automatically assess fluency by combining a…

Sound · Computer Science 2025-10-24 Bowen Zhang, Nur Afiqah Abdul Latiff, Justin Kan, Rong Tong, Donny Soh, Xiaoxiao Miao, Ian McLoughlin

Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring

Speech fluency/disfluency can be evaluated by analyzing a range of phonetic and prosodic features. Deep neural networks are commonly trained to map fluency-related features into the human scores. However, the effectiveness of deep…

Computation and Language · Computer Science 2023-05-22 Kaiqi Fu, Shaojun Gao, Shuju Shi, Xiaohai Tian, Wei Li, Zejun Ma

An Investigation of the Effectiveness of Phase for Audio Classification

While log-amplitude mel-spectrogram has widely been used as the feature representation for processing speech based on deep learning, the effectiveness of another aspect of speech spectrum, i.e., phase information, was shown recently for…

Sound · Computer Science 2022-05-02 Shunsuke Hidaka, Kohei Wakamiya, Tokihiko Kaburagi

A Comparison of Classifiers in Performing Speaker Accent Recognition Using MFCCs

An algorithm involving Mel-Frequency Cepstral Coefficients (MFCCs) is provided to perform signal feature extraction for the task of speaker accent recognition. Then different classifiers are compared based on the MFCC feature. For each…

Sound · Computer Science 2015-02-02 Zichen Ma, Ernest Fokoue

Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling

Automatic speech recognition (ASR) has been an essential component of computer assisted language learning (CALL) and computer assisted language testing (CALT) for many years. As this technology continues to develop rapidly, it is important…

Computation and Language · Computer Science 2025-04-01 Michael McGuire

Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring

Automatic Speech Scoring (ASS) is the computer-assisted evaluation of a candidate's speaking proficiency in a language. ASS systems face many challenges like open grammar, variable pronunciations, and unstructured or semi-structured…

Audio and Speech Processing · Electrical Eng. & Systems 2021-09-07 Yaman Kumar Singla, Avykat Gupta, Shaurya Bagga, Changyou Chen, Balaji Krishnamurthy, Rajiv Ratn Shah

Automatic Proficiency Assessment in L2 English Learners

Second language proficiency (L2) in English is usually perceptually evaluated by English teachers or expert evaluators, with the inherent intra- and inter-rater variability. This paper explores deep learning techniques for comprehensive L2…

Computation and Language · Computer Science 2025-05-06 Armita Mohammadi, Alessandro Lameiras Koerich, Laureano Moro-Velazquez, Patrick Cardinal

Large Language Models for Dysfluency Detection in Stuttered Speech

Accurately detecting dysfluencies in spoken language can help to improve the performance of automatic speech and language processing components and support the development of more inclusive speech and language technologies. Inspired by the…

Sound · Computer Science 2024-06-18 Dominik Wagner, Sebastian P. Bayerl, Ilja Baumann, Korbinian Riedhammer, Elmar Nöth, Tobias Bocklet

Improved Accent Classification Combining Phonetic Vowels with Acoustic Features

Research has shown that accent classification can be improved by integrating semantic information into a pure acoustic approach. In this work, we combine phonetic knowledge, such as vowels, with enhanced acoustic features to build an improved…

Sound · Computer Science 2016-02-25 Zhenhao Ge

Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification

Mel-scale spectrum features are used in various recognition and classification tasks on speech signals. There is no reason to expect that these features are optimal for all different tasks, including speaker verification (SV). This paper…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-16 Jingyu Li, Yusheng Tian, Tan Lee

Transfer Learning Based Diagnosis and Analysis of Lung Sound Aberrations

With the development of computer systems that can collect and analyze enormous volumes of data, the medical profession is establishing several non-invasive tools. This work attempts to develop a non-invasive technique for identifying…

Sound · Computer Science 2023-03-16 Hafsa Gulzar, Jiyun Li, Arslan Manzoor, Sadaf Rehmat, Usman Amjad, Hadiqa Jalil Khan

Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?

Fluency is a crucial goal of all Natural Language Generation (NLG) systems. Widely used automatic evaluation metrics fall short in capturing the fluency of machine-generated text. Assessing the fluency of NLG systems poses a challenge since…

Computation and Language · Computer Science 2023-12-05 Gopichand Kanumolu, Lokesh Madasu, Pavan Baswani, Ananya Mukherjee, Manish Shrivastava

Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems

Automatic speech recognition systems are part of people's daily lives, embedded in personal assistants and mobile phones, helping as a facilitator for human-machine interaction while allowing access to information in a practically intuitive…

Sound · Computer Science 2021-10-05 Julio Cesar Duarte, Sérgio Colcher

Language-agnostic, automated assessment of listeners' speech recall using large language models

Speech-comprehension difficulties are common among older people. Standard speech tests do not fully capture such difficulties because the tests poorly resemble the context-rich, story-like nature of ongoing conversation and are typically…

Computation and Language · Computer Science 2025-03-04 Björn Herrmann

Self-supervised Speech Models for Word-Level Stuttered Speech Detection

Clinical diagnosis of stuttering requires an assessment by a licensed speech-language pathologist. However, this process is time-consuming and requires clinicians with training and experience in stuttering and fluency disorders.…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-18 Yi-Jen Shih, Zoi Gkalitsiou, Alexandros G. Dimakis, David Harwath

Improving Language Identification of Accented Speech

Language identification from speech is a common preprocessing step in many spoken language processing systems. In recent years, this field has seen fast progress, mostly due to the use of self-supervised models pretrained on multilingual…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-04 Kunnar Kukk, Tanel Alumäe

Neural Network Based Speaker Classification and Verification Systems with Enhanced Features

This work presents a novel framework based on feed-forward neural network for text-independent speaker classification and verification, two related systems of speaker recognition. With optimized features and model training, it achieves 100%…

Sound · Computer Science 2017-03-20 Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Ram Sundaram, Aravind Ganapathiraju

Speaker Identification using Speech Recognition

The audio data is increasing day by day throughout the globe with the increase of telephonic conversations, video conferences and voice messages. This research provides a mechanism for identifying a speaker in an audio file, based on the…

Sound · Computer Science 2022-05-31 Syeda Rabia Arshad, Syed Mujtaba Haider, Abdul Basit Mughal