Related papers: Spoken Language Identification using ConvNets

Efficient Spoken Language Recognition via Multilabel Classification

Spoken language recognition (SLR) is the task of automatically identifying the language present in a speech signal. Existing SLR models are either too computationally expensive or too large to run effectively on devices with limited…

Computation and Language · Computer Science 2023-06-06 Oriol Nieto , Zeyu Jin , Franck Dernoncourt , Justin Salamon

Language Identification Using Deep Convolutional Recurrent Neural Networks

Language Identification (LID) systems are used to classify the spoken language from a given audio sample and are typically the first step for many spoken language processing tasks, such as Automatic Speech Recognition (ASR) systems. Without…

Computer Vision and Pattern Recognition · Computer Science 2017-08-17 Christian Bartz , Tom Herold , Haojin Yang , Christoph Meinel

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Language identification greatly impacts the success of downstream tasks such as automatic speech recognition. Recently, self-supervised speech representations learned by wav2vec 2.0 have been shown to be very effective for a range of speech…

Computation and Language · Computer Science 2021-10-19 Andros Tjandra , Diptanu Gon Choudhury , Frank Zhang , Kritika Singh , Alexis Conneau , Alexei Baevski , Assaf Sela , Yatharth Saraf , Michael Auli

Language identification as improvement for lip-based biometric visual systems

Language has always been one of humanity's defining characteristics. Visual Language Identification (VLI) is a relatively new field of research that is complex and largely understudied. In this paper, we present a preliminary study in which…

Computer Vision and Pattern Recognition · Computer Science 2023-02-28 Lucia Cascone , Michele Nappi , Fabio Narducci

Is Attention always needed? A Case Study on Language Identification from Speech

Language Identification (LID) is a crucial preliminary process in the field of Automatic Speech Recognition (ASR) that involves the identification of a spoken language from audio samples. Contemporary systems that can process speech in…

Machine Learning · Computer Science 2026-03-04 Atanu Mandal , Santanu Pal , Indranil Dutta , Mahidas Bhattacharya , Sudip Kumar Naskar

Improving Language Identification for Multilingual Speakers

Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly…

Audio and Speech Processing · Electrical Eng. & Systems 2020-01-30 Andrew Titus , Jan Silovsky , Nanxin Chen , Roger Hsiao , Mary Young , Arnab Ghoshal

Language Identification on Massive Datasets of Short Message using an Attention Mechanism CNN

Language Identification (LID) is a challenging task, especially when the input texts are short and noisy such as posts and statuses on social media or chat logs on gaming forums. The task has been tackled by either designing a feature set…

Computation and Language · Computer Science 2019-10-16 Duy Tin Vo , Richard Khoury

Fine-grained Language Identification with Multilingual CapsNet Model

Due to a drastic improvement in the quality of internet services worldwide, there is an explosion of multilingual content generation and consumption. This is especially prevalent in countries with large multilingual audience, who are…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-14 Mudit Verma , Arun Balaji Buduru

Improving Language Identification of Accented Speech

Language identification from speech is a common preprocessing step in many spoken language processing systems. In recent years, this field has seen fast progress, mostly due to the use of self-supervised models pretrained on multilingual…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-04 Kunnar Kukk , Tanel Alumäe

Transducer-based language embedding for spoken language identification

The acoustic and linguistic features are important cues for the spoken language identification (LID) task. Recent advanced LID systems mainly use acoustic features that lack the usage of explicit linguistic feature encoding. In this paper,…

Computation and Language · Computer Science 2022-08-01 Peng Shen , Xugang Lu , Hisashi Kawai

Automatic Language Identification in Texts: A Survey

Language identification (LI) is the problem of determining the natural language that a document or part thereof is written in. Automatic LI has been extensively researched for over fifty years. Today, LI is a key part of many text…

Computation and Language · Computer Science 2018-11-22 Tommi Jauhiainen , Marco Lui , Marcos Zampieri , Timothy Baldwin , Krister Lindén

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model

We present a novel approach to multilingual audio-visual speech recognition tasks by introducing a single model on a multilingual dataset. Motivated by a human cognitive system where humans can intuitively distinguish different languages…

Multimedia · Computer Science 2023-10-24 Joanna Hong , Se Jin Park , Yong Man Ro

Multiclass Language Identification using Deep Learning on Spectral Images of Audio Signals

The first step in any voice recognition software is to determine what language a speaker is using, and ideally this process would be automated. The technique described in this paper, language identification for audio spectrograms (LIFAS),…

Sound · Computer Science 2019-05-14 Shauna Revay , Matthew Teschke

In-context Language Learning for Endangered Languages in Speech Recognition

With approximately 7,000 languages spoken worldwide, current large language models (LLMs) support only a small subset. Prior research indicates LLMs can learn new languages for certain tasks without supervised data. We extend this…

Computation and Language · Computer Science 2026-01-29 Zhaolin Li , Jan Niehues

Towards Relevance and Sequence Modeling in Language Recognition

The task of automatic language identification (LID) involving multiple dialects of the same language family in the presence of noise is a challenging problem. In these scenarios, the identity of the language/dialect may be reliably present…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-06 Bharat Padi , Anand Mohan , Sriram Ganapathy

Joint unsupervised and supervised learning for context-aware language identification

Language identification (LID) recognizes the language of a spoken utterance automatically. According to recent studies, LID models trained with an automatic speech recognition (ASR) task perform better than those trained with a LID task…

Audio and Speech Processing · Electrical Eng. & Systems 2023-04-17 Jinseok Park , Hyung Yong Kim , Jihwan Park , Byeong-Yeol Kim , Shukjae Choi , Yunkyu Lim

Language Recognition using Random Indexing

Random Indexing is a simple implementation of Random Projections with a wide range of applications. It can solve a variety of problems with good accuracy without introducing much complexity. Here we use it for identifying the language of…

Computation and Language · Computer Science 2015-03-02 Aditya Joshi , Johan Halseth , Pentti Kanerva

Automatic Spoken Language Identification using a Time-Delay Neural Network

Closed-set spoken language identification is the task of recognizing the language being spoken in a recorded audio clip from a set of known languages. In this study, a language identification system was built and trained to distinguish…

Computation and Language · Computer Science 2022-05-20 Benjamin Kepecs , Homayoon Beigi

LanideNN: Multilingual Language Identification on Character Window

In language identification, a common first step in natural language processing, we want to automatically determine the language of some input text. Monolingual language identification assumes that the given document is written in one…

Computation and Language · Computer Science 2017-08-01 Tom Kocmi , Ondřej Bojar

Enhancing Neural Spoken Language Recognition: An Exploration with Multilingual Datasets

In this research, we advanced a spoken language recognition system, moving beyond traditional feature vector-based models. Our improvements focused on effectively capturing language characteristics over extended periods using a specialized…

Sound · Computer Science 2025-01-22 Or Haim Anidjar , Roi Yozevitch