English
Related papers

Related papers: Refining a Deep Learning-based Formant Tracker usi…

200 papers

Formant tracking is investigated in this study by using trackers based on dynamic programming (DP) and deep neural nets (DNNs). Using the DP approach, six formant estimation methods were first compared. The six methods include linear…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-06 Dhananjaya Gowda , Bajibabu Bollepalli , Sudarsana Reddy Kadiri , Paavo Alku

In this paper, we propose a new method for the accurate estimation and tracking of formants in speech signals using time-varying quasi-closed-phase (TVQCP) analysis. Conventional formant tracking methods typically adopt a two-stage…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-01 Dhananjaya Gowda , Sudarsana Reddy Kadiri , Brad Story , Paavo Alku

Formant tracking is one of the most fundamental problems in speech processing. Traditionally, formants are estimated using signal processing methods. Recent studies showed that generic convolutional architectures can outperform recurrent…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-11 Wang Dai , Jinsong Zhang , Yingming Gao , Wei Wei , Dengfeng Ke , Binghuai Lin , Yanlu Xie

Formants are the spectral maxima that result from acoustic resonances of the human vocal tract, and their accurate estimation is among the most fundamental speech processing problems. Recent work has been shown that those frequencies can…

Sound · Computer Science 2022-06-24 Yosi Shrem , Felix Kreuk , Joseph Keshet

During the last years, deep learning trackers achieved stimulating results while bringing interesting ideas to solve the tracking problem. This progress is mainly due to the use of learned deep features obtained by training deep…

Computer Vision and Pattern Recognition · Computer Science 2020-12-24 Ahmed Zgaren , Wassim Bouachir , Riadh Ksantini

In this paper, we propose a deep learning based system for the task of deepfake audio detection. In particular, the draw input audio is first transformed into various spectrograms using three transformation methods of Short-time Fourier…

Sound · Computer Science 2024-07-03 Lam Pham , Phat Lam , Truong Nguyen , Huyen Nguyen , Alexander Schindler

Recently, efficient fine-tuning of large-scale pre-trained models has attracted increasing research interests, where linear probing (LP) as a fundamental module is involved in exploiting the final representations for task-dependent…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Mingze Gao , Qilong Wang , Zhenyi Lin , Pengfei Zhu , Qinghua Hu , Jingbo Zhou

Traditional tracking-by-detection systems typically employ Kalman filters (KF) for state estimation. However, the KF requires domain-specific design choices and it is ill-suited to handling non-linear motion patterns. To address these…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Momir Adžemović , Predrag Tadić , Andrija Petrović , Mladen Nikolić

The two-stage fine-tuning (FT) method, linear probing (LP) then fine-tuning (LP-FT), outperforms linear probing and FT alone. This holds true for both in-distribution (ID) and out-of-distribution (OOD) data. One key reason for its success…

Machine Learning · Computer Science 2024-10-23 Akiyoshi Tomihari , Issei Sato

Many real-world time series exhibit strong periodic structures arising from physical laws, human routines, or seasonal cycles. However, modern deep forecasting models often fail to capture these recurring patterns due to spectral bias and a…

Machine Learning · Computer Science 2025-08-05 Menglin Kong , Vincent Zhihao Zheng , Lijun Sun

Transformer-based trackers have achieved strong accuracy on the standard benchmarks. However, their efficiency remains an obstacle to practical deployment on both GPU and CPU platforms. In this paper, to overcome this issue, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2024-02-08 Yutao Cui , Tianhui Song , Gangshan Wu , Limin Wang

Multitarget Tracking (MTT) is the problem of tracking the states of an unknown number of objects using noisy measurements, with important applications to autonomous driving, surveillance, robotics, and others. In the model-based Bayesian…

Machine Learning · Computer Science 2021-06-07 Juliano Pinto , Georg Hess , William Ljungbergh , Yuxuan Xia , Lennart Svensson , Henk Wymeersch

Correlation filter (CF) based trackers generally include two modules, i.e., feature representation and on-line model adaptation. In existing off-line deep learning models for CF trackers, the model adaptation usually is either abandoned or…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Yingjie Yao , Xiaohe Wu , Lei Zhang , Shiguang Shan , Wangmeng Zuo

Discriminative Correlation Filter (DCF) based methods have shown competitive performance on tracking benchmarks in recent years. Generally, DCF based trackers learn a rigid appearance model of the target. However, this reliance on a single…

Computer Vision and Pattern Recognition · Computer Science 2017-06-12 Joakim Johnander , Martin Danelljan , Fahad Shahbaz Khan , Michael Felsberg

Variant calling refinement is crucial for distinguishing true genetic variants from technical artifacts in high-throughput sequencing data. Manual review is time-consuming while heuristic filtering often lacks optimal solutions. Traditional…

Genomics · Quantitative Biology 2024-08-02 Omar Abdelwahab , Davoud Torkamaneh

The growth of global consumption has motivated important applications of deep learning to smart manufacturing and machine health monitoring. In particular, analyzing vibration data offers great potential to extract meaningful insights into…

Machine Learning · Computer Science 2024-05-30 Anthony Zhou , Amir Barati Farimani

During the recent years, correlation filters have shown dominant and spectacular results for visual object tracking. The types of the features that are employed in these family of trackers significantly affect the performance of visual…

Computer Vision and Pattern Recognition · Computer Science 2018-03-13 Erhan Gundogdu , A. Aydin Alatan

Looped Transformers have emerged as an efficient and powerful class of models for reasoning in the language domain. Recent studies show that these models achieve strong performance on algorithmic and reasoning tasks, suggesting that looped…

Computation and Language · Computer Science 2026-02-13 Ahmadreza Jeddi , Marco Ciccone , Babak Taati

Deep Learning architectures, and in particular Transformers, are conventionally viewed as a composition of layers. These layers are actually often obtained as the sum of two contributions: a residual path that copies the input and the…

This paper proposes a novel framework for audio deepfake detection with two main objectives: i) attaining the highest possible accuracy on available fake data, and ii) effectively performing continuous learning on new fake data in a…

Sound · Computer Science 2024-09-11 Tuan Duy Nguyen Le , Kah Kuan Teh , Huy Dat Tran
‹ Prev 1 2 3 10 Next ›