Related papers: Refining a Deep Learning-based Formant Tracker usi…

Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks

Formant tracking is investigated in this study by using trackers based on dynamic programming (DP) and deep neural nets (DNNs). Using the DP approach, six formant estimation methods were first compared. The six methods include linear…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-06 Dhananjaya Gowda , Bajibabu Bollepalli , Sudarsana Reddy Kadiri , Paavo Alku

Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals

In this paper, we propose a new method for the accurate estimation and tracking of formants in speech signals using time-varying quasi-closed-phase (TVQCP) analysis. Conventional formant tracking methods typically adopt a two-stage…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-01 Dhananjaya Gowda , Sudarsana Reddy Kadiri , Brad Story , Paavo Alku

Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism

Formant tracking is one of the most fundamental problems in speech processing. Traditionally, formants are estimated using signal processing methods. Recent studies showed that generic convolutional architectures can outperform recurrent…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-11 Wang Dai , Jinsong Zhang , Yingming Gao , Wei Wei , Dengfeng Ke , Binghuai Lin , Yanlu Xie

Formant Estimation and Tracking using Probabilistic Heat-Maps

Formants are the spectral maxima that result from acoustic resonances of the human vocal tract, and their accurate estimation is among the most fundamental speech processing problems. Recent work has been shown that those frequencies can…

Sound · Computer Science 2022-06-24 Yosi Shrem , Felix Kreuk , Joseph Keshet

Coarse-to-Fine Object Tracking Using Deep Features and Correlation Filters

During the last years, deep learning trackers achieved stimulating results while bringing interesting ideas to solve the tracking problem. This progress is mainly due to the use of learned deep features obtained by training deep…

Computer Vision and Pattern Recognition · Computer Science 2020-12-24 Ahmed Zgaren , Wassim Bouachir , Riadh Ksantini

Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models

In this paper, we propose a deep learning based system for the task of deepfake audio detection. In particular, the draw input audio is first transformed into various spectrograms using three transformation methods of Short-time Fourier…

Sound · Computer Science 2024-07-03 Lam Pham , Phat Lam , Truong Nguyen , Huyen Nguyen , Alexander Schindler

Tuning Pre-trained Model via Moment Probing

Recently, efficient fine-tuning of large-scale pre-trained models has attracted increasing research interests, where linear probing (LP) as a fundamental module is involved in exploiting the final representations for task-dependent…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Mingze Gao , Qilong Wang , Zhenyi Lin , Pengfei Zhu , Qinghua Hu , Jingbo Zhou

Beyond Kalman Filters: Deep Learning-Based Filters for Improved Object Tracking

Traditional tracking-by-detection systems typically employ Kalman filters (KF) for state estimation. However, the KF requires domain-specific design choices and it is ill-suited to handling non-linear motion patterns. To address these…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Momir Adžemović , Predrag Tadić , Andrija Petrović , Mladen Nikolić

Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective

The two-stage fine-tuning (FT) method, linear probing (LP) then fine-tuning (LP-FT), outperforms linear probing and FT alone. This holds true for both in-distribution (ID) and out-of-distribution (OOD) data. One key reason for its success…

Machine Learning · Computer Science 2024-10-23 Akiyoshi Tomihari , Issei Sato

Frequency-Constrained Learning for Long-Term Forecasting

Many real-world time series exhibit strong periodic structures arising from physical laws, human routines, or seasonal cycles. However, modern deep forecasting models often fail to capture these recurring patterns due to spectral bias and a…

Machine Learning · Computer Science 2025-08-05 Menglin Kong , Vincent Zhihao Zheng , Lijun Sun

MixFormerV2: Efficient Fully Transformer Tracking

Transformer-based trackers have achieved strong accuracy on the standard benchmarks. However, their efficiency remains an obstacle to practical deployment on both GPU and CPU platforms. In this paper, to overcome this issue, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2024-02-08 Yutao Cui , Tianhui Song , Gangshan Wu , Limin Wang

Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning

Multitarget Tracking (MTT) is the problem of tracking the states of an unknown number of objects using noisy measurements, with important applications to autonomous driving, surveillance, robotics, and others. In the model-based Bayesian…

Machine Learning · Computer Science 2021-06-07 Juliano Pinto , Georg Hess , William Ljungbergh , Yuxuan Xia , Lennart Svensson , Henk Wymeersch

Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking

Correlation filter (CF) based trackers generally include two modules, i.e., feature representation and on-line model adaptation. In existing off-line deep learning models for CF trackers, the model adaptation usually is either abandoned or…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Yingjie Yao , Xiaohe Wu , Lei Zhang , Shiguang Shan , Wangmeng Zuo

DCCO: Towards Deformable Continuous Convolution Operators

Discriminative Correlation Filter (DCF) based methods have shown competitive performance on tracking benchmarks in recent years. Generally, DCF based trackers learn a rigid appearance model of the target. However, this reliance on a single…

Computer Vision and Pattern Recognition · Computer Science 2017-06-12 Joakim Johnander , Martin Danelljan , Fahad Shahbaz Khan , Michael Felsberg

Refinement of genetic variants needs attention

Variant calling refinement is crucial for distinguishing true genetic variants from technical artifacts in high-throughput sequencing data. Manual review is time-consuming while heuristic filtering often lacks optimal solutions. Traditional…

Genomics · Quantitative Biology 2024-08-02 Omar Abdelwahab , Davoud Torkamaneh

FaultFormer: Pretraining Transformers for Adaptable Bearing Fault Classification

The growth of global consumption has motivated important applications of deep learning to smart manufacturing and machine health monitoring. In particular, analyzing vibration data offers great potential to extract meaningful insights into…

Machine Learning · Computer Science 2024-05-30 Anthony Zhou , Amir Barati Farimani

Good Features to Correlate for Visual Tracking

During the recent years, correlation filters have shown dominant and spectacular results for visual object tracking. The types of the features that are employed in these family of trackers significantly affect the performance of visual…

Computer Vision and Pattern Recognition · Computer Science 2018-03-13 Erhan Gundogdu , A. Aydin Alatan

LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Looped Transformers have emerged as an efficient and powerful class of models for reasoning in the language domain. Recent studies show that these models achieve strong performance on algorithmic and reasoning tasks, suggesting that looped…

Computation and Language · Computer Science 2026-02-13 Ahmadreza Jeddi , Marco Ciccone , Babak Taati

Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

Deep Learning architectures, and in particular Transformers, are conventionally viewed as a composition of layers. These layers are actually often obtained as the sum of two contributions: a residual path that copies the input and the…

Machine Learning · Computer Science 2026-03-03 Jonathan Lys , Vincent Gripon , Bastien Pasdeloup , Axel Marmoret , Lukas Mauch , Fabien Cardinaux , Ghouthi Boukli Hacene

Continuous Learning of Transformer-based Audio Deepfake Detection

This paper proposes a novel framework for audio deepfake detection with two main objectives: i) attaining the highest possible accuracy on available fake data, and ii) effectively performing continuous learning on new fake data in a…

Sound · Computer Science 2024-09-11 Tuan Duy Nguyen Le , Kah Kuan Teh , Huy Dat Tran