English
Related papers

Related papers: Computer Audition: From Task-Specific Machine Lear…

200 papers

Foundation models (FMs) are changing the way medical images are analyzed by learning from large collections of unlabeled data. Instead of relying on manually annotated examples, FMs are pre-trained to learn general-purpose visual features…

Reasoning has become a defining capability of modern foundation models, yet its development in the audio modality remains limited. Audio poses challenges that are distinct from those of text and vision. It is continuous, temporally dense,…

Audio and Speech Processing · Electrical Eng. & Systems 2026-05-21 Zhihan Guo , Wenqian Cui , Guan-Ting Lin , Daxin Tan , Jingyao Li , Qiyong Zheng , Dingdong Wang , Jing Xiong , Han Shi , Jiaya Jia , Irwin King

Music is essential in daily life, fulfilling emotional and entertainment needs, and connecting us personally, socially, and culturally. A better understanding of music can enhance our emotions, cognitive skills, and cultural connections.…

In the era of large language models (LLMs) and artificial general intelligence (AGI), computer audition must evolve beyond traditional paradigms to fully leverage the capabilities of foundation models, towards more comprehensive…

The dawn of Foundation Models has on the one hand revolutionised a wide range of research problems, and, on the other hand, democratised the access and use of AI-based tools by the general public. We even observe an incursion of these…

Pre-trained deep learning models, known as foundation models, have become essential building blocks in machine learning domains such as natural language processing and image domains. This trend has extended to respiratory and heart sound…

Audio and Speech Processing · Electrical Eng. & Systems 2025-04-28 Daisuke Niizumi , Daiki Takeuchi , Masahiro Yasuda , Binh Thien Nguyen , Yasunori Ohishi , Noboru Harada

Pre-trained Foundation Models (PFMs) have ushered in a paradigm-shift in Artificial Intelligence, due to their ability to learn general-purpose representations that can be readily employed in a wide range of downstream tasks. While PFMs…

Databases · Computer Science 2024-11-13 Pasquale Balsebre , Weiming Huang , Gao Cong , Yi Li

The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare. The interactive nature of these models, guided by pre-training data and human instructions, has…

Machine Learning · Computer Science 2026-04-30 Yunkun Zhang , Jin Gao , Zheling Tan , Lingfeng Zhou , Kexin Ding , Mu Zhou , Shaoting Zhang , Dequan Wang

The increasing success of audio foundation models across various tasks has led to a growing need for improved interpretability to understand their intricate decision-making processes better. Existing methods primarily focus on explaining…

Sound · Computer Science 2024-10-11 Alican Akman , Qiyang Sun , Björn W. Schuller

Foundation models (FM) have demonstrated remarkable performance across a wide range of tasks (especially in the fields of natural language processing and computer vision), primarily attributed to their ability to comprehend instructions and…

Artificial Intelligence · Computer Science 2025-02-11 Hongling Zheng , Li Shen , Anke Tang , Yong Luo , Han Hu , Bo Du , Yonggang Wen , Dacheng Tao

Recent advancements in artificial intelligence (AI), particularly foundation models (FMs), have revolutionized medical image analysis, demonstrating strong zero- and few-shot performance across diverse medical imaging tasks, from…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Praveenbalaji Rajendran , Mojtaba Safari , Wenfeng He , Mingzhe Hu , Shansong Wang , Jun Zhou , Xiaofeng Yang

Speech foundation models (SFMs) are designed to serve as general-purpose representations for a wide range of speech-processing tasks. The last five years have seen an influx of increasingly successful self-supervised and supervised…

Computation and Language · Computer Science 2025-08-19 Ankita Pasad

Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical…

Quantitative Methods · Quantitative Biology 2024-02-08 Qing Li , Zhihang Hu , Yixuan Wang , Lei Li , Yimin Fan , Irwin King , Le Song , Yu Li

Foundation models (FMs) are a popular topic of research in AI. Their ability to generalize to new tasks and datasets without retraining or needing an abundance of data makes them an appealing candidate for applications on specialist…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Marga Don , Stijn Pinson , Blanca Guillen Cebrian , Yuki M. Asano

Foundation models (FMs), including large language models, have become increasingly popular due to their wide-ranging applicability and ability to understand human-like semantics. While previous research has explored the use of FMs in…

Signal Processing · Electrical Eng. & Systems 2024-10-28 Peiwen Jiang , Chao-Kai Wen , Xinping Yi , Xiao Li , Shi Jin , Jun Zhang

The recent wave of audio foundation models (FMs) could provide new capabilities for conversational modeling. However, there have been limited efforts to evaluate these audio FMs comprehensively on their ability to have natural and…

Computation and Language · Computer Science 2025-03-04 Siddhant Arora , Zhiyun Lu , Chung-Cheng Chiu , Ruoming Pang , Shinji Watanabe

Understanding the inner mechanisms of black-box foundation models (FMs) is essential yet challenging in artificial intelligence and its applications. Over the last decade, the long-running focus has been on their explainability, leading to…

Machine Learning · Computer Science 2024-11-26 Shi Fu , Yuzhu Chen , Yingjie Wang , Dacheng Tao

Brain foundation models (BFMs) have emerged as a transformative paradigm in computational neuroscience, offering a revolutionary framework for processing diverse neural signals across different brain-related tasks. These models leverage…

Machine Learning · Computer Science 2025-07-22 Xinliang Zhou , Chenyu Liu , Zhisheng Chen , Kun Wang , Yi Ding , Ziyu Jia , Qingsong Wen

Artificial intelligence (AI) has emerged as a pivotal enabler for next-generation wireless communication systems. However, conventional AI-based models encounter several limitations, such as heavy reliance on labeled data, limited…

Signal Processing · Electrical Eng. & Systems 2025-10-14 Jun Jiang , Yuan Gao , Xinyi Wu , Shugong Xu

Audio-Language Models (ALMs), trained on paired audio-text data, are designed to process, understand, and reason about audio-centric multimodal content. Unlike traditional supervised approaches that use predefined labels, ALMs leverage…

Sound · Computer Science 2026-03-13 Yi Su , Jisheng Bai , Qisheng Xu , Kele Xu , Yong Dou
‹ Prev 1 2 3 10 Next ›