Related papers: Computer Audition: From Task-Specific Machine Lear…

Foundation Models in Medical Imaging: A Review and Outlook

Foundation models (FMs) are changing the way medical images are analyzed by learning from large collections of unlabeled data. Instead of relying on manually annotated examples, FMs are pre-trained to learn general-purpose visual features…

Image and Video Processing · Electrical Eng. & Systems 2025-11-19 Vivien van Veldhuizen , Vanessa Botha , Chunyao Lu , Melis Erdal Cesur , Kevin Groot Lipman , Edwin D. de Jong , Hugo Horlings , Clárisa I. Sanchez , Cees G. M. Snoek , Lodewyk Wessels , Ritse Mann , Eric Marcus , Jonas Teuwen

A Survey of Audio Reasoning in Multimodal Foundation Models

Reasoning has become a defining capability of modern foundation models, yet its development in the audio modality remains limited. Audio poses challenges that are distinct from those of text and vision. It is continuous, temporally dense,…

Audio and Speech Processing · Electrical Eng. & Systems 2026-05-21 Zhihan Guo , Wenqian Cui , Guan-Ting Lin , Daxin Tan , Jingyao Li , Qiyong Zheng , Dingdong Wang , Jing Xiong , Han Shi , Jiaya Jia , Irwin King

A Survey of Foundation Models for Music Understanding

Music is essential in daily life, fulfilling emotional and entertainment needs, and connecting us personally, socially, and culturally. A better understanding of music can enhance our emotions, cognitive skills, and cultural connections.…

Sound · Computer Science 2024-09-17 Wenjun Li , Ying Cai , Ziyang Wu , Wenyi Zhang , Yifan Chen , Rundong Qi , Mengqi Dong , Peigen Chen , Xiao Dong , Fenghao Shi , Lei Guo , Junwei Han , Bao Ge , Tianming Liu , Lin Gan , Tuo Zhang

Towards General Auditory Intelligence: Large Multimodal Models for Machine Listening and Speaking

In the era of large language models (LLMs) and artificial general intelligence (AGI), computer audition must evolve beyond traditional paradigms to fully leverage the capabilities of foundation models, towards more comprehensive…

Audio and Speech Processing · Electrical Eng. & Systems 2025-11-04 Siyin Wang , Zengrui Jin , Changli Tang , Qiujia Li , Bo Li , Chen Chen , Yuchen Hu , Wenyi Yu , Yixuan Li , Jimin Zhuang , Yudong Yang , Mingqiu Wang , Michael Han , Yifan Ding , Junwen Bai , Tom Ouyang , Shuo-yiin Chang , Xianzhao Chen , Xiaohai Tian , Jun Zhang , Lu Lu , Guangzhi Sun , Zhehuai Chen , Ji Wu , Bowen Zhou , Yuxuan Wang , Tara Sainath , Yonghui Wu , Chao Zhang

Affective Computing Has Changed: The Foundation Model Disruption

The dawn of Foundation Models has on the one hand revolutionised a wide range of research problems, and, on the other hand, democratised the access and use of AI-based tools by the general public. We even observe an incursion of these…

Artificial Intelligence · Computer Science 2024-09-16 Björn Schuller , Adria Mallol-Ragolta , Alejandro Peña Almansa , Iosif Tsangko , Mostafa M. Amin , Anastasia Semertzidou , Lukas Christ , Shahin Amiriparian

Assessing the Utility of Audio Foundation Models for Heart and Respiratory Sound Analysis

Pre-trained deep learning models, known as foundation models, have become essential building blocks in machine learning domains such as natural language processing and image domains. This trend has extended to respiratory and heart sound…

Audio and Speech Processing · Electrical Eng. & Systems 2025-04-28 Daisuke Niizumi , Daiki Takeuchi , Masahiro Yasuda , Binh Thien Nguyen , Yasunori Ohishi , Noboru Harada

City Foundation Models for Learning General Purpose Representations from OpenStreetMap

Pre-trained Foundation Models (PFMs) have ushered in a paradigm-shift in Artificial Intelligence, due to their ability to learn general-purpose representations that can be readily employed in a wide range of downstream tasks. While PFMs…

Databases · Computer Science 2024-11-13 Pasquale Balsebre , Weiming Huang , Gao Cong , Yi Li

Data-Centric Foundation Models in Computational Healthcare: A Survey

The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare. The interactive nature of these models, guided by pre-training data and human instructions, has…

Machine Learning · Computer Science 2026-04-30 Yunkun Zhang , Jin Gao , Zheling Tan , Lingfeng Zhou , Kexin Ding , Mu Zhou , Shaoting Zhang , Dequan Wang

Audio Explanation Synthesis with Generative Foundation Models

The increasing success of audio foundation models across various tasks has led to a growing need for improved interpretability to understand their intricate decision-making processes better. Existing methods primarily focus on explaining…

Sound · Computer Science 2024-10-11 Alican Akman , Qiyang Sun , Björn W. Schuller

Learning from models beyond fine-tuning

Foundation models (FM) have demonstrated remarkable performance across a wide range of tasks (especially in the fields of natural language processing and computer vision), primarily attributed to their ability to comprehend instructions and…

Artificial Intelligence · Computer Science 2025-02-11 Hongling Zheng , Li Shen , Anke Tang , Yong Luo , Han Hu , Bo Du , Yonggang Wen , Dacheng Tao

Foundation Models in Medical Image Analysis: A Systematic Review and Meta-Analysis

Recent advancements in artificial intelligence (AI), particularly foundation models (FMs), have revolutionized medical image analysis, demonstrating strong zero- and few-shot performance across diverse medical imaging tasks, from…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Praveenbalaji Rajendran , Mojtaba Safari , Wenfeng He , Mingzhe Hu , Shansong Wang , Jun Zhou , Xiaofeng Yang

What do Speech Foundation Models Learn? Analysis and Applications

Speech foundation models (SFMs) are designed to serve as general-purpose representations for a wide range of speech-processing tasks. The last five years have seen an influx of increasingly successful self-supervised and supervised…

Computation and Language · Computer Science 2025-08-19 Ankita Pasad

Progress and Opportunities of Foundation Models in Bioinformatics

Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical…

Quantitative Methods · Quantitative Biology 2024-02-08 Qing Li , Zhihang Hu , Yixuan Wang , Lei Li , Yimin Fan , Irwin King , Le Song , Yu Li

Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution

Foundation models (FMs) are a popular topic of research in AI. Their ability to generalize to new tasks and datasets without retraining or needing an abundance of data makes them an appealing candidate for applications on specialist…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Marga Don , Stijn Pinson , Blanca Guillen Cebrian , Yuki M. Asano

Semantic Communications using Foundation Models: Design Approaches and Open Issues

Foundation models (FMs), including large language models, have become increasingly popular due to their wide-ranging applicability and ability to understand human-like semantics. While previous research has explored the use of FMs in…

Signal Processing · Electrical Eng. & Systems 2024-10-28 Peiwen Jiang , Chao-Kai Wen , Xinping Yi , Xiao Li , Shi Jin , Jun Zhang

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics

The recent wave of audio foundation models (FMs) could provide new capabilities for conversational modeling. However, there have been limited efforts to evaluate these audio FMs comprehensively on their ability to have natural and…

Computation and Language · Computer Science 2025-03-04 Siddhant Arora , Zhiyun Lu , Chung-Cheng Chiu , Ruoming Pang , Shinji Watanabe

A Theoretical Survey on Foundation Models

Understanding the inner mechanisms of black-box foundation models (FMs) is essential yet challenging in artificial intelligence and its applications. Over the last decade, the long-running focus has been on their explainability, leading to…

Machine Learning · Computer Science 2024-11-26 Shi Fu , Yuzhu Chen , Yingjie Wang , Dacheng Tao

Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery

Brain foundation models (BFMs) have emerged as a transformative paradigm in computational neuroscience, offering a revolutionary framework for processing diverse neural signals across different brain-related tasks. These models leverage…

Machine Learning · Computer Science 2025-07-22 Xinliang Zhou , Chenyu Liu , Zhisheng Chen , Kun Wang , Yi Ding , Ziyu Jia , Qingsong Wen

Towards channel foundation models (CFMs): Motivations, methodologies and opportunities

Artificial intelligence (AI) has emerged as a pivotal enabler for next-generation wireless communication systems. However, conventional AI-based models encounter several limitations, such as heavy reliance on labeled data, limited…

Signal Processing · Electrical Eng. & Systems 2025-10-14 Jun Jiang , Yuan Gao , Xinyi Wu , Shugong Xu

Audio-Language Models for Audio-Centric Tasks: A Systematic Survey

Audio-Language Models (ALMs), trained on paired audio-text data, are designed to process, understand, and reason about audio-centric multimodal content. Unlike traditional supervised approaches that use predefined labels, ALMs leverage…

Sound · Computer Science 2026-03-13 Yi Su , Jisheng Bai , Qisheng Xu , Kele Xu , Yong Dou