Related papers: Data Processing Techniques for Modern Multimodal M…

A Survey of Multimodal Large Language Model from A Data-centric Perspective

Multimodal large language models (MLLMs) enhance the capabilities of standard large language models by integrating and processing data from multiple modalities, including text, vision, audio, video, and 3D environments. Data plays a pivotal…

Artificial Intelligence · Computer Science 2024-07-19 Tianyi Bai , Hao Liang , Binwang Wan , Yanran Xu , Xi Li , Shiyu Li , Ling Yang , Bozhou Li , Yifan Wang , Bin Cui , Ping Huang , Jiulong Shan , Conghui He , Binhang Yuan , Wentao Zhang

Data Management For Training Large Language Models: A Survey

Data plays a fundamental role in training Large Language Models (LLMs). Efficient data management, particularly in formulating a well-suited training dataset, is significant for enhancing model performance and improving training efficiency…

Computation and Language · Computer Science 2024-08-05 Zige Wang , Wanjun Zhong , Yufei Wang , Qi Zhu , Fei Mi , Baojun Wang , Lifeng Shang , Xin Jiang , Qun Liu

From Efficient Multimodal Models to World Models: A Survey

Multimodal Large Models (MLMs) are becoming a significant research focus, combining powerful large language models with multimodal learning to perform complex tasks across different data modalities. This review explores the latest…

Machine Learning · Computer Science 2024-07-02 Xinji Mai , Zeng Tao , Junxiong Lin , Haoran Wang , Yang Chang , Yanlan Kang , Yan Wang , Wenqiang Zhang

Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy

Multimodal learning, a rapidly evolving field in artificial intelligence, seeks to construct more versatile and robust systems by integrating and analyzing diverse types of data, including text, images, audio, and video. Inspired by the…

Artificial Intelligence · Computer Science 2024-12-24 Priyaranjan Pattnayak , Hitesh Laxmichand Patel , Bhargava Kumar , Amit Agarwal , Ishan Banerjee , Srikant Panda , Tejaswini Kumar

Multimodal Large Language Models: A Survey

The exploration of multimodal language models integrates multiple data types, such as images, text, language, audio, and other heterogeneity. While the latest large language models excel in text-based tasks, they often struggle to…

Artificial Intelligence · Computer Science 2023-11-23 Jiayang Wu , Wensheng Gan , Zefeng Chen , Shicheng Wan , Philip S. Yu

A Survey on Data Selection for LLM Instruction Tuning

Instruction tuning is a vital step of training large language models (LLMs), so how to enhance the effect of instruction tuning has received increased attention. Existing works indicate that the quality of the dataset is more crucial than…

Computation and Language · Computer Science 2025-08-27 Bolin Zhang , Jiahao Wang , Qianlong Du , Jiajun Zhang , Zhiying Tu , Dianhui Chu

Personalized Multimodal Large Language Models: A Survey

Multimodal Large Language Models (MLLMs) have become increasingly important due to their state-of-the-art performance and ability to integrate multiple data modalities, such as text, images, and audio, to perform complex tasks with high…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 Junda Wu , Hanjia Lyu , Yu Xia , Zhehao Zhang , Joe Barrow , Ishita Kumar , Mehrnoosh Mirtaheri , Hongjie Chen , Ryan A. Rossi , Franck Dernoncourt , Tong Yu , Ruiyi Zhang , Jiuxiang Gu , Nesreen K. Ahmed , Yu Wang , Xiang Chen , Hanieh Deilamsalehy , Namyong Park , Sungchul Kim , Huanrui Yang , Subrata Mitra , Zhengmian Hu , Nedim Lipka , Dang Nguyen , Yue Zhao , Jiebo Luo , Julian McAuley

Introduction to Multilevel Modeling Techniques

In this paper, I outline several conceptual and methodological issues related to modeling individual and group processes embedded in clustered/hierarchical data structures. We position multilevel modeling techniques within a broader set of…

Methodology · Statistics 2022-12-29 Amira Ibrahim El-Desokey

Data Mixing for Large Language Models Pretraining: A Survey and Outlook

Large language models (LLMs) rely on pretraining on massive and heterogeneous corpora, where training data composition has a decisive impact on training efficiency and downstream generalization under realistic compute and data budget…

Computation and Language · Computer Science 2026-04-21 Zhuo Chen , Yuxuan Miao , Supryadi , Deyi Xiong

LLM Data Selection and Utilization via Dynamic Bi-level Optimization

While large-scale training data is fundamental for developing capable large language models (LLMs), strategically selecting high-quality data has emerged as a critical approach to enhance training efficiency and reduce computational costs.…

Machine Learning · Computer Science 2025-07-23 Yang Yu , Kai Han , Hang Zhou , Yehui Tang , Kaiqi Huang , Yunhe Wang , Dacheng Tao

Review of multimodal machine learning approaches in healthcare

Machine learning methods in healthcare have traditionally focused on using data from a single modality, limiting their ability to effectively replicate the clinical practice of integrating multiple sources of information for improved…

Machine Learning · Computer Science 2024-02-13 Felix Krones , Umar Marikkar , Guy Parsons , Adam Szmul , Adam Mahdi

Large Language Model for Table Processing: A Survey

Tables, typically two-dimensional and structured to store large amounts of data, are essential in daily activities like database queries, spreadsheet manipulations, web table question answering, and image table information extraction.…

Artificial Intelligence · Computer Science 2024-11-05 Weizheng Lu , Jing Zhang , Ju Fan , Zihao Fu , Yueguo Chen , Xiaoyong Du

A Survey on Large-scale Machine Learning

Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However,…

Machine Learning · Computer Science 2020-08-11 Meng Wang , Weijie Fu , Xiangnan He , Shijie Hao , Xindong Wu

Process Modeling With Large Language Models

In the realm of Business Process Management (BPM), process modeling plays a crucial role in translating complex process dynamics into comprehensible visual representations, facilitating the understanding, analysis, improvement, and…

Software Engineering · Computer Science 2024-07-01 Humam Kourani , Alessandro Berti , Daniel Schuster , Wil M. P. van der Aalst

Multimodal Grounding for Language Processing

This survey discusses how recent developments in multimodal processing facilitate conceptual grounding of language. We categorize the information flow in multimodal processing with respect to cognitive models of human information processing…

Computation and Language · Computer Science 2019-07-04 Lisa Beinborn , Teresa Botschen , Iryna Gurevych

Diffusion Models For Multi-Modal Generative Modeling

Diffusion-based generative modeling has been achieving state-of-the-art results on various generation tasks. Most diffusion models, however, are limited to a single-generation modeling. Can we generalize diffusion models with the ability of…

Computer Vision and Pattern Recognition · Computer Science 2024-09-26 Changyou Chen , Han Ding , Bunyamin Sisman , Yi Xu , Ouye Xie , Benjamin Z. Yao , Son Dinh Tran , Belinda Zeng

A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks

In an era defined by the explosive growth of data and rapid technological advancements, Multimodal Large Language Models (MLLMs) stand at the forefront of artificial intelligence (AI) systems. Designed to seamlessly integrate diverse data…

Artificial Intelligence · Computer Science 2024-08-05 Jiaqi Wang , Hanqi Jiang , Yiheng Liu , Chong Ma , Xu Zhang , Yi Pan , Mengyuan Liu , Peiran Gu , Sichen Xia , Wenjun Li , Yutong Zhang , Zihao Wu , Zhengliang Liu , Tianyang Zhong , Bao Ge , Tuo Zhang , Ning Qiang , Xintao Hu , Xi Jiang , Xin Zhang , Wei Zhang , Dinggang Shen , Tianming Liu , Shu Zhang

A Survey on Data Augmentation in Large Model Era

Large models, encompassing large language and diffusion models, have shown exceptional promise in approximating human-level intelligence, garnering significant interest from both academic and industrial spheres. However, the training of…

Machine Learning · Computer Science 2024-03-05 Yue Zhou , Chenlu Guo , Xu Wang , Yi Chang , Yuan Wu

Large Language Models for Business Process Management: Opportunities and Challenges

Large language models are deep learning models with a large number of parameters. The models made noticeable progress on a large number of tasks, and as a consequence allowing them to serve as valuable and versatile tools for a diverse…

Software Engineering · Computer Science 2023-04-11 Maxim Vidgof , Stefan Bachhofner , Jan Mendling

Investigating Public Fine-Tuning Datasets: A Complex Review of Current Practices from a Construction Perspective

With the rapid development of the large model domain, research related to fine-tuning has concurrently seen significant advancement, given that fine-tuning is a constituent part of the training process for large-scale models. Data…

Computation and Language · Computer Science 2024-07-12 Runyuan Ma , Wei Li , Fukai Shang