Related papers: A Roadmap for Big Model

Large Language Models for Business Process Management: Opportunities and Challenges

Large language models are deep learning models with a large number of parameters. The models made noticeable progress on a large number of tasks, and as a consequence allowing them to serve as valuable and versatile tools for a diverse…

Software Engineering · Computer Science 2023-04-11 Maxim Vidgof , Stefan Bachhofner , Jan Mendling

Dive into Big Model Training

The increasing scale of model size and continuous improvement of performance herald the arrival of the Big Model era. In this report, we explore what and how the big model training works by diving into training objectives and training…

Machine Learning · Computer Science 2022-07-26 Qinghua Liu , Yuxiang Jiang

Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey

With the urgent demand for generalized deep models, many pre-trained big models are proposed, such as BERT, ViT, GPT, etc. Inspired by the success of these models in single domains (like computer vision and natural language processing), the…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Xiao Wang , Guangyao Chen , Guangwu Qian , Pengcheng Gao , Xiao-Yong Wei , Yaowei Wang , Yonghong Tian , Wen Gao

Robot Learning in the Era of Foundation Models: A Survey

The proliferation of Large Language Models (LLMs) has s fueled a shift in robot learning from automation towards general embodied Artificial Intelligence (AI). Adopting foundation models together with traditional learning methods to robot…

Robotics · Computer Science 2023-11-27 Xuan Xiao , Jiahang Liu , Zhipeng Wang , Yanmin Zhou , Yong Qi , Qian Cheng , Bin He , Shuo Jiang

From Efficient Multimodal Models to World Models: A Survey

Multimodal Large Models (MLMs) are becoming a significant research focus, combining powerful large language models with multimodal learning to perform complex tasks across different data modalities. This review explores the latest…

Machine Learning · Computer Science 2024-07-02 Xinji Mai , Zeng Tao , Junxiong Lin , Haoran Wang , Yang Chang , Yanlan Kang , Yan Wang , Wenqiang Zhang

Big Learning

Recent advances in big/foundation models reveal a promising path for deep learning, where the roadmap steadily moves from big data to big models to (the newly-introduced) big learning. Specifically, the big learning exhaustively exploits…

Machine Learning · Computer Science 2023-05-23 Yulai Cong , Miaoyun Zhao

A Survey on Large-scale Machine Learning

Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However,…

Machine Learning · Computer Science 2020-08-11 Meng Wang , Weijie Fu , Xiangnan He , Shijie Hao , Xindong Wu

A Survey on Multimodal Benchmarks: In the Era of Large AI Models

The rapid evolution of Multimodal Large Language Models (MLLMs) has brought substantial advancements in artificial intelligence, significantly enhancing the capability to understand and generate multimodal content. While prior studies have…

Artificial Intelligence · Computer Science 2024-09-30 Lin Li , Guikun Chen , Hanrong Shi , Jun Xiao , Long Chen

A Survey on Integration of Large Language Models with Intelligent Robots

In recent years, the integration of large language models (LLMs) has revolutionized the field of robotics, enabling robots to communicate, understand, and reason with human-like proficiency. This paper explores the multifaceted impact of…

Robotics · Computer Science 2024-08-16 Yeseung Kim , Dohyun Kim , Jieun Choi , Jisang Park , Nayoung Oh , Daehyung Park

On Efficient Training of Large-Scale Deep Learning Models: A Literature Review

The field of deep learning has witnessed significant progress, particularly in computer vision (CV), natural language processing (NLP), and speech. The use of large-scale models trained on vast amounts of data holds immense promise for…

Machine Learning · Computer Science 2023-04-10 Li Shen , Yan Sun , Zhiyuan Yu , Liang Ding , Xinmei Tian , Dacheng Tao

Advancing bioinformatics with large language models: components, applications and perspectives

Large language models (LLMs) are a class of artificial intelligence models based on deep learning, which have great performance in various tasks, especially in natural language processing (NLP). Large language models typically consist of…

Quantitative Methods · Quantitative Biology 2025-02-04 Jiajia Liu , Mengyuan Yang , Yankai Yu , Haixia Xu , Tiangang Wang , Kang Li , Xiaobo Zhou

A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks

Recently, Large Language Models (LLM) have demonstrated impressive capability to solve a wide range of tasks. However, despite their success across various tasks, no prior work has investigated their capability in the biomedical domain yet.…

Computation and Language · Computer Science 2024-02-21 Israt Jahan , Md Tahmid Rahman Laskar , Chun Peng , Jimmy Huang

Pre-Trained Models: Past, Present and Future

Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success and become a milestone in the field of artificial intelligence (AI). Owing to sophisticated pre-training objectives and huge model parameters,…

Artificial Intelligence · Computer Science 2021-08-12 Xu Han , Zhengyan Zhang , Ning Ding , Yuxian Gu , Xiao Liu , Yuqi Huo , Jiezhong Qiu , Yuan Yao , Ao Zhang , Liang Zhang , Wentao Han , Minlie Huang , Qin Jin , Yanyan Lan , Yang Liu , Zhiyuan Liu , Zhiwu Lu , Xipeng Qiu , Ruihua Song , Jie Tang , Ji-Rong Wen , Jinhui Yuan , Wayne Xin Zhao , Jun Zhu

Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems have been solved in tasks such as game playing and robotics. Unfortunately, the sample complexity of most…

Machine Learning · Computer Science 2020-12-03 Aske Plaat , Walter Kosters , Mike Preuss

Foundations of Large Language Models

This is a book about large language models. As indicated by the title, it primarily focuses on foundational concepts rather than comprehensive coverage of all cutting-edge technologies. The book is structured into five main chapters, each…

Computation and Language · Computer Science 2025-06-17 Tong Xiao , Jingbo Zhu

Large Language Models can accomplish Business Process Management Tasks

Business Process Management (BPM) aims to improve organizational activities and their outcomes by managing the underlying processes. To achieve this, it is often necessary to consider information from various sources, including unstructured…

Computation and Language · Computer Science 2023-07-20 Michael Grohs , Luka Abb , Nourhan Elsayed , Jana-Rebecca Rehse

On the use of Large Language Models in Model-Driven Engineering

Model-Driven Engineering (MDE) has seen significant advancements with the integration of Machine Learning (ML) and Deep Learning (DL) techniques. Building upon the groundwork of previous investigations, our study provides a concise overview…

Software Engineering · Computer Science 2024-10-24 Juri Di Rocco , Davide Di Ruscio , Claudio Di Sipio , Phuong T. Nguyen , Riccardo Rubei

Several categories of Large Language Models (LLMs): A Short Survey

Large Language Models(LLMs)have become effective tools for natural language processing and have been used in many different fields. This essay offers a succinct summary of various LLM subcategories. The survey emphasizes recent developments…

Computation and Language · Computer Science 2023-07-21 Saurabh Pahune , Manoj Chandrasekharan

Large Language Models for Multi-Robot Systems: A Survey

The rapid advancement of Large Language Models (LLMs) has opened new possibilities in Multi-Robot Systems (MRS), enabling enhanced communication, task allocation and planning, and human-robot interaction. Unlike traditional single-robot and…

Robotics · Computer Science 2026-05-05 Peihan Li , Zijian An , Shams Abrar , Lifeng Zhou

Systems for Parallel and Distributed Large-Model Deep Learning Training

Deep learning (DL) has transformed applications in a variety of domains, including computer vision, natural language processing, and tabular data analysis. The search for improved DL model accuracy has led practitioners to explore…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-10 Kabir Nagrecha