Related papers: A Roadmap for Big Model

200 papers

Large language models are deep learning models with a large number of parameters. The models have made noticeable progress on a large number of tasks, allowing them to serve as valuable and versatile tools for a diverse…

Software Engineering · Computer Science 2023-04-11 Maxim Vidgof, Stefan Bachhofner, Jan Mendling

The increasing scale of model size and continuous improvement of performance herald the arrival of the Big Model era. In this report, we explore what and how the big model training works by diving into training objectives and training…

Machine Learning · Computer Science 2022-07-26 Qinghua Liu, Yuxiang Jiang

With the urgent demand for generalized deep models, many pre-trained big models are proposed, such as BERT, ViT, GPT, etc. Inspired by the success of these models in single domains (like computer vision and natural language processing), the…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao

The proliferation of Large Language Models (LLMs) has fueled a shift in robot learning from automation towards general embodied Artificial Intelligence (AI). Adopting foundation models together with traditional learning methods to robot…

Robotics · Computer Science 2023-11-27 Xuan Xiao, Jiahang Liu, Zhipeng Wang, Yanmin Zhou, Yong Qi, Qian Cheng, Bin He, Shuo Jiang

Multimodal Large Models (MLMs) are becoming a significant research focus, combining powerful large language models with multimodal learning to perform complex tasks across different data modalities. This review explores the latest…

Machine Learning · Computer Science 2024-07-02 Xinji Mai, Zeng Tao, Junxiong Lin, Haoran Wang, Yang Chang, Yanlan Kang, Yan Wang, Wenqiang Zhang

Recent advances in big/foundation models reveal a promising path for deep learning, where the roadmap steadily moves from big data to big models to (the newly-introduced) big learning. Specifically, the big learning exhaustively exploits…

Machine Learning · Computer Science 2023-05-23 Yulai Cong, Miaoyun Zhao

Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However,…

Machine Learning · Computer Science 2020-08-11 Meng Wang, Weijie Fu, Xiangnan He, Shijie Hao, Xindong Wu

The rapid evolution of Multimodal Large Language Models (MLLMs) has brought substantial advancements in artificial intelligence, significantly enhancing the capability to understand and generate multimodal content. While prior studies have…

Artificial Intelligence · Computer Science 2024-09-30 Lin Li, Guikun Chen, Hanrong Shi, Jun Xiao, Long Chen

In recent years, the integration of large language models (LLMs) has revolutionized the field of robotics, enabling robots to communicate, understand, and reason with human-like proficiency. This paper explores the multifaceted impact of…

Robotics · Computer Science 2024-08-16 Yeseung Kim, Dohyun Kim, Jieun Choi, Jisang Park, Nayoung Oh, Daehyung Park

The field of deep learning has witnessed significant progress, particularly in computer vision (CV), natural language processing (NLP), and speech. The use of large-scale models trained on vast amounts of data holds immense promise for…

Machine Learning · Computer Science 2023-04-10 Li Shen, Yan Sun, Zhiyuan Yu, Liang Ding, Xinmei Tian, Dacheng Tao

Large language models (LLMs) are a class of artificial intelligence models based on deep learning, which have great performance in various tasks, especially in natural language processing (NLP). Large language models typically consist of…

Quantitative Methods · Quantitative Biology 2025-02-04 Jiajia Liu, Mengyuan Yang, Yankai Yu, Haixia Xu, Tiangang Wang, Kang Li, Xiaobo Zhou

Recently, Large Language Models (LLM) have demonstrated impressive capability to solve a wide range of tasks. However, despite their success across various tasks, no prior work has investigated their capability in the biomedical domain yet.…

Computation and Language · Computer Science 2024-02-21 Israt Jahan, Md Tahmid Rahman Laskar, Chun Peng, Jimmy Huang

Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success and become a milestone in the field of artificial intelligence (AI). Owing to sophisticated pre-training objectives and huge model parameters,…

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems have been solved in tasks such as game playing and robotics. Unfortunately, the sample complexity of most…

Machine Learning · Computer Science 2020-12-03 Aske Plaat, Walter Kosters, Mike Preuss

This is a book about large language models. As indicated by the title, it primarily focuses on foundational concepts rather than comprehensive coverage of all cutting-edge technologies. The book is structured into five main chapters, each…

Computation and Language · Computer Science 2025-06-17 Tong Xiao, Jingbo Zhu

Business Process Management (BPM) aims to improve organizational activities and their outcomes by managing the underlying processes. To achieve this, it is often necessary to consider information from various sources, including unstructured…

Computation and Language · Computer Science 2023-07-20 Michael Grohs, Luka Abb, Nourhan Elsayed, Jana-Rebecca Rehse

Model-Driven Engineering (MDE) has seen significant advancements with the integration of Machine Learning (ML) and Deep Learning (DL) techniques. Building upon the groundwork of previous investigations, our study provides a concise overview…

Software Engineering · Computer Science 2024-10-24 Juri Di Rocco, Davide Di Ruscio, Claudio Di Sipio, Phuong T. Nguyen, Riccardo Rubei

Large Language Models (LLMs) have become effective tools for natural language processing and have been used in many different fields. This essay offers a succinct summary of various LLM subcategories. The survey emphasizes recent developments…

Computation and Language · Computer Science 2023-07-21 Saurabh Pahune, Manoj Chandrasekharan

The rapid advancement of Large Language Models (LLMs) has opened new possibilities in Multi-Robot Systems (MRS), enabling enhanced communication, task allocation and planning, and human-robot interaction. Unlike traditional single-robot and…

Robotics · Computer Science 2026-05-05 Peihan Li, Zijian An, Shams Abrar, Lifeng Zhou

Deep learning (DL) has transformed applications in a variety of domains, including computer vision, natural language processing, and tabular data analysis. The search for improved DL model accuracy has led practitioners to explore…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-10 Kabir Nagrecha