Related papers: Training Transformers Together

Implementing and Experimenting with Diffusion Models for Text-to-Image Generation

Taking advantage of the many recent advances in deep learning, text-to-image generative models currently have the merit of attracting the general public attention. Two of these models, DALL-E 2 and Imagen, have demonstrated that highly…

Computer Vision and Pattern Recognition · Computer Science 2022-09-23 Robin Zbinden

Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing

Big Artificial Intelligence (AI) models have emerged as a crucial element in various intelligent applications at the edge, such as voice assistants in smart homes and autonomous robotics in smart factories. Training big AI models, e.g., for…

Machine Learning · Computer Science 2024-04-30 Liekang Zeng, Shengyuan Ye, Xu Chen, Yang Yang

Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms

Advanced imitation learning with structures like the transformer is increasingly demonstrating its advantages in robotics. However, deploying these large-scale models on embedded platforms remains a major challenge. In this paper, we…

Machine Learning · Computer Science 2024-11-19 Haizhou Ge, Ruixiang Wang, Zhu-ang Xu, Hongrui Zhu, Ruichen Deng, Yuhang Dong, Zeyu Pang, Guyue Zhou, Junyu Zhang, Lu Shi

Scalable Deep Learning on Distributed Infrastructures: Challenges, Techniques and Tools

Deep Learning (DL) has had an immense success in the recent past, leading to state-of-the-art results in various domains such as image recognition and natural language processing. One of the reasons for this success is the increasing size…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-09-26 Ruben Mayer, Hans-Arno Jacobsen

CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

Large-scale pretrained transformers have created milestones in text (GPT-3) and text-to-image (DALL-E and CogView) generation. Its application to video generation is still facing many challenges: The potential huge computation cost makes…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Wenyi Hong, Ming Ding, Wendi Zheng, Xinghan Liu, Jie Tang

Can collaborative learning be private, robust and scalable?

In federated learning for medical image analysis, the safety of the learning protocol is paramount. Such settings can often be compromised by adversaries that target either the private data used by the federation or the integrity of the…

Machine Learning · Computer Science 2022-08-09 Dmitrii Usynin, Helena Klause, Johannes C. Paetzold, Daniel Rueckert, Georgios Kaissis

Unsupervised Multi-Domain Multimodal Image-to-Image Translation with Explicit Domain-Constrained Disentanglement

Image-to-image translation has drawn great attention during the past few years. It aims to translate an image in one domain to a given reference image in another domain. Due to its effectiveness and efficiency, many applications can be…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Weihao Xia, Yujiu Yang, Jing-Hao Xue

Pretraining is All You Need for Image-to-Image Translation

We propose to use pretraining to boost general image-to-image translation. Prior image-to-image translation methods usually need dedicated architectural design and train individual translation models from scratch, struggling for…

Computer Vision and Pattern Recognition · Computer Science 2022-05-26 Tengfei Wang, Ting Zhang, Bo Zhang, Hao Ouyang, Dong Chen, Qifeng Chen, Fang Wen

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

Large language models (LLMs) are increasingly pivotal in a wide range of natural language processing tasks. Access to pre-trained models, courtesy of the open-source community, has made it possible to adapt these models to specific…

Computation and Language · Computer Science 2023-12-04 Kai Lv, Shuo Zhang, Tianle Gu, Shuhao Xing, Jiawei Hong, Keyu Chen, Xiaoran Liu, Yuqing Yang, Honglin Guo, Tengxiao Liu, Yu Sun, Qipeng Guo, Hang Yan, Xipeng Qiu

Team Learning as a Lens for Designing Human-AI Co-Creative Systems

Generative, ML-driven interactive systems have the potential to change how people interact with computers in creative processes - turning tools into co-creators. However, it is still unclear how we might achieve effective human-AI…

Human-Computer Interaction · Computer Science 2022-07-08 Frederic Gmeiner, Kenneth Holstein, Nikolas Martelaro

DeRelayL: Sustainable Decentralized Relay Learning

In the era of big data, large-scale machine learning models have revolutionized various fields, driving significant advancements. However, large-scale model training demands high financial and computational resources, which are only…

Machine Learning · Computer Science 2026-05-06 Haihan Duan, Tengfei Ma, Yuyang Qin, Runhao Zeng, Wei Cai, Victor C. M. Leung, Xiping Hu

LAFITE: Towards Language-Free Training for Text-to-Image Generation

One of the major challenges in training text-to-image generation models is the need of a large number of high-quality image-text pairs. While image samples are often easily accessible, the associated text descriptions typically require…

Computer Vision and Pattern Recognition · Computer Science 2022-03-25 Yufan Zhou, Ruiyi Zhang, Changyou Chen, Chunyuan Li, Chris Tensmeyer, Tong Yu, Jiuxiang Gu, Jinhui Xu, Tong Sun

Collaborative Training in Sensor Networks: A graphical model approach

Graphical models have been widely applied in solving distributed inference problems in sensor networks. In this paper, the problem of coordinating a network of sensors to train a unique ensemble estimator under communication constraints is…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Haipeng Zheng, Sanjeev R. Kulkarni, H. Vincent Poor

TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial Creation on Physical Tasks

Mixed-media tutorials, which integrate videos, images, text, and diagrams to teach procedural skills, offer more browsable alternatives than timeline-based videos. However, manually creating such tutorials is tedious, and existing automated…

Human-Computer Interaction · Computer Science 2024-03-14 Yuexi Chen, Vlad I. Morariu, Anh Truong, Zhicheng Liu

Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text

In this paper, we explore the possibility of building a unified foundation model that can be adapted to both vision-only and text-only tasks. Starting from BERT and ViT, we design a unified transformer consisting of modality-specific…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Qing Li, Boqing Gong, Yin Cui, Dan Kondratyuk, Xianzhi Du, Ming-Hsuan Yang, Matthew Brown

Learning image-to-image translation using paired and unpaired training samples

Image-to-image translation is a general name for a task where an image from one domain is converted to a corresponding image in another domain, given sufficient training data. Traditionally different approaches have been proposed depending…

Computer Vision and Pattern Recognition · Computer Science 2018-05-09 Soumya Tripathy, Juho Kannala, Esa Rahtu

Collaborative Learning over Wireless Networks: An Introductory Overview

In this chapter, we will mainly focus on collaborative training across wireless devices. Training a ML model is equivalent to solving an optimization problem, and many distributed optimization algorithms have been developed over the last…

Machine Learning · Computer Science 2021-12-13 Emre Ozfatura, Deniz Gunduz, H. Vincent Poor

Deep Ensembles for Low-Data Transfer Learning

In the low-data regime, it is difficult to train good supervised models from scratch. Instead practitioners turn to pre-trained models, leveraging transfer learning. Ensembling is an empirically and theoretically appealing way to construct…

Machine Learning · Computer Science 2020-10-20 Basil Mustafa, Carlos Riquelme, Joan Puigcerver, André Susano Pinto, Daniel Keysers, Neil Houlsby

Flatee: Federated Learning Across Trusted Execution Environments

Federated learning allows us to distributively train a machine learning model where multiple parties share local model parameters without sharing private data. However, parameter exchange may still leak information. Several approaches have…

Cryptography and Security · Computer Science 2021-11-15 Arup Mondal, Yash More, Ruthu Hulikal Rooparaghunath, Debayan Gupta

What happens when generative AI models train recursively on each others' outputs?

The internet serves as a common source of training data for generative AI (genAI) models but is increasingly populated with AI-generated content. This duality raises the possibility that future genAI models may be trained on other models'…

Machine Learning · Computer Science 2025-10-03 Hung Anh Vu, Galen Reeves, Emily Wenger