English
Related papers

Related papers: Fathom: Reference Workloads for Modern Deep Learni…

200 papers

Researchers have been highly active to investigate the classical machine learning workflow and integrate best practices from the software engineering lifecycle. However, deep learning exhibits deviations that are not yet covered in this…

Software Engineering · Computer Science 2022-08-30 Janosch Baltensperger , Pasquale Salza , Harald C. Gall

In recent years, deep learning techniques have outperformed traditional models in many machine learning tasks. Deep neural networks have successfully been applied to address time series forecasting problems, which is a very important topic…

Machine Learning · Computer Science 2021-04-09 Pedro Lara-Benítez , Manuel Carranza-García , José C. Riquelme

Sensors are an integral part of modern Internet of Things (IoT) applications. There is a critical need for the analysis of heterogeneous multivariate temporal data obtained from the individual sensors of these systems. In this paper we…

Machine Learning · Computer Science 2019-05-14 Yujing Chen , Yue Ning , Zheng Chai , Huzefa Rangwala

Deep learning, a branch of artificial intelligence, is a data-driven method that uses multiple layers of interconnected units or neurons to learn intricate patterns and representations directly from raw input data. Empowered by this…

Machine Learning · Computer Science 2025-07-28 Mohd Halim Mohd Noor , Ayokunle Olalekan Ige

Federated learning (FL) offers privacy-preserving decentralized machine learning, optimizing models at edge clients without sharing private data. Simultaneously, foundation models (FMs) have gained traction in the artificial intelligence…

Machine Learning · Computer Science 2023-10-06 Sixing Yu , J. Pablo Muñoz , Ali Jannesari

The field of deep learning has witnessed a remarkable shift towards extremely compute- and memory-intensive neural networks. These newer larger models have enabled researchers to advance state-of-the-art tools across a variety of fields.…

Machine Learning · Computer Science 2022-07-04 Daniel Nichols , Siddharth Singh , Shu-Huai Lin , Abhinav Bhatele

The quality of learned features by representation learning determines the performance of learning algorithms and the related application tasks (such as high-dimensional data clustering). As a relatively new paradigm for representation…

Machine Learning · Computer Science 2021-02-02 Zhao Zhang , Yan Zhang , Mingliang Xu , Li Zhang , Yi Yang , Shuicheng Yan

Foundation models (FM) have demonstrated remarkable performance across a wide range of tasks (especially in the fields of natural language processing and computer vision), primarily attributed to their ability to comprehend instructions and…

Artificial Intelligence · Computer Science 2025-02-11 Hongling Zheng , Li Shen , Anke Tang , Yong Luo , Han Hu , Bo Du , Yonggang Wen , Dacheng Tao

We propose DFModel, a modeling framework for mapping dataflow computation graphs onto large-scale systems. Mapping a workload to a system requires optimizing dataflow mappings at various levels, including the inter-chip (between chips)…

Hardware Architecture · Computer Science 2024-12-24 Sho Ko , Nathan Zhang , Olivia Hsu , Ardavan Pedram , Kunle Olukotun

State-of-the-art machine learning frameworks support a wide variety of design features to enable a flexible machine learning programming interface and to ease the programmability burden on machine learning developers. Identifying and using…

Machine Learning · Computer Science 2020-07-01 Yu Emma Wang , Carole-Jean Wu , Xiaodong Wang , Kim Hazelwood , David Brooks

Deep learning (DL) is becoming increasingly popular in several application domains and has made several new application features involving computer vision, speech recognition and synthesis, self-driving automobiles, drug design, etc.…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-09-17 K. R. Jayaram , Vinod Muthusamy , Parijat Dube , Vatche Ishakian , Chen Wang , Benjamin Herta , Scott Boag , Diana Arroyo , Asser Tantawi , Archit Verma , Falk Pollok , Rania Khalaf

Foundation models have demonstrated a great ability to achieve general human-level intelligence far beyond traditional approaches. As the technique keeps attracting attention from the AI community, an increasing number of foundation models…

Computation and Language · Computer Science 2024-05-07 Shizhe Diao , Rui Pan , Hanze Dong , Ka Shun Shum , Jipeng Zhang , Wei Xiong , Tong Zhang

The integration of Foundation Models (FMs) with Federated Learning (FL) presents a transformative paradigm in Artificial Intelligence (AI). This integration offers enhanced capabilities, while addressing concerns of privacy, data…

Machine Learning · Computer Science 2024-09-10 Chao Ren , Han Yu , Hongyi Peng , Xiaoli Tang , Bo Zhao , Liping Yi , Alysa Ziying Tan , Yulan Gao , Anran Li , Xiaoxiao Li , Zengxiang Li , Qiang Yang

Tool-integrated reasoning has emerged as a key focus for enabling agentic applications. Among these, DeepResearch Agents have gained significant attention for their strong performance on complex, open-ended information-seeking tasks. We…

Artificial Intelligence · Computer Science 2025-09-30 Shreyas Singh , Kunal Singh , Pradeep Moturi

Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems. Despite great progress, existing methods have a strong bias towards low- or high-order interactions, or rely on…

Information Retrieval · Computer Science 2018-05-17 Huifeng Guo , Ruiming Tang , Yunming Ye , Zhenguo Li , Xiuqiang He , Zhenhua Dong

The advent of the Transformer architecture has propelled the growth of natural language processing (NLP) models, leading to remarkable achievements in numerous NLP tasks. Yet, the absence of specialized hardware like expansive GPU memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-18 Xiaofeng Wu , Jia Rao , Wei Chen

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

"Deep Learning"/"Deep Neural Nets" is a technological marvel that is now increasingly deployed at the cutting-edge of artificial intelligence tasks. This dramatic success of deep learning in the last few years has been hinged on an enormous…

Machine Learning · Computer Science 2021-04-30 Anirbit Mukherjee

The transition from standard generative AI to \emph{reasoning-centric architectures}, exemplified by models capable of extensive Chain-of-Thought~(CoT) processing, marks a fundamental paradigm shift in system requirements. Unlike…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-20 Moiz Arif , Avinash Maurya , Sudharshan Vazhkudai , Bogdan Nicolae

Working memory (WM), a fundamental cognitive process facilitating the temporary storage, integration, manipulation, and retrieval of information, plays a vital role in reasoning and decision-making tasks. Robust benchmark datasets that…

Neurons and Cognition · Quantitative Biology 2023-11-02 Ankur Sikarwar , Mengmi Zhang
‹ Prev 1 2 3 10 Next ›