
Related papers: MIRGE: An Array-Based Computational Framework for …

200 papers

Large reasoning models (LRMs) have shown significant progress in test-time scaling through chain-of-thought prompting. Current approaches like search-o1 integrate retrieval augmented generation (RAG) into multi-step reasoning processes but…

Computation and Language · Computer Science · 2026-01-21 · Kaiwen Wei, Rui Shan, Dongsheng Zou, Jianzhong Yang, Bi Zhao, Junnan Zhu, Jiang Zhong

We introduce Mirage, the first multi-level superoptimizer for tensor programs. A key idea in Mirage is $\mu$Graphs, a uniform representation of tensor programs at the kernel, thread block, and thread levels of the GPU compute hierarchy.…

Machine Learning · Computer Science · 2025-06-09 · Mengdi Wu, Xinhao Cheng, Shengyu Liu, Chunan Shi, Jianan Ji, Kit Ao, Praveen Velliengiri, Xupeng Miao, Oded Padon, Zhihao Jia

Real-time multimodal inference on resource-constrained edge devices is essential for applications such as autonomous driving, human-computer interaction, and mobile health. However, prior work often overlooks the tight coupling between…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-31 · Runxi Huang, Mingxuan Yu, Mingyu Tsoi, Xiaomin Ouyang

To execute scientific computing programs such as deep learning at high speed, GPU acceleration is a powerful option. With the recent advancements in web technologies, interfaces like WebGL and WebGPU, which utilize GPUs on the client side…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-03-04 · Masatoshi Hidaka, Tatsuya Harada

Frequent subgraph mining (FSM) is an important task for exploratory data analysis on graph data. Over the years, many algorithms have been proposed to solve this task. These algorithms assume that the data structure of the mining task is…

Databases · Computer Science · 2013-07-24 · Mansurul A Bhuiyan, Mohammad Al Hasan

Code-mixing is a phenomenon of mixing words and phrases from two or more languages in a single utterance of speech and text. Due to the high linguistic diversity, code-mixing presents several challenges in evaluating standard natural…

Computation and Language · Computer Science · 2021-07-27 · Ayush Garg, Sammed S Kagi, Vivek Srivastava, Mayank Singh

Retrieval-Augmented Generation (RAG) has gained prominence as an effective method for enhancing the generative capabilities of Large Language Models (LLMs) through the incorporation of external knowledge. However, the evaluation of RAG…

Computation and Language · Computer Science · 2025-04-25 · Chanhee Park, Hyeonseok Moon, Chanjun Park, Heuiseok Lim

In today's era of Internet of Things (IoT), where massive amounts of data are produced by IoT and other devices, edge computing has emerged as a prominent paradigm for low-latency data processing. However, applications may have diverse…

Networking and Internet Architecture · Computer Science · 2021-07-19 · Md Washik Al Azad, Susmit Shannigrahi, Nicholas Stergiou, Francisco R. Ortega, Spyridon Mastorakis

Scientists are increasingly turning to datacenter-scale computers to produce and analyze massive arrays. Despite decades of database research that extols the virtues of declarative query processing, scientists still write, debug and…

Databases · Computer Science · 2017-02-28 · Haoyuan Xing, Sofoklis Floratos, Spyros Blanas, Suren Byna, Prabhat, Kesheng Wu, Paul Brown

To effectively leverage user-specific data, retrieval augmented generation (RAG) is employed in multimodal large language model (MLLM) applications. However, conventional retrieval approaches often suffer from limited retrieval accuracy.…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-04 · Maoliang Li, Ke Li, Yaoyang Liu, Jiayu Chen, Zihao Zheng, Yinjun Wu, Chenchen Liu, Xiang Chen

Access to diverse, well-annotated medical images with interactive learning tools is fundamental for training practitioners in medicine and related fields to improve their diagnostic skills and understanding of anatomical structures. While…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-07 · Miguel Diaz Benito, Cecilia Diana Albelda, Alvaro Garcia Martin, Jesus Bescos Cano, Marcos Escudero-Vinolo, Juan C. SanMiguel

We present SURGE, a streaming GPU encoding system deployed in production to generate embeddings for over 800 million texts across 40,000 logical partitions. Production embedding pipelines face a tension between logical data partitioning and…

Distributed, Parallel, and Cluster Computing · Computer Science · 2026-05-05 · Shashank Kapadia, Deep Narayan Mishra, Sujal Reddy Alugubelli, Ajay Kumar, Swapnil Yadav, Rishi Bhatia

In the Python world, NumPy arrays are the standard representation for numerical data. Here, we show how these arrays enable efficient implementation of numerical computations in a high-level language. Overall, three techniques are applied…

Mathematical Software · Computer Science · 2011-03-14 · Stefan Van Der Walt, S. Chris Colbert, Gaël Varoquaux

Driven by the increasing demand for low-latency and real-time processing, machine learning applications are steadily migrating toward edge computing platforms, where Field-Programmable Gate Arrays (FPGAs) are widely adopted for their energy…

Hardware Architecture · Computer Science · 2026-02-13 · Jiahong Bi, Lars Schütze, Jeronimo Castrillon

Sparse Matrix-Vector Multiplication (SpMV) is the cornerstone in many iterative workloads, including large-scale graph analytics and sparse iterative solvers. Accelerating SpMV on real-world graphs remains challenging due to highly…

Distributed, Parallel, and Cluster Computing · Computer Science · 2026-05-11 · Qi Zhang, Zhengan Yao, Zhenglu Jiang, Zan-Bo Zhang

The rapid growth of academic literature makes the manual creation of scientific surveys increasingly infeasible. While large language models show promise for automating this process, progress in this area is hindered by the absence of…

Computation and Language · Computer Science · 2026-05-05 · Weihang Su, Anzhe Xie, Qingyao Ai, Jianming Long, Xuanyi Chen, Jiaxin Mao, Ziyi Ye, Yiqun Liu

Microarray technology is still an important way to assess gene expression in molecular biology, mainly because it measures expression profiles for thousands of genes simultaneously, which makes this technology a good option for some studies…

Computation · Statistics · 2015-11-12 · Gustavo H. Esteves, Roberto Hirata

Intel Array Building Blocks is a high-level data-parallel programming environment designed to produce scalable and portable results on existing and upcoming multi- and many-core platforms. We have chosen several mathematical kernels - a…

Performance · Computer Science · 2012-11-08 · Volker Weinberg

Array programming provides a powerful, compact, expressive syntax for accessing, manipulating, and operating on data in vectors, matrices, and higher-dimensional arrays. NumPy is the primary array programming library for the Python…

In-memory computing is a promising alternative to traditional computer designs, as it helps overcome performance limits caused by the separation of memory and processing units. However, many current approaches struggle with unreliable…