Related papers: MIRGE: An Array-Based Computational Framework for …

MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains

Large reasoning models (LRMs) have shown significant progress in test-time scaling through chain-of-thought prompting. Current approaches like search-o1 integrate retrieval augmented generation (RAG) into multi-step reasoning processes but…

Computation and Language · Computer Science 2026-01-21 Kaiwen Wei, Rui Shan, Dongsheng Zou, Jianzhong Yang, Bi Zhao, Junnan Zhu, Jiang Zhong

Mirage: A Multi-Level Superoptimizer for Tensor Programs

We introduce Mirage, the first multi-level superoptimizer for tensor programs. A key idea in Mirage is $\mu$Graphs, a uniform representation of tensor programs at the kernel, thread block, and thread levels of the GPU compute hierarchy.…

Machine Learning · Computer Science 2025-06-09 Mengdi Wu, Xinhao Cheng, Shengyu Liu, Chunan Shi, Jianan Ji, Kit Ao, Praveen Velliengiri, Xupeng Miao, Oded Padon, Zhihao Jia

MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding

Real-time multimodal inference on resource-constrained edge devices is essential for applications such as autonomous driving, human-computer interaction, and mobile health. However, prior work often overlooks the tight coupling between…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Runxi Huang, Mingxuan Yu, Mingyu Tsoi, Xiaomin Ouyang

WgPy: GPU-accelerated NumPy-like array library for web browsers

To execute scientific computing programs such as deep learning at high speed, GPU acceleration is a powerful option. With the recent advancements in web technologies, interfaces like WebGL and WebGPU, which utilize GPUs on the client side…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-04 Masatoshi Hidaka, Tatsuya Harada

MIRAGE: An Iterative MapReduce based Frequent Subgraph Mining Algorithm

Frequent subgraph mining (FSM) is an important task for exploratory data analysis on graph data. Over the years, many algorithms have been proposed to solve this task. These algorithms assume that the data structure of the mining task is…

Databases · Computer Science 2013-07-24 Mansurul A Bhuiyan, Mohammad Al Hasan

MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation

Code-mixing is a phenomenon of mixing words and phrases from two or more languages in a single utterance of speech and text. Due to the high linguistic diversity, code-mixing presents several challenges in evaluating standard natural…

Computation and Language · Computer Science 2021-07-27 Ayush Garg, Sammed S Kagi, Vivek Srivastava, Mayank Singh

MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation

Retrieval-Augmented Generation (RAG) has gained prominence as an effective method for enhancing the generative capabilities of Large Language Models (LLMs) through the incorporation of external knowledge. However, the evaluation of RAG…

Computation and Language · Computer Science 2025-04-25 Chanhee Park, Hyeonseok Moon, Chanjun Park, Heuiseok Lim

CLEDGE: A Hybrid Cloud-Edge Computing Framework over Information Centric Networking

In today's era of Internet of Things (IoT), where massive amounts of data are produced by IoT and other devices, edge computing has emerged as a prominent paradigm for low-latency data processing. However, applications may have diverse…

Networking and Internet Architecture · Computer Science 2021-07-19 Md Washik Al Azad, Susmit Shannigrahi, Nicholas Stergiou, Francisco R. Ortega, Spyridon Mastorakis

ArrayBridge: Interweaving declarative array processing with high-performance computing

Scientists are increasingly turning to datacenter-scale computers to produce and analyze massive arrays. Despite decades of database research that extols the virtues of declarative query processing, scientists still write, debug and…

Databases · Computer Science 2017-02-28 Haoyuan Xing, Sofoklis Floratos, Spyros Blanas, Suren Byna, Prabhat, Kesheng Wu, Paul Brown

MIRAGE: Runtime Scheduling for Multi-Vector Image Retrieval with Hierarchical Decomposition

To effectively leverage user-specific data, retrieval augmented generation (RAG) is employed in multimodal large language model (MLLM) applications. However, conventional retrieval approaches often suffer from limited retrieval accuracy.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Maoliang Li, Ke Li, Yaoyang Liu, Jiayu Chen, Zihao Zheng, Yinjun Wu, Chenchen Liu, Xiang Chen

MIRAGE: Retrieval and Generation of Multimodal Images and Texts for Medical Education

Access to diverse, well-annotated medical images with interactive learning tools is fundamental for training practitioners in medicine and related fields to improve their diagnostic skills and understanding of anatomical structures. While…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Miguel Diaz Benito, Cecilia Diana Albelda, Alvaro Garcia Martin, Jesus Bescos Cano, Marcos Escudero-Vinolo, Juan C. SanMiguel

SURGE: SuperBatch Unified Resource-efficient GPU Encoding for Heterogeneous Partitioned Data

We present SURGE, a streaming GPU encoding system deployed in production to generate embeddings for over 800 million texts across 40,000 logical partitions. Production embedding pipelines face a tension between logical data partitioning and…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-05 Shashank Kapadia, Deep Narayan Mishra, Sujal Reddy Alugubelli, Ajay Kumar, Swapnil Yadav, Rishi Bhatia

The NumPy array: a structure for efficient numerical computation

In the Python world, NumPy arrays are the standard representation for numerical data. Here, we show how these arrays enable efficient implementation of numerical computations in a high-level language. Overall, three techniques are applied…

Mathematical Software · Computer Science 2011-03-14 Stefan Van Der Walt, S. Chris Colbert, Gaël Varoquaux

MING: An Automated CNN-to-Edge MLIR HLS framework

Driven by the increasing demand for low-latency and real-time processing, machine learning applications are steadily migrating toward edge computing platforms, where Field-Programmable Gate Arrays (FPGAs) are widely adopted for their energy…

Hardware Architecture · Computer Science 2026-02-13 Jiahong Bi, Lars Schütze, Jeronimo Castrillon

MERBIT: A GPU-Based SpMV Method for Iterative Workloads

Sparse Matrix-Vector Multiplication (SpMV) is the cornerstone in many iterative workloads, including large-scale graph analytics and sparse iterative solvers. Accelerating SpMV on real-world graphs remains challenging due to highly…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-11 Qi Zhang, Zhengan Yao, Zhenglu Jiang, Zan-Bo Zhang

SurGE: A Benchmark and Evaluation Framework for Scientific Survey Generation

The rapid growth of academic literature makes the manual creation of scientific surveys increasingly infeasible. While large language models show promise for automating this process, progress in this area is hindered by the absence of…

Computation and Language · Computer Science 2026-05-05 Weihang Su, Anzhe Xie, Qingyao Ai, Jianming Long, Xuanyi Chen, Jiaxin Mao, Ziyi Ye, Yiqun Liu

maigesPack: A Computational Environment for Microarray Data Analysis

Microarray technology is still an important way to assess gene expression in molecular biology, mainly because it measures expression profiles for thousands of genes simultaneously, which makes this technology a good option for some studies…

Computation · Statistics 2015-11-12 Gustavo H. Esteves, Roberto Hirata

Data-parallel programming with Intel Array Building Blocks (ArBB)

Intel Array Building Blocks is a high-level data-parallel programming environment designed to produce scalable and portable results on existing and upcoming multi- and many-core platforms. We have chosen several mathematical kernels - a…

Performance · Computer Science 2012-11-08 Volker Weinberg

Array Programming with NumPy

Array programming provides a powerful, compact, expressive syntax for accessing, manipulating, and operating on data in vectors, matrices, and higher-dimensional arrays. NumPy is the primary array programming library for the Python…

Mathematical Software · Computer Science 2020-09-22 Charles R. Harris, K. Jarrod Millman, Stéfan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fernández del Río, Mark Wiebe, Pearu Peterson, Pierre Gérard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, Travis E. Oliphant

Mixed-Mode In-Memory Computing: Towards High-Performance Logic Processing In A Memristive Crossbar Array

In-memory computing is a promising alternative to traditional computer designs, as it helps overcome performance limits caused by the separation of memory and processing units. However, many current approaches struggle with unreliable…

Applied Physics · Physics 2025-06-25 Nan Du, Ilia Polian, Christopher Bengel, Kefeng Li, Ziang Chen, Xianyue Zhao, Uwe Huebner, Li-Wei Chen, Feng Liu, Massimiliano Di Ventra, Stephan Menzel, Heidemarie Krueger