Computation and Language · Computer Science
MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents
Haoran Tan, Zeyu Zhang, Chen Ma, Xu Chen +2
2025-06-30
Machine Learning · Computer Science
LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning
Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann +7
2024-03-05
Software Engineering · Computer Science
RT-Bench: an Extensible Benchmark Framework for the Analysis and Management of Real-Time Applications
Mattia Nicolella, Shahin Roozkhosh, Denis Hoornaert, Andrea Bastoni +1
2022-08-02
Hardware Architecture · Computer Science
Heterogeneous Memory Benchmarking Toolkit
Golsana Ghaemi, Gabriel Franco, Kazem Taram, Renato Mancuso
2025-07-08
Quantum Physics · Physics
Measuring what matters: A scalable framework for application-level quantum benchmarking
Willie Aboumrad, Claudio Girotto, Joshua Goings, Luning Zhao +15
2026-04-14
Machine Learning · Computer Science
MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems
Qingyao Ai, Yichen Tang, Changyue Wang, Jianming Long +2
2026-05-12
Hardware Architecture · Computer Science
A Mess of Memory System Benchmarking, Simulation and Application Profiling
Pouya Esmaili-Dokht, Francesco Sgherzi, Valeria Soldera Girelli, Isaac Boixaderas +14
2024-12-10
Quantum Physics · Physics
Application-Oriented Performance Benchmarks for Quantum Computing
Thomas Lubinski, Sonika Johri, Paul Varosy, Jeremiah Coleman +5
2025-04-15
Distributed, Parallel, and Cluster Computing · Computer Science
SProBench: Stream Processing Benchmark for High Performance Computing Infrastructure
Apurv Deepak Kulkarni, Siavash Ghiasvand
2025-04-04
Distributed, Parallel, and Cluster Computing · Computer Science
ConsumerBench: Benchmarking Generative AI Applications on End-User Devices
Yile Gu, Rohan Kadekodi, Hoang Nguyen, Keisuke Kamahori +2
2025-06-24
Artificial Intelligence · Computer Science
NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems
Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar +96
2025-01-16
Computer Vision and Pattern Recognition · Computer Science
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen, Giorgos Tolias
2024-08-07
Artificial Intelligence · Computer Science
ImplicitMemBench: Measuring Unconscious Behavioral Adaptation in Large Language Models
Chonghan Qin, Xiachong Feng, Weitao Ma, Xiaocheng Feng +1
2026-04-16
Distributed, Parallel, and Cluster Computing · Computer Science
ScALPEL: A Scalable Adaptive Lightweight Performance Evaluation Library for application performance monitoring
Hari K. Pyla, Bharath Ramesh, Calvin J. Ribbens, Srinidhi Varadarajan
2009-03-03
Computation and Language · Computer Science
LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking
Fahim Dalvi, Maram Hasanain, Sabri Boughorbel, Basel Mousi +9
2024-02-27
Robotics · Computer Science
RMBench: Memory-Dependent Robotic Manipulation Benchmark with Insights into Policy Design
Tianxing Chen, Yuran Wang, Mingleyang Li, Yan Qin +13
2026-03-17
Artificial Intelligence · Computer Science
MedMemoryBench: Benchmarking Agent Memory in Personalized Healthcare
Yihao Wang, Haoran Xu, Renjie Gu, Yixuan Ye +9
2026-05-13
Artificial Intelligence · Computer Science
VehicleMemBench: An Executable Benchmark for Multi-User Long-Term Memory in In-Vehicle Agents
Yuhao Chen, Yi Xu, Xinyun Ding, Xiang Fang +6
2026-03-26
Computation and Language · Computer Science
Minerva: A Programmable Memory Test Benchmark for Language Models
Menglin Xia, Victor Ruehle, Saravan Rajmohan, Reza Shokri
2025-06-10
Machine Learning · Computer Science
PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis
Yan Wu, Esther Wershof, Sebastian M Schmon, Marcel Nassar +6
2025-10-28
Computation and Language · Computer Science
EvoMemBench: Benchmarking Agent Memory from a Self-Evolving Perspective
Yuyao Wang, Zhongjian Zhang, Mo Chi, Kaichi Yu +6
2026-05-19