Related papers: TACO: Topics in Algorithmic COde generation datase…

COFO: COdeFOrces dataset for Program Classification, Recognition and Tagging

In recent years, a lot of technological advances in computer science have aided software programmers to create innovative and real-time user-friendly software. With the creation of the software and the urging interest of people to learn to…

Software Engineering · Computer Science 2025-03-25 Kuldeep Gautam , S. VenkataKeerthy , Ramakrishna Upadrasta

TAO: A Large-Scale Benchmark for Tracking Any Object

For many years, multi-object tracking benchmarks have focused on a handful of categories. Motivated primarily by surveillance and self-driving applications, these datasets provide tracks for people, vehicles, and animals, ignoring the vast…

Computer Vision and Pattern Recognition · Computer Science 2020-05-22 Achal Dave , Tarasha Khurana , Pavel Tokmakov , Cordelia Schmid , Deva Ramanan

DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation

Data analysis is a crucial analytical process to generate in-depth studies and conclusive insights to comprehensively answer a given user query for tabular data. In this work, we aim to propose new resources and benchmarks to inspire future…

Computation and Language · Computer Science 2024-10-30 Xueqing Wu , Rui Zheng , Jingzhen Sha , Te-Lin Wu , Hanyu Zhou , Mohan Tang , Kai-Wei Chang , Nanyun Peng , Haoran Huang

TACO: Trash Annotations in Context for Litter Detection

TACO is an open image dataset for litter detection and segmentation, which is growing through crowdsourcing. Firstly, this paper describes this dataset and the tools developed to support it. Secondly, we report instance segmentation…

Computer Vision and Pattern Recognition · Computer Science 2020-03-18 Pedro F Proença , Pedro Simões

TaCo: A Benchmark for Lossless and Lossy Codecs of Heterogeneous Tactile Data

Tactile sensing is crucial for embodied intelligence, providing fine-grained perception and control in complex environments. However, efficient tactile data compression, which is essential for real-time robotic applications under strict…

Robotics · Computer Science 2026-02-11 Zhengxue Cheng , Yan Zhao , Keyu Wang , Hengdi Zhang , Li Song

MMCode: Benchmarking Multimodal Large Language Models for Code Generation with Visually Rich Programming Problems

Programming often involves converting detailed and complex specifications into code, a process during which developers typically utilize visual aids to more effectively convey concepts. While recent developments in Large Multimodal Models…

Computation and Language · Computer Science 2024-09-27 Kaixin Li , Yuchen Tian , Qisheng Hu , Ziyang Luo , Zhiyong Huang , Jing Ma

TACO: A Toolsuite for the Verification of Threshold Automata

We present TACO, a toolsuite for the development and automatic verification of fault-tolerant and threshold-based distributed algorithms. Our toolsuite implements three approaches for model checking threshold automata in different decidable…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-08 Paul Eichler , Tom Baumeister , Mouhammad Sakr , Mahboubeh Kalateh Dowlati , Marcus Völp , Swen Jacobs

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

We introduce LeetCodeDataset, a high-quality benchmark for evaluating and training code-generation models, addressing two key challenges in LLM research: the lack of reasoning-focused coding benchmarks and self-contained training testbeds.…

Machine Learning · Computer Science 2025-04-22 Yunhui Xia , Wei Shen , Yan Wang , Jason Klein Liu , Huifeng Sun , Siyue Wu , Jian Hu , Xiaolong Xu

tasksource: A Dataset Harmonization Framework for Streamlined NLP Multi-Task Learning and Evaluation

The HuggingFace Datasets Hub hosts thousands of datasets, offering exciting opportunities for language model training and evaluation. However, datasets for a specific task type often have different schemas, making harmonization challenging.…

Computation and Language · Computer Science 2023-05-17 Damien Sileo

Competition-Level Code Generation with AlphaCode

Programming is a powerful and ubiquitous problem-solving tool. Developing systems that can assist programmers or even generate programs independently could make programming more productive and accessible, yet so far incorporating…

Programming Languages · Computer Science 2023-01-11 Yujia Li , David Choi , Junyoung Chung , Nate Kushman , Julian Schrittwieser , Rémi Leblond , Tom Eccles , James Keeling , Felix Gimeno , Agustin Dal Lago , Thomas Hubert , Peter Choy , Cyprien de Masson d'Autume , Igor Babuschkin , Xinyun Chen , Po-Sen Huang , Johannes Welbl , Sven Gowal , Alexey Cherepanov , James Molloy , Daniel J. Mankowitz , Esme Sutherland Robson , Pushmeet Kohli , Nando de Freitas , Koray Kavukcuoglu , Oriol Vinyals

Learning based Methods for Code Runtime Complexity Prediction

Predicting the runtime complexity of a programming code is an arduous task. In fact, even for humans, it requires a subtle analysis and comprehensive knowledge of algorithms to predict time complexity with high fidelity, given any code. As…

Machine Learning · Computer Science 2019-11-05 Jagriti Sikka , Kushal Satya , Yaman Kumar , Shagun Uppal , Rajiv Ratn Shah , Roger Zimmermann

TOC-UCO: a comprehensive repository of tabular ordinal classification datasets

An ordinal classification (OC) problem corresponds to a special type of classification characterised by the presence of a natural order relationship among the classes. This type of problem can be found in a number of real-world…

Machine Learning · Computer Science 2025-07-25 Rafael Ayllón-Gavilán , David Guijo-Rubio , Antonio Manuel Gómez-Orellana , Francisco Bérchez-Moreno , Víctor Manuel Vargas-Yun , Pedro A. Gutiérrez

BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?

We introduce BigO(Bench), a novel coding benchmark designed to evaluate the capabilities of generative language models in understanding and generating code with specified time and space complexities. This benchmark addresses the gap in…

Computation and Language · Computer Science 2025-03-21 Pierre Chambon , Baptiste Roziere , Benoit Sagot , Gabriel Synnaeve

Generative AI for Quantum Circuits and Quantum Code: A Technical Review and Taxonomy

We review thirteen generative systems and five supporting datasets for quantum circuit and quantum code generation, identified through a structured scoping review of Hugging Face, arXiv, and provenance tracing (January-February 2026). We…

Computational Engineering, Finance, and Science · Computer Science 2026-03-18 Juhani Merilehto

PACO: Parts and Attributes of Common Objects

Object models are gradually progressing from predicting just category labels to providing detailed descriptions of object instances. This motivates the need for large datasets which go beyond traditional object masks and provide richer…

Computer Vision and Pattern Recognition · Computer Science 2023-01-06 Vignesh Ramanathan , Anmol Kalia , Vladan Petrovic , Yi Wen , Baixue Zheng , Baishan Guo , Rui Wang , Aaron Marquez , Rama Kovvuri , Abhishek Kadian , Amir Mousavi , Yiwen Song , Abhimanyu Dubey , Dhruv Mahajan

TACIT Benchmark: A Programmatic Visual Reasoning Benchmark for Generative and Discriminative Models

Existing visual reasoning benchmarks predominantly rely on natural language prompts, evaluate narrow reasoning modalities, or depend on subjective scoring procedures such as LLM-as-judge. We introduce the TACIT Benchmark, a programmatic…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Daniel Nobrega Medeiros

Using a thousand optimization tasks to learn hyperparameter search strategies

We present TaskSet, a dataset of tasks for use in training and evaluating optimizers. TaskSet is unique in its size and diversity, containing over a thousand tasks ranging from image classification with fully connected or convolutional…

Machine Learning · Computer Science 2020-04-02 Luke Metz , Niru Maheswaranathan , Ruoxi Sun , C. Daniel Freeman , Ben Poole , Jascha Sohl-Dickstein

Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces

Software is used in critical applications in our day-to-day life and it is important to ensure its correctness. One popular approach to assess correctness is to evaluate software on tests. If a test fails, it indicates a fault in the…

Software Engineering · Computer Science 2025-04-01 Max Hort , Leon Moonen

Complementary datasets to COCO for object detection

For nearly a decade, the COCO dataset has been the central test bed of research in object detection. According to the recent benchmarks, however, it seems that performance on this dataset has started to saturate. One possible reason can be…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Ali Borji

SecureCode: A Production-Grade Multi-Turn Dataset for Training Security-Aware Code Generation Models

AI coding assistants produce vulnerable code in 45\% of security-relevant scenarios~\cite{veracode2025}, yet no public training dataset teaches both traditional web security and AI/ML-specific defenses in a format suitable for instruction…

Cryptography and Security · Computer Science 2026-02-12 Scott Thornton