Related papers: TACO: Topics in Algorithmic COde generation datase…

200 papers

In recent years, a lot of technological advances in computer science have aided software programmers to create innovative and real-time user-friendly software. With the creation of the software and the urging interest of people to learn to…

Software Engineering · Computer Science 2025-03-25 Kuldeep Gautam, S. VenkataKeerthy, Ramakrishna Upadrasta

For many years, multi-object tracking benchmarks have focused on a handful of categories. Motivated primarily by surveillance and self-driving applications, these datasets provide tracks for people, vehicles, and animals, ignoring the vast…

Computer Vision and Pattern Recognition · Computer Science 2020-05-22 Achal Dave, Tarasha Khurana, Pavel Tokmakov, Cordelia Schmid, Deva Ramanan

Data analysis is a crucial analytical process to generate in-depth studies and conclusive insights to comprehensively answer a given user query for tabular data. In this work, we aim to propose new resources and benchmarks to inspire future…

Computation and Language · Computer Science 2024-10-30 Xueqing Wu, Rui Zheng, Jingzhen Sha, Te-Lin Wu, Hanyu Zhou, Mohan Tang, Kai-Wei Chang, Nanyun Peng, Haoran Huang

TACO is an open image dataset for litter detection and segmentation, which is growing through crowdsourcing. Firstly, this paper describes this dataset and the tools developed to support it. Secondly, we report instance segmentation…

Computer Vision and Pattern Recognition · Computer Science 2020-03-18 Pedro F Proença, Pedro Simões

Tactile sensing is crucial for embodied intelligence, providing fine-grained perception and control in complex environments. However, efficient tactile data compression, which is essential for real-time robotic applications under strict…

Robotics · Computer Science 2026-02-11 Zhengxue Cheng, Yan Zhao, Keyu Wang, Hengdi Zhang, Li Song

Programming often involves converting detailed and complex specifications into code, a process during which developers typically utilize visual aids to more effectively convey concepts. While recent developments in Large Multimodal Models…

Computation and Language · Computer Science 2024-09-27 Kaixin Li, Yuchen Tian, Qisheng Hu, Ziyang Luo, Zhiyong Huang, Jing Ma

We present TACO, a toolsuite for the development and automatic verification of fault-tolerant and threshold-based distributed algorithms. Our toolsuite implements three approaches for model checking threshold automata in different decidable…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-08 Paul Eichler, Tom Baumeister, Mouhammad Sakr, Mahboubeh Kalateh Dowlati, Marcus Völp, Swen Jacobs

We introduce LeetCodeDataset, a high-quality benchmark for evaluating and training code-generation models, addressing two key challenges in LLM research: the lack of reasoning-focused coding benchmarks and self-contained training testbeds.…

Machine Learning · Computer Science 2025-04-22 Yunhui Xia, Wei Shen, Yan Wang, Jason Klein Liu, Huifeng Sun, Siyue Wu, Jian Hu, Xiaolong Xu

The HuggingFace Datasets Hub hosts thousands of datasets, offering exciting opportunities for language model training and evaluation. However, datasets for a specific task type often have different schemas, making harmonization challenging.…

Computation and Language · Computer Science 2023-05-17 Damien Sileo

Programming is a powerful and ubiquitous problem-solving tool. Developing systems that can assist programmers or even generate programs independently could make programming more productive and accessible, yet so far incorporating…

Predicting the runtime complexity of a programming code is an arduous task. In fact, even for humans, it requires a subtle analysis and comprehensive knowledge of algorithms to predict time complexity with high fidelity, given any code. As…

Machine Learning · Computer Science 2019-11-05 Jagriti Sikka, Kushal Satya, Yaman Kumar, Shagun Uppal, Rajiv Ratn Shah, Roger Zimmermann

An ordinal classification (OC) problem corresponds to a special type of classification characterised by the presence of a natural order relationship among the classes. This type of problem can be found in a number of real-world…

We introduce BigO(Bench), a novel coding benchmark designed to evaluate the capabilities of generative language models in understanding and generating code with specified time and space complexities. This benchmark addresses the gap in…

Computation and Language · Computer Science 2025-03-21 Pierre Chambon, Baptiste Roziere, Benoit Sagot, Gabriel Synnaeve

We review thirteen generative systems and five supporting datasets for quantum circuit and quantum code generation, identified through a structured scoping review of Hugging Face, arXiv, and provenance tracing (January-February 2026). We…

Computational Engineering, Finance, and Science · Computer Science 2026-03-18 Juhani Merilehto

Object models are gradually progressing from predicting just category labels to providing detailed descriptions of object instances. This motivates the need for large datasets which go beyond traditional object masks and provide richer…

Existing visual reasoning benchmarks predominantly rely on natural language prompts, evaluate narrow reasoning modalities, or depend on subjective scoring procedures such as LLM-as-judge. We introduce the TACIT Benchmark, a programmatic…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Daniel Nobrega Medeiros

We present TaskSet, a dataset of tasks for use in training and evaluating optimizers. TaskSet is unique in its size and diversity, containing over a thousand tasks ranging from image classification with fully connected or convolutional…

Machine Learning · Computer Science 2020-04-02 Luke Metz, Niru Maheswaranathan, Ruoxi Sun, C. Daniel Freeman, Ben Poole, Jascha Sohl-Dickstein

Software is used in critical applications in our day-to-day life and it is important to ensure its correctness. One popular approach to assess correctness is to evaluate software on tests. If a test fails, it indicates a fault in the…

Software Engineering · Computer Science 2025-04-01 Max Hort, Leon Moonen

For nearly a decade, the COCO dataset has been the central test bed of research in object detection. According to the recent benchmarks, however, it seems that performance on this dataset has started to saturate. One possible reason can be…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Ali Borji

AI coding assistants produce vulnerable code in 45% of security-relevant scenarios [veracode2025], yet no public training dataset teaches both traditional web security and AI/ML-specific defenses in a format suitable for instruction…

Cryptography and Security · Computer Science 2026-02-12 Scott Thornton