Related papers: SketchFill: Sketch-Guided Code Generation for Impu…

Machine Learning for Missing Value Imputation

In recent times, a considerable number of research studies have been carried out to address the issue of Missing Value Imputation (MVI). MVI aims to provide a primary solution for datasets that have one or more missing attribute values. The…

Machine Learning · Computer Science 2024-10-14 Abu Fuad Ahmad , Khaznah Alshammari , Istiaque Ahmed , MD Shohel Sayed

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

When answering questions about images, humans naturally point, label, and draw to explain their reasoning. In contrast, modern vision-language models (VLMs) such as Gemini-3-Pro and GPT-5 only respond with text, which can be difficult for…

Computer Vision and Pattern Recognition · Computer Science 2026-04-29 Brandon Collins , Logan Bolton , Hung Huy Nguyen , Mohammad Reza Taesiri , Trung Bui , Anh Totti Nguyen

Missing Value Imputation Based on Deep Generative Models

Missing values widely exist in many real-world datasets, which hinders the performing of advanced data analytics. Properly filling these missing values is crucial but challenging, especially when the missing rate is high. Many approaches…

Machine Learning · Computer Science 2018-08-07 Hongbao Zhang , Pengtao Xie , Eric Xing

It's Not Just Labeling -- A Research on LLM Generated Feedback Interpretability and Image Labeling Sketch Features

The quality of training data is critical to the performance of machine learning applications in domains like transportation, healthcare, and robotics. Accurate image labeling, however, often relies on time-consuming, expert-driven methods…

Human-Computer Interaction · Computer Science 2025-05-28 Baichuan Li , Larry Powell , Tracy Hammond

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Humans draw to facilitate reasoning: we draw auxiliary lines when solving geometry problems; we mark and circle when reasoning on maps; we use sketches to amplify our ideas and relieve our limited-capacity working memory. However, such…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Yushi Hu , Weijia Shi , Xingyu Fu , Dan Roth , Mari Ostendorf , Luke Zettlemoyer , Noah A Smith , Ranjay Krishna

Missing Value Imputation on Multidimensional Time Series

We present DeepMVI, a deep learning method for missing value imputation in multidimensional time-series datasets. Missing values are commonplace in decision support platforms that aggregate data over long time stretches from disparate…

Machine Learning · Computer Science 2023-06-22 Parikshit Bansal , Prathamesh Deshpande , Sunita Sarawagi

From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics

Advances in large language models (LLMs) offer new possibilities for enhancing math education by automating support for both teachers and students. While prior work has focused on generating math problems and high-quality distractors, the…

Artificial Intelligence · Computer Science 2025-03-11 Jaewook Lee , Jeongah Lee , Wanyong Feng , Andrew Lan

Prompt Sketching for Large Language Models

Many recent prompting strategies for large language models (LLMs) query the model multiple times sequentially -- first to produce intermediate results and then the final answer. However, using these methods, both decoder and model are…

Computation and Language · Computer Science 2023-11-10 Luca Beurer-Kellner , Mark Niklas Müller , Marc Fischer , Martin Vechev

SketchJudge: A Diagnostic Benchmark for Grading Hand-drawn Diagrams with Multimodal Large Language Models

While Multimodal Large Language Models (MLLMs) have achieved remarkable progress in visual understanding, they often struggle when faced with the unstructured and ambiguous nature of human-generated sketches. This limitation is particularly…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Yuhang Su , Mei Wang , Yaoyao Zhong , Guozhang Li , Shixing Li , Yihan Feng , Hua Huang

Explainable Deepfake Detection with RL Enhanced Self-Blended Images

Most prior deepfake detection methods lack explainable outputs. With the growing interest in multimodal large language models (MLLMs), researchers have started exploring their use in interpretable deepfake detection. However, a major…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Ning Jiang , Dingheng Zeng , Yanhong Liu , Haiyang Yi , Shijie Yu , Minghe Weng , Haifeng Shen , Ying Li

Sketch: A Toolkit for Streamlining LLM Operations

Large language models (LLMs) represented by GPT family have achieved remarkable success. The characteristics of LLMs lie in their ability to accommodate a wide range of tasks through a generative approach. However, the flexibility of their…

Computation and Language · Computer Science 2024-09-06 Xin Jiang , Xiang Li , Wenjia Ma , Xuezhi Fang , Yiqun Yao , Naitong Yu , Xuying Meng , Peng Han , Jing Li , Aixin Sun , Yequan Wang

CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models

The rise of large language models (LLMs) like ChatGPT has significantly improved automated code generation, enhancing software development efficiency. However, this introduces challenges in academia, particularly in distinguishing between…

Software Engineering · Computer Science 2025-01-08 Zhenyu Xu , Victor S. Sheng

Sketch-and-Verify: Structured Inference-Time Scaling via Program Sketching

SKETCHVERIFY is a within-tier cost-performance policy, not a universal accuracy improvement. The operational question: a practitioner stuck with a small, cheap code model (here, Gemini 3.1 Flash Lite) for latency, deployment, or budget…

Machine Learning · Computer Science 2026-05-12 Shan Jiang , Zijian Yi , Chenguang Zhu

SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More

Charts are high-density visual carriers of complex data and medium for information extraction and analysis. Due to the need for precise and complex visual reasoning, automated chart understanding poses a significant challenge to existing…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Muye Huang , Lingling Zhang , Yifei Li , Yaqiang Wu , Jun Liu

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Current evaluation paradigms for large language models (LLMs) represent a critical blind spot in AI research--relying on opaque numerical metrics that conceal fundamental limitations in spatial reasoning while providing no intuitive…

Computation and Language · Computer Science 2025-11-05 Liuhao Lin , Ke Li , Zihan Xu , Yuchen Shi , Yulei Qin , Yan Zhang , Xing Sun , Rongrong Ji

UltraSketchLLM: Saliency-Driven Sketching for Ultra-Low Bit LLM Compression

The rapid growth of large language models (LLMs) has outpaced the memory constraints of edge devices, necessitating extreme weight compression beyond the 1-bit limit. While quantization reduces model size, it is fundamentally limited to 1…

Machine Learning · Computer Science 2025-06-24 Sunan Zou , Ziyun Zhang , Xueting Sun , Guojie Luo

O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model

While Large Vision Language Models (LVLMs) are increasingly deployed in real-world applications, their ability to interpret abstract visual inputs remains limited. Specifically, they struggle to comprehend hand-drawn sketches, a modality…

Computer Vision and Pattern Recognition · Computer Science 2025-12-25 Rishi Gupta , Mukilan Karuppasamy , Shyam Marjit , Aditay Tripathi , Anirban Chakraborty

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

While Multimodal Large Language Models (MLLMs) excel at visual understanding, they often struggle in complex scenarios that require visual planning and imagination. Inspired by how humans use sketching as a form of visual thinking to…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Huanyu Zhang , Wenshan Wu , Chengzu Li , Ning Shang , Yan Xia , Yangyu Huang , Yifan Zhang , Li Dong , Zhang Zhang , Liang Wang , Tieniu Tan , Furu Wei

LLM-Sketch: Enhancing Network Sketches with LLM

Network stream mining is fundamental to many network operations. Sketches, as compact data structures that offer low memory overhead with bounded accuracy, have emerged as a promising solution for network stream mining. Recent studies…

Networking and Internet Architecture · Computer Science 2025-02-12 Yuanpeng Li , Zhen Xu , Zongwei Lv , Yannan Hu , Yong Cui , Tong Yang

CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images

Recent advances in Large Language Models (LLMs) and Vision Language Models (VLMs) have shown significant progress in mathematical reasoning, yet they still face a critical bottleneck with problems requiring visual assistance, such as…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Chengqi Duan , Kaiyue Sun , Rongyao Fang , Manyuan Zhang , Yan Feng , Ying Luo , Yufang Liu , Ke Wang , Peng Pei , Xunliang Cai , Hongsheng Li , Yi Ma , Xihui Liu