Related papers: V-Star: Learning Visibly Pushdown Grammars from Pr…

A Tighter Bound for the Determinization of Visibly Pushdown Automata

Visibly pushdown automata (VPA), introduced by Alur and Madhusuan in 2004, is a subclass of pushdown automata whose stack behavior is completely determined by the input symbol according to a fixed partition of the input alphabet. Since its…

Formal Languages and Automata Theory · Computer Science 2009-11-18 Nguyen Van Tang

A Grammatical Inference Approach to Language-Based Anomaly Detection in XML

False-positives are a problem in anomaly-based intrusion detection systems. To counter this issue, we discuss anomaly detection for the eXtensible Markup Language (XML) in a language-theoretic view. We argue that many XML-based attacks…

Cryptography and Security · Computer Science 2013-11-13 Harald Lampesberger

Passive Model Learning of Visibly Deterministic Context-free Grammars

We present PAPNI, a passive automata learning algorithm capable of learning deterministic context-free grammars, which are modeled with visibly deterministic pushdown automata. PAPNI is a generalization of RPNI, a passive automata learning…

Formal Languages and Automata Theory · Computer Science 2025-08-25 Edi Muškardin , Tamim Burgstaller

Inferring Input Grammars from Dynamic Control Flow

A program is characterized by its input model, and a formal input model can be of use in diverse areas including vulnerability analysis, reverse engineering, fuzzing and software testing, clone detection and refactoring. Unfortunately,…

Software Engineering · Computer Science 2019-12-13 Rahul Gopinath , Björn Mathis , Andreas Zeller

The Neural Network Pushdown Automaton: Model, Stack and Learning Simulations

In order for neural networks to learn complex languages or grammars, they must have sufficient computational power or resources to recognize or generate such languages. Though many approaches have been discussed, one ob- vious approach to…

Artificial Intelligence · Computer Science 2017-11-17 G. Z. Sun , C. L. Giles , H. H. Chen , Y. C. Lee

Active Learning of Input Grammars

Knowing the precise format of a program's input is a necessary prerequisite for systematic testing. Given a program and a small set of sample inputs, we (1) track the data flow of inputs to aggregate input fragments that share the same data…

Programming Languages · Computer Science 2017-08-30 Matthias Höschele , Alexander Kampmann , Andreas Zeller

Reliable Natural Language Understanding with Large Language Models and Answer Set Programming

Humans understand language by extracting information (meaning) from sentences, combining it with existing commonsense knowledge, and then performing reasoning to draw conclusions. While large language models (LLMs) such as GPT-3 and ChatGPT…

Computation and Language · Computer Science 2023-08-31 Abhiramon Rajasekharan , Yankai Zeng , Parth Padalkar , Gopal Gupta

V-STaR: Training Verifiers for Self-Taught Reasoners

Common self-improvement approaches for large language models (LLMs), such as STaR, iteratively fine-tune LLMs on self-generated solutions to improve their problem-solving ability. However, these approaches discard the large amounts of…

Machine Learning · Computer Science 2024-08-15 Arian Hosseini , Xingdi Yuan , Nikolay Malkin , Aaron Courville , Alessandro Sordoni , Rishabh Agarwal

Differentiate-and-Inject: Enhancing VLAs via Functional Differentiation Induced by In-Parameter Structural Reasoning

As robots are expected to perform increasingly diverse tasks, they must understand not only low-level actions but also the higher-level structure that determines how a task should unfold. Existing vision-language-action (VLA) models…

Robotics · Computer Science 2026-02-10 Jingyi Hou , Leyu Zhou , Chenchen Jing , Jinghan Yang , Xinbo Yu , Wei He

Learning Language Structures through Grounding

Language is highly structured, with syntactic and semantic structures, to some extent, agreed upon by speakers of the same language. With implicit or explicit awareness of such structures, humans can learn and use language efficiently and…

Computation and Language · Computer Science 2024-10-23 Freda Shi

Learning Active Perception via Self-Evolving Preference Optimization for GUI Grounding

Vision Language Models (VLMs) have recently achieved significant progress in bridging visual perception and linguistic reasoning. Recently, OpenAI o3 model introduced a zoom-in search strategy that effectively elicits active perception…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Wanfu Wang , Qipeng Huang , Guangquan Xue , Xiaobo Liang , Juntao Li

Inferring Attributed Grammars from Parser Implementations

Software systems that process structured inputs often lack complete and up-to-date specifications, which specify the input syntax and the semantics of input processing. While grammar mining techniques have focused on recovering syntactic…

Software Engineering · Computer Science 2025-07-18 Andreas Pointner , Josef Pichler , Herbert Prähofer

Leveraging Vision-Language Models as Weak Annotators in Active Learning

Active learning aims to reduce annotation cost by selectively querying informative samples for supervision under a limited labeling budget. In this work, we investigate how vision-language models (VLMs) can be leveraged to further reduce…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Phuong Ngoc Nguyen , Kaito Shiku , Ryoma Bise , Seiichi Uchida , Shinnosuke Matsuo

Pseudocode-Guided Structured Reasoning for Automating Reliable Inference in Vision-Language Models

Vision-Language Models (VLMs) are becoming the cornerstone of high-level reasoning for robotic automation, enabling robots to parse natural language commands and perceive their environments. However, their susceptibility to hallucinations…

Artificial Intelligence · Computer Science 2026-05-20 Weicong Ni , Tianbao Jiang , Linlin Wang

Learning Differentiable Programs with Admissible Neural Heuristics

We study the problem of learning differentiable functions expressed as programs in a domain-specific language. Such programmatic models can offer benefits such as composability and interpretability; however, learning them requires…

Machine Learning · Computer Science 2021-03-30 Ameesh Shah , Eric Zhan , Jennifer J. Sun , Abhinav Verma , Yisong Yue , Swarat Chaudhuri

Weakly Supervised Attention Learning for Textual Phrases Grounding

Grounding textual phrases in visual content is a meaningful yet challenging problem with various potential applications such as image-text inference or text-driven multimedia interaction. Most of the current existing methods adopt the…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Zhiyuan Fang , Shu Kong , Tianshu Yu , Yezhou Yang

AStar: Boosting Multimodal Reasoning with Automated Structured Thinking

Multimodal large language models excel across diverse domains but struggle with complex visual reasoning tasks. To enhance their reasoning capabilities, current approaches typically rely on explicit search or post-training techniques.…

Computation and Language · Computer Science 2026-03-03 Jinyang Wu , Mingkuan Feng , Guocheng Zhai , Shuai Zhang , Zheng Lian , Fangrui Lv , Pengpeng Shao , Ruihan Jin , Zhengqi Wen , Jianhua Tao

A Derivative-based Parser Generator for Visibly Pushdown Grammars

In this paper, we present a derivative-based, functional recognizer and parser generator for visibly pushdown grammars. The generated parser accepts ambiguous grammars and produces a parse forest containing all valid parse trees for an…

Programming Languages · Computer Science 2021-09-13 Xiaodong Jia , Ashish Kumar , Gang Tan

Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection

Inspired by the success of vision-language methods (VLMs) in zero-shot classification, recent works attempt to extend this line of work into object detection by leveraging the localization ability of pre-trained VLMs and generating pseudo…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Yanxin Long , Jianhua Han , Runhui Huang , Xu Hang , Yi Zhu , Chunjing Xu , Xiaodan Liang

Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools

Multimodal large language models (MLLMs) have demonstrated remarkable potential in bridging visual and textual reasoning, yet their reliance on text-centric priors often limits their ability to disentangle semantically similar actions in…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Zhenlong Yuan , Xiangyan Qu , Chengxuan Qian , Rui Chen , Jing Tang , Lei Sun , Xiangxiang Chu , Dapeng Zhang , Yiwei Wang , Yujun Cai , Shuo Li