Related papers: Pattern-based Subterm Selection in Isabelle

A Formulation of the Simple Theory of Types (for Isabelle)

Simple type theory is formulated for use with the generic theorem prover Isabelle. This requires explicit type inference rules. There are function, product, and subset types, which may be empty. Descriptions (the eta-operator) introduce the…

Logic in Computer Science · Computer Science 2008-02-03 Lawrence C. Paulson

Constrained Training of Neural Networks via Theorem Proving

We introduce a theorem proving approach to the specification and generation of temporal logical constraints for training neural networks. We formalise a deep embedding of linear temporal logic over finite traces (LTL$_f$) and an associated…

Artificial Intelligence · Computer Science 2022-07-11 Mark Chevallier , Matthew Whyte , Jacques D. Fleuriot

Improving Statistical Language Model Performance with Automatically Generated Word Hierarchies

An automatic word classification system has been designed which processes word unigram and bigram frequency statistics extracted from a corpus of natural language utterances. The system implements a binary top-down form of word clustering…

cmp-lg · Computer Science 2016-08-31 John McMahon , F. J. Smith

Unsupervised Spoken Term Discovery on Untranscribed Speech

(Part of the abstract) In this thesis, we investigate the use of unsupervised spoken term discovery in tackling this problem. Unsupervised spoken term discovery aims to discover topic-related terminologies in a speech without knowing the…

Audio and Speech Processing · Electrical Eng. & Systems 2020-12-01 Man-Ling Sung

The Isabelle ENIGMA

We significantly improve the performance of the E automated theorem prover on the Isabelle Sledgehammer problems by combining learning and theorem proving in several ways. In particular, we develop targeted versions of the ENIGMA guidance…

Artificial Intelligence · Computer Science 2022-05-05 Zarathustra A. Goertzel , Jan Jakubův , Cezary Kaliszyk , Miroslav Olšák , Jelle Piepenbrock , Josef Urban

Patterns versus Characters in Subword-aware Neural Language Modeling

Words in some natural languages can have a composite structure. Elements of this structure include the root (that could also be composite), prefixes and suffixes with which various nuances and relations to other words can be expressed.…

Computation and Language · Computer Science 2017-09-05 Rustem Takhanov , Zhenisbek Assylbekov

Pattern Based Term Extraction Using ACABIT System

In this paper, we propose a pattern-based term extraction approach for Japanese, applying ACABIT system originally developed for French. The proposed approach evaluates termhood using morphological patterns of basic terms and term variants.…

Computation and Language · Computer Science 2009-07-16 Koichi Takeuchi , Kyo Kageura , Teruo Koyama , Béatrice Daille , Laurent Romary

Isabelle: The Next 700 Theorem Provers

Isabelle is a generic theorem prover, designed for interactive reasoning in a variety of formal theories. At present it provides useful proof procedures for Constructive Type Theory, various first-order logics, Zermelo-Fraenkel set theory,…

Logic in Computer Science · Computer Science 2008-02-03 Lawrence C. Paulson

Pattern Sampling for Shapelet-based Time Series Classification

Subsequence-based time series classification algorithms provide accurate and interpretable models, but training these models is extremely computation intensive. The asymptotic time complexity of subsequence-based algorithms remains a…

Machine Learning · Computer Science 2021-02-18 Atif Raza , Stefan Kramer

Term Rewriting Based On Set Automaton Matching

In this article we investigate how a subterm pattern matching algorithm can be exploited to implement efficient term rewriting procedures. From the left-hand sides of the rewrite system we construct a set automaton, which can be used to…

Computation and Language · Computer Science 2023-04-07 Mark Bouwman , Rick Erkens

Foundation Posteriors for Approximate Probabilistic Inference

Probabilistic programs provide an expressive representation language for generative models. Given a probabilistic program, we are interested in the task of posterior inference: estimating a latent variable given a set of observed variables.…

Machine Learning · Computer Science 2022-09-01 Mike Wu , Noah Goodman

Syntactic Control of Language Models by Posterior Inference

Controlling the syntactic structure of text generated by language models is valuable for applications requiring clarity, stylistic consistency, or interpretability, yet it remains a challenging task. In this paper, we argue that sampling…

Computation and Language · Computer Science 2025-06-10 Vicky Xefteri , Tim Vieira , Ryan Cotterell , Afra Amini

Experiments in Linear Template Combination using Genetic Algorithms

Natural Language Generation systems typically have two parts - strategic ('what to say') and tactical ('how to say'). We present our experiments in building an unsupervised corpus-driven template based tactical NLG system. We consider…

Computation and Language · Computer Science 2016-05-25 Nikhilesh Bhatnagar , Radhika Mamidi

Automatic Extraction of Subcategorization from Corpora

We describe a novel technique and implemented system for constructing a subcategorization dictionary from textual corpora. Each dictionary entry encodes the relative frequency of occurrence of a comprehensive set of subcategorization…

cmp-lg · Computer Science 2016-08-31 Ted Briscoe , John Carroll

Generating Ontologies from Templates: A Rule-Based Approach for Capturing Regularity

We present a second-order language that can be used to succinctly specify ontologies in a consistent and transparent manner. This language is based on ontology templates (OTTR), a framework for capturing recurring patterns of axioms in…

Artificial Intelligence · Computer Science 2018-09-28 Henrik Forssell , Christian Kindermann , Daniel P. Lupp , Uli Sattler , Evgenij Thorstensen

Extending Shinohara's Algorithm for Computing Descriptive (Angluin-Style) Patterns to Subsequence Patterns

The introduction of pattern languages in the seminal work [Angluin, ``Finding Patterns Common to a Set of Strings'', JCSS 1980] has revived the classical model of inductive inference (learning in the limit, gold-style learning). In…

Formal Languages and Automata Theory · Computer Science 2022-06-29 Markus L. Schmid

XSTEM: An exemplar-based stemming algorithm

Stemming is the process of reducing related words to a standard form by removing affixes from them. Existing algorithms vary with respect to their complexity, configurability, handling of unknown words, and ability to avoid under- and…

Computation and Language · Computer Science 2024-06-04 Kirk Baker

SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval

Term-based sparse representations dominate the first-stage text retrieval in industrial applications, due to its advantage in efficiency, interpretability, and exact term matching. In this paper, we study the problem of transferring the…

Information Retrieval · Computer Science 2020-10-05 Yang Bai , Xiaoguang Li , Gang Wang , Chaoliang Zhang , Lifeng Shang , Jun Xu , Zhaowei Wang , Fangshan Wang , Qun Liu

Importance Sampling of Word Patterns in DNA and Protein Sequences

Monte Carlo methods can provide accurate p-value estimates of word counting test statistics and are easy to implement. They are especially attractive when an asymptotic theory is absent or when either the search sequence or the word pattern…

Applications · Statistics 2008-12-01 Hock Peng Chan , Nancy R. Zhang , Louis H. Y. Chen

Information Extraction Using the Structured Language Model

The paper presents a data-driven approach to information extraction (viewed as template filling) using the structured language model (SLM) as a statistical parser. The task of template filling is cast as constrained parsing using the SLM.…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba , Milind Mahajan