Related papers: Bootstrapping Lexical Choice via Multiple-Sequence…

Ambiguity in the Acquisition of Lexical Information

This paper describes an approach to the automatic identification of lexical information in on-line dictionaries. This approach uses bootstrapping techniques, specifically so that ambiguity in the dictionary text can be treated properly.…

cmp-lg · Computer Science 2008-02-03 Lucy Vanderwende

Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment

We address the text-to-text generation problem of sentence-level paraphrasing -- a phenomenon distinct from and more difficult than word- or phrase-level paraphrasing. Our approach applies multiple-sequence alignment to sentences gathered…

Computation and Language · Computer Science 2007-05-23 Regina Barzilay , Lillian Lee

Automating Knowledge Acquisition for Content-Centric Cognitive Agents Using LLMs

The paper describes a system that uses large language model (LLM) technology to support the automatic learning of new entries in an intelligent agent's semantic lexicon. The process is bootstrapped by an existing non-toy lexicon and a…

Computation and Language · Computer Science 2023-12-29 Sanjay Oruganti , Sergei Nirenburg , Jesse English , Marjorie McShane

Multi-domain machine translation enhancements by parallel data extraction from comparable corpora

Parallel texts are a relatively rare language resource, however, they constitute a very useful research material with a wide range of applications. This study presents and analyses new methodologies we developed for obtaining such data from…

Computation and Language · Computer Science 2016-03-23 Krzysztof Wołk , Emilia Rejmund , Krzysztof Marasek

Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval

Contrastive learning has been successfully used for retrieval of semantically aligned sentences, but it often requires large batch sizes or careful engineering to work well. In this paper, we instead propose a generative model for learning…

Computation and Language · Computer Science 2023-06-06 John Wieting , Jonathan H. Clark , William W. Cohen , Graham Neubig , Taylor Berg-Kirkpatrick

Collocational bootstrapping: A hypothesis about the learning of subject-verb agreement in humans and neural networks

In what ways might statistical signals in linguistic input assist with the acquisition of syntax? Here we hypothesize a mechanism called collocational bootstrapping, in which regularities in word co-occurrence patterns can provide cues to…

Computation and Language · Computer Science 2026-05-21 Claire Hobbs , R. Thomas McCoy

Decision Making: Lexicographical Procedure

It is introduced the using of generation lexicographical procedure for multicriteria decision-making problems.

Optimization and Control · Mathematics 2010-03-03 V. Zhukovin , N. Chkhikvadze , Z. Alimbarashvili

Explicit Syntactic Guidance for Neural Text Generation

Most existing text generation models follow the sequence-to-sequence paradigm. Generative Grammar suggests that humans generate natural language texts by learning language grammar. We propose a syntax-guided generation schema, which…

Computation and Language · Computer Science 2023-06-27 Yafu Li , Leyang Cui , Jianhao Yan , Yongjing Yin , Wei Bi , Shuming Shi , Yue Zhang

Neural Composition: Learning to Generate from Multiple Models

Decomposing models into multiple components is critically important in many applications such as language modeling (LM) as it enables adapting individual components separately and biasing of some components to the user's personal…

Computation and Language · Computer Science 2020-11-11 Denis Filimonov , Ravi Teja Gadde , Ariya Rastrow

KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing

Recent advances in retrieval-augmented generation (RAG) furnish large language models (LLMs) with iterative retrievals of relevant information to handle complex multi-hop questions. These methods typically alternate between LLM reasoning…

Computation and Language · Computer Science 2025-05-27 Rui Li , Quanyu Dai , Zeyu Zhang , Xu Chen , Zhenhua Dong , Ji-Rong Wen

Bootstrapping Syntax and Recursion using Alignment-Based Learning

This paper introduces a new type of unsupervised learning algorithm, based on the alignment of sentences and Harris's (1951) notion of interchangeability. The algorithm is applied to an untagged, unstructured corpus of natural language…

Machine Learning · Computer Science 2009-09-25 Menno van Zaanen

A New Approach for Semi-automatic Building and Extending a Multilingual Terminology Thesaurus

This paper describes a new system for semi-automatically building, extending and managing a terminological thesaurus---a multilingual terminology dictionary enriched with relationships between the terms themselves to form a thesaurus. The…

Computation and Language · Computer Science 2019-04-09 Adam Rambousek , Ales Horak , Vit Suchomel , Vit Baisa

Bootstrapping Deep Lexical Resources: Resources for Courses

We propose a range of deep lexical acquisition methods which make use of morphological, syntactic and ontological language resources to model word similarity and bootstrap from a seed lexicon. The different methods are deployed in learning…

Computation and Language · Computer Science 2007-09-18 Timothy Baldwin

Natural Language Deduction with Incomplete Information

A growing body of work studies how to answer a question or verify a claim by generating a natural language "proof": a chain of deductive inferences yielding the answer based on a set of premises. However, these methods can only make sound…

Computation and Language · Computer Science 2022-11-02 Zayne Sprague , Kaj Bostrom , Swarat Chaudhuri , Greg Durrett

Mapping Multilingual Hierarchies Using Relaxation Labeling

This paper explores the automatic construction of a multilingual Lexical Knowledge Base from pre-existing lexical resources. We present a new and robust approach for linking already existing lexical/semantic hierarchies. We used a…

Computation and Language · Computer Science 2007-05-23 J. Daude , L. Padro , G. Rigau

Bootstrapping Structure into Language: Alignment-Based Learning

This thesis introduces a new unsupervised learning framework, called Alignment-Based Learning, which is based on the alignment of sentences and Harris's (1951) notion of substitutability. Instances of the framework can be applied to an…

Machine Learning · Computer Science 2007-05-23 Menno M. van Zaanen

Experiments in Linear Template Combination using Genetic Algorithms

Natural Language Generation systems typically have two parts - strategic ('what to say') and tactical ('how to say'). We present our experiments in building an unsupervised corpus-driven template based tactical NLG system. We consider…

Computation and Language · Computer Science 2016-05-25 Nikhilesh Bhatnagar , Radhika Mamidi

Exploiting Sublanguage and Domain Characteristics in a Bootstrapping Approach to Lexicon and Ontology Creation

It is very costly to build up lexical resources and domain ontologies. Especially when confronted with a new application domain lexical gaps and a poor coverage of domain concepts are a problem for the successful exploitation of natural…

Computation and Language · Computer Science 2007-05-23 Dietmar Roesner , Manuela Kunze

Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework

Speech recognition systems for irregularly-spelled languages like English normally require hand-written pronunciations. In this paper, we describe a system for automatically obtaining pronunciations of words for which pronunciations are not…

Computation and Language · Computer Science 2017-06-13 Xiaohui Zhang , Vimal Manohar , Daniel Povey , Sanjeev Khudanpur

Neural paraphrasing by automatically crawled and aligned sentence pairs

Paraphrasing is the task of re-writing an input text using other words, without altering the meaning of the original content. Conversational systems can exploit automatic paraphrasing to make the conversation more natural, e.g., talking…

Computation and Language · Computer Science 2024-02-19 Achille Globo , Antonio Trevisi , Andrea Zugarini , Leonardo Rigutini , Marco Maggini , Stefano Melacci