Related papers: Learning Efficient Disambiguation

Efficient Algorithms for Parsing the DOP Model

Excellent results have been reported for Data-Oriented Parsing (DOP) of natural language texts (Bod, 1993). Unfortunately, existing algorithms are both computationally intensive and difficult to implement. Previous algorithms are expensive…

cmp-lg · Computer Science 2008-02-03 Joshua Goodman

Combining semantic and syntactic structure for language modeling

Structured language models for speech recognition have been shown to remedy the weaknesses of n-gram models. All current structured language models are, however, limited in that they do not take into account dependencies between…

Computation and Language · Computer Science 2007-05-23 Rens Bod

Aspects of Pattern-Matching in Data-Oriented Parsing

Data-Oriented Parsing (DOP) ranks among the best parsing schemes, pairing state-of-the-art parsing accuracy with the psycholinguistic insight that larger chunks of syntactic structures are relevant grammatical and probabilistic units. Parsing…

Computation and Language · Computer Science 2007-05-23 Guy De Pauw

On Guaranteed Optimal Robust Explanations for NLP Models

We build on abduction-based explanations for machine learning and develop a method for computing local explanations for neural network models in natural language processing (NLP). Our explanations comprise a subset of the words of the…

Artificial Intelligence · Computer Science 2021-10-19 Emanuele La Malfa, Agnieszka Zbrzezny, Rhiannon Michelmore, Nicola Paoletti, Marta Kwiatkowska

Analysis of Fast Structured Dictionary Learning

Sparsity-based models and techniques have been exploited in many signal processing and imaging applications. Data-driven methods based on dictionary and sparsifying transform learning enable learning rich image features from data, and can…

Machine Learning · Computer Science 2019-09-25 Saiprasad Ravishankar, Anna Ma, Deanna Needell

Word sense disambiguation via high order of learning in complex networks

Complex networks have been employed to model many real systems and as a modeling tool in a myriad of applications. In this paper, we apply the framework of complex networks to the problem of supervised classification in the word…

Physics and Society · Physics 2013-02-20 Thiago C. Silva, Diego R. Amancio

Two Questions about Data-Oriented Parsing

In this paper I present ongoing work on the data-oriented parsing (DOP) model. In previous work, DOP was tested on a cleaned-up set of analyzed part-of-speech strings from the Penn Treebank, achieving excellent test results. This left,…

cmp-lg · Computer Science 2008-02-03 Rens Bod

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

The dominating NLP paradigm of training a strong neural predictor to perform one task on a specific dataset has led to state-of-the-art performance in a variety of applications (e.g. sentiment classification, span-prediction based question…

Computation and Language · Computer Science 2021-09-06 Paul Michel

Learning Probabilistic Models of Word Sense Disambiguation

This dissertation presents several new methods of supervised and unsupervised learning of word sense disambiguation models. The supervised methods focus on performing model searches through a space of probabilistic models, and the…

Computation and Language · Computer Science 2009-09-29 Ted Pedersen

Prompt Optimization as a State-Space Search Problem

Language models are extremely susceptible to performance collapse with even small changes to input prompt strings. Libraries such as DSPy (from Stanford NLP) avoid this problem through demonstration-based prompt optimisation. Inspired by…

Computation and Language · Computer Science 2025-11-25 Maanas Taneja

A Systematic Survey of Automatic Prompt Optimization Techniques

Since the advent of large language models (LLMs), prompt engineering has been a crucial step for eliciting desired responses for various Natural Language Processing (NLP) tasks. However, prompt engineering remains an impediment for end…

Computation and Language · Computer Science 2025-12-23 Kiran Ramnath, Kang Zhou, Sheng Guan, Soumya Smruti Mishra, Xuan Qi, Zhengyuan Shen, Shuai Wang, Sangmin Woo, Sullam Jeoung, Yawei Wang, Haozhu Wang, Han Ding, Yuzhe Lu, Zhichao Xu, Yun Zhou, Balasubramaniam Srinivasan, Qiaojing Yan, Yueyan Chen, Haibo Ding, Panpan Xu, Lin Lee Cheong

Robust Processing of Natural Language

Previous approaches to robustness in natural language processing usually treat deviant input by relaxing grammatical constraints whenever a successful analysis cannot be provided by "normal" means. This schema implies that error…

cmp-lg · Computer Science 2016-08-31 Wolfgang Menzel

Improving Data and Parameter Efficiency of Neural Language Models Using Representation Analysis

This thesis addresses challenges related to data and parameter efficiency in neural language models, with a focus on representation analysis and the introduction of new optimization techniques. The first part examines the properties and…

Computation and Language · Computer Science 2025-07-17 Josip Jukić

Developing a hybrid NP parser

We describe the use of energy function optimization in very shallow syntactic parsing. The approach can use linguistic rules and corpus-based statistics, so the strengths of both linguistic and statistical approaches to NLP can be combined…

cmp-lg · Computer Science 2008-02-03 Atro Voutilainen, Lluis Padro

Explanation-Based Learning of Data Oriented Parsing

This paper presents a new view of Explanation-Based Learning (EBL) of natural language parsing. Rather than employing EBL for specializing parsers by inferring new ones, this paper suggests employing EBL for learning how to reduce ambiguity…

cmp-lg · Computer Science 2008-02-03 Khalil Sima'an

Domain Specific Fine-tuning of Denoising Sequence-to-Sequence Models for Natural Language Summarization

Summarization of long-form text data is a problem especially pertinent in knowledge economy jobs such as medicine and finance, that require continuously remaining informed on a sophisticated and evolving body of knowledge. As such,…

Computation and Language · Computer Science 2022-04-22 Brydon Parker, Alik Sokolov, Mahtab Ahmed, Matt Kalebic, Sedef Akinli Kocak, Ofer Shai

Data-Oriented Language Processing. An Overview

During the last few years, a new approach to language processing has started to emerge, which has become known under various labels such as "data-oriented parsing", "corpus-based interpretation", and "tree-bank grammar" (cf. van den Berg et…

cmp-lg · Computer Science 2008-02-03 Rens Bod, Remko Scha

Understanding Likelihood Over-optimisation in Direct Alignment Algorithms

Direct Alignment Algorithms (DAAs), such as Direct Preference Optimisation (DPO) and Identity Preference Optimisation (IPO), have emerged as alternatives to online Reinforcement Learning from Human Feedback (RLHF) algorithms such as…

Computation and Language · Computer Science 2024-10-21 Zhengyan Shi, Sander Land, Acyr Locatelli, Matthieu Geist, Max Bartolo

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization

Transfer learning has fundamentally changed the landscape of natural language processing (NLP) research. Many existing state-of-the-art models are first pre-trained on a large text corpus and then fine-tuned on downstream tasks. However,…

Computation and Language · Computer Science 2021-09-10 Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Tuo Zhao

sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings

Neural word representations have proven useful in Natural Language Processing (NLP) tasks due to their ability to efficiently model complex semantic and syntactic word relationships. However, most techniques model only one representation…

Computation and Language · Computer Science 2015-11-23 Andrew Trask, Phil Michalak, John Liu