English
Related papers

Related papers: Reified Context Models

200 papers

To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context. We hypothesize that models perform this integration in a predictable way across different…

Computation and Language · Computer Science 2024-06-18 Kevin Du , Vésteinn Snæbjarnarson , Niklas Stoehr , Jennifer C. White , Aaron Schein , Ryan Cotterell

Retrieval-augmented language models have demonstrated performance comparable to much larger models while requiring fewer computational resources. The effectiveness of these models crucially depends on the overlap between query and retrieved…

Computation and Language · Computer Science 2025-05-21 Ehsan Doostmohammadi , Marco Kuhlmann

Regression models often fail to generalize effectively in regions characterized by highly imbalanced label distributions. Previous methods for deep imbalanced regression rely on gradient-based weight updates, which tend to overfit in…

Machine Learning · Computer Science 2024-11-21 Ismail Nejjar , Faez Ahmed , Olga Fink

Extended sequence generation often leads to degradation in contextual consistency due to the inability of conventional self-attention mechanisms to effectively retain long-range dependencies. Existing approaches, including memory…

Computation and Language · Computer Science 2025-01-30 Jonathan Teel , Jocasta Cumberbatch , Raphael Benington , Quentin Baskerville

Large-scale models trained on broad data have recently become the mainstream architecture in computer vision due to their strong generalization performance. In this paper, the main focus is on an emergent ability in large vision models,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-02 Yuanhan Zhang , Kaiyang Zhou , Ziwei Liu

Lexical inference in context (LIiC) is the task of recognizing textual entailment between two very similar sentences, i.e., sentences that only differ in one expression. It can therefore be seen as a variant of the natural language…

Computation and Language · Computer Science 2021-04-28 Martin Schmitt , Hinrich Schütze

When pre-trained on large unsupervised textual corpora, language models are able to store and retrieve factual knowledge to some extent, making it possible to use them directly for zero-shot cloze-style question answering. However, storing…

Computation and Language · Computer Science 2020-05-12 Fabio Petroni , Patrick Lewis , Aleksandra Piktus , Tim Rocktäschel , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

In-context learning is a surprising and important phenomenon that emerged when modern language models were scaled to billions of learned parameters. Without modifying a large language model's weights, it can be tuned to perform various…

Computation and Language · Computer Science 2023-03-15 Noam Wies , Yoav Levine , Amnon Shashua

Context information around words helps in determining their actual meaning, for example "networks" used in contexts of artificial neural networks or biological neuron networks. Generative topic models infer topic-word distributions, taking…

Information Retrieval · Computer Science 2018-08-14 Pankaj Gupta , Florian Buettner , Hinrich Schütze

Pre-trained language models have been successful on text classification tasks, but are prone to learning spurious correlations from biased datasets, and are thus vulnerable when making inferences in a new domain. Prior work reveals such…

Computation and Language · Computer Science 2022-01-03 Huihan Yao , Ying Chen , Qinyuan Ye , Xisen Jin , Xiang Ren

Self-attention model have shown its flexibility in parallel computation and the effectiveness on modeling both long- and short-term dependencies. However, it calculates the dependencies between representations without considering the…

Computation and Language · Computer Science 2019-02-18 Baosong Yang , Jian Li , Derek Wong , Lidia S. Chao , Xing Wang , Zhaopeng Tu

Increased adaptability of RNN language models leads to improved predictions that benefit many applications. However, current methods do not take full advantage of the RNN structure. We show that the most widely-used approach to adaptation…

Computation and Language · Computer Science 2017-04-24 Aaron Jaech , Mari Ostendorf

We consider the problem of automatically generating longer stories of over two thousand words. Compared to prior work on shorter stories, long-range plot coherence and relevance are more central challenges here. We propose the Recursive…

Computation and Language · Computer Science 2022-10-25 Kevin Yang , Yuandong Tian , Nanyun Peng , Dan Klein

Despite recent progress on conversational systems, they still do not perform smoothly and coherently when faced with ambiguous requests. When questions are unclear, conversational systems should have the ability to ask clarifying questions,…

Information Retrieval · Computer Science 2022-08-10 Negar Arabzadeh , Mahsa Seifikar , Charles L. A. Clarke

We consider problems of making sequences of decisions to accomplish tasks, interacting via the medium of language. These problems are often tackled with reinforcement learning approaches. We find that these models do not generalize well…

Computation and Language · Computer Science 2020-10-07 Xusen Yin , Ralph Weischedel , Jonathan May

With a handful of demonstration examples, large-scale language models show strong capability to perform various tasks by in-context learning from these examples, without any fine-tuning. We demonstrate that in-context learning performance…

Computation and Language · Computer Science 2022-11-10 Yiming Zhang , Shi Feng , Chenhao Tan

Traditional security scanners fail when facing new attack patterns they haven't seen before. They rely on fixed rules and predetermined signatures, making them blind to novel threats. We present a fundamentally different approach: instead…

Cryptography and Security · Computer Science 2025-11-21 Ayush Chaudhary

Query expansion is an effective approach for mitigating vocabulary mismatch between queries and documents in information retrieval. One recent line of research uses language models to generate query-related contexts for expansion. Along…

Computation and Language · Computer Science 2022-10-14 Linqing Liu , Minghan Li , Jimmy Lin , Sebastian Riedel , Pontus Stenetorp

The speed and accuracy with which robots are able to interpret natural language is fundamental to realizing effective human-robot interaction. A great deal of attention has been paid to developing models and approximate inference algorithms…

Robotics · Computer Science 2019-03-25 Siddharth Patki , Andrea F. Daniele , Matthew R. Walter , Thomas M. Howard

In-context learning is a recent paradigm in natural language understanding, where a large pre-trained language model (LM) observes a test instance and a few training examples as its input, and directly decodes the output without any update…

Computation and Language · Computer Science 2022-05-10 Ohad Rubin , Jonathan Herzig , Jonathan Berant
‹ Prev 1 2 3 10 Next ›